LUP Student Papers

LUND UNIVERSITY LIBRARIES

3D Privacy Masking using Monocular Depth Estimation

Lundström, Mattias and Pettersson, Jakob (2022)
Department of Automatic Control
Abstract
This thesis takes a closer look at Monocular Depth Estimation, the task of approximating distance information from a single image using deep neural networks. It presents a thorough evaluation and analysis of state-of-the-art depth estimation models with respect to proposed aspects of relevance for downstream video applications, specifically in the surveillance domain. To this end, three custom data sets were collected, two of which include ground truth depth data. Results on accuracy, temporal inconsistency, and range resolution are presented and analysed using the collected data sets and selected metrics. It is concluded that the accuracy of the models, although impressive, is highly scene dependent. Temporal inconsistency, which causes apparent video instability, is concluded to be a prominent concern for typical downstream video applications and calls for further attention. This leads to a proposed minor post-processing step, with promising results.

Furthermore, this thesis presents a novel end-to-end algorithm referred to as "3D Privacy Masking". Privacy masking is a typical task in camera surveillance, where a certain region of the image scene needs to be anonymised. This functionality is extended here by including depth, such as that from monocular depth estimation, resulting in a depth-aware privacy mask. Events in front of the mask, as seen by the camera, thereby remain observable. The suggested algorithm and proof-of-concept application also include a stabilising technique to account for imperfect depth data. In conclusion, this thesis showcases the potential of monocular depth estimation in downstream computer vision tasks, such as 3D privacy masking, and proposes directions for future work.
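
The idea of a depth-aware privacy mask can be illustrated with a minimal sketch. It does not reproduce the thesis's algorithm or its stabilising technique; it only assumes a per-pixel depth map in which larger values mean farther from the camera, a 2D boolean mask region, and a single mask distance. The names `apply_depth_aware_mask`, `region`, `mask_depth`, and `fill` are hypothetical and chosen for illustration only.

```python
from typing import List

Image = List[List[int]]        # grayscale pixel values, row-major
DepthMap = List[List[float]]   # assumed: larger value = farther from camera
Region = List[List[bool]]      # True where the 2D privacy mask applies


def apply_depth_aware_mask(
    image: Image,
    depth: DepthMap,
    region: Region,
    mask_depth: float,
    fill: int = 0,
) -> Image:
    """Hide pixels inside `region` unless they are closer than `mask_depth`."""
    out = [row[:] for row in image]  # copy so the input frame is untouched
    for y, row in enumerate(image):
        for x in range(len(row)):
            # A pixel is hidden only if it lies in the masked region AND is
            # at or behind the mask distance; nearer objects stay visible.
            if region[y][x] and depth[y][x] >= mask_depth:
                out[y][x] = fill
    return out


# Toy 2x3 frame: two pixels inside the region are nearer than the mask.
image = [[10, 20, 30], [40, 50, 60]]
depth = [[8.0, 2.0, 8.0], [8.0, 8.0, 1.5]]
region = [[True, True, False], [True, True, True]]

masked = apply_depth_aware_mask(image, depth, region, mask_depth=5.0)
print(masked)
```

In this toy frame, the pixels with depths 2.0 and 1.5 lie in front of the mask distance and are left visible, while the other pixels inside the region are filled. A practical system would also have to cope with noisy, frame-to-frame varying depth estimates, which is the concern the stabilising technique mentioned above addresses.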
author: Lundström, Mattias and Pettersson, Jakob
year: 2022
type: H3 - Professional qualifications (4 Years - )
report number: TFRT-6168
ISSN: 0280-5316
language: English
id: 9095030
date added to LUP: 2022-08-12 09:53:27
date last changed: 2022-08-12 09:53:27
@misc{9095030,
  abstract     = {{This thesis takes a closer look at Monocular Depth Estimation, the task of approximating distance information from a single image using deep neural networks. It presents a thorough evaluation and analysis of state-of-the-art depth estimation models with respect to proposed aspects of relevance for downstream video applications, specifically in the surveillance domain. To this end, three custom data sets were collected, two of which include ground truth depth data. Results on accuracy, temporal inconsistency, and range resolution are presented and analysed using the collected data sets and selected metrics. It is concluded that the accuracy of the models, although impressive, is highly scene dependent. Temporal inconsistency, which causes apparent video instability, is concluded to be a prominent concern for typical downstream video applications and calls for further attention. This leads to a proposed minor post-processing step, with promising results.

Furthermore, this thesis presents a novel end-to-end algorithm referred to as ``3D Privacy Masking''. Privacy masking is a typical task in camera surveillance, where a certain region of the image scene needs to be anonymised. This functionality is extended here by including depth, such as that from monocular depth estimation, resulting in a depth-aware privacy mask. Events in front of the mask, as seen by the camera, thereby remain observable. The suggested algorithm and proof-of-concept application also include a stabilising technique to account for imperfect depth data. In conclusion, this thesis showcases the potential of monocular depth estimation in downstream computer vision tasks, such as 3D privacy masking, and proposes directions for future work.}},
  author       = {{Lundström, Mattias and Pettersson, Jakob}},
  issn         = {{0280-5316}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{3D Privacy Masking using Monocular Depth Estimation}},
  year         = {{2022}},
}