The task of predicting a dense depth map from a single monocular RGB image, commonly known as single-image depth estimation (SIDE) or monocular depth estimation (MDE), has been an active research topic in computer vision for decades. With the significant progress of deep models in recent years, new standards have been set, yielding remarkable results in capturing the 3D structure of a scene from a single image. However, established evaluation schemes for predicted depth maps remain limited, as they only consider global statistics of the depth residuals. To enable a geometry-aware analysis, we propose a set of novel quality criteria addressing the preservation of depth discontinuities and planar regions, the depth consistency across the image, and a distance-related assessment. As current datasets do not fulfill the requirements of all proposed error metrics, we provide a new high-quality indoor RGB-D test dataset, acquired with a digital single-lens reflex (DSLR) camera together with a laser scanner. Applying the proposed error metrics to this reference dataset unveils new insights into the performance of current state-of-the-art SIDE approaches, as well as subtle differences among them. In addition, a series of experiments on the real-world applicability of SIDE methods, covering different image augmentations, illumination changes, and textured planar regions, reveals current limitations in this research field.
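To make the contrast concrete, the "global statistics of the depth residuals" that established evaluation schemes rely on are image-wide aggregates such as the absolute relative error, the root mean squared error, and the threshold accuracy. The sketch below computes these commonly used measures; the function name and the validity-mask convention are illustrative assumptions, not taken from the paper, and the geometry-aware criteria proposed in the abstract go beyond such aggregates.

```python
import numpy as np

def global_depth_errors(pred, gt, eps=1e-8):
    """Standard global depth-residual statistics, computed only over
    pixels with valid ground-truth depth. Illustrative sketch; the
    metric definitions follow common SIDE evaluation practice."""
    mask = gt > eps                        # ignore invalid/missing depth
    p, g = pred[mask], gt[mask]
    abs_rel = np.mean(np.abs(p - g) / g)   # absolute relative error
    rmse = np.sqrt(np.mean((p - g) ** 2))  # root mean squared error
    ratio = np.maximum(p / g, g / p)
    delta1 = np.mean(ratio < 1.25)         # threshold accuracy: delta < 1.25
    return {"abs_rel": abs_rel, "rmse": rmse, "delta1": delta1}
```

Because every pixel contributes equally to these averages, a prediction can score well globally while still blurring depth discontinuities or bending planar surfaces, which is precisely the gap the proposed geometry-aware criteria address.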
ASJC Scopus subject areas
- Signal Processing
- Computer Vision and Pattern Recognition