Abstract
Current methods for depth map prediction from monocular images tend to predict smooth, poorly localized contours for the occlusion boundaries in the input image. This is unfortunate, as occlusion boundaries are important cues for recognizing objects and, as we show, may lead to a way to discover new objects from scene reconstruction. To improve predicted depth maps, recent methods rely on various forms of filtering or predict an additive residual depth map to refine a first estimate. We instead learn to predict, given a depth map produced by some reconstruction method, a 2D displacement field able to re-sample pixels around the occlusion boundaries into sharper reconstructions. Our method can be applied to the output of any depth estimation method and is fully differentiable, enabling end-to-end training. For evaluation, we manually annotated the occlusion boundaries in all the images in the test split of the popular NYUv2-Depth dataset. We show that our approach improves the localization of occlusion boundaries for all state-of-the-art monocular depth estimation methods that we could evaluate ([32, 10, 6, 28]), without degrading the depth accuracy for the rest of the images.
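The core operation described above, re-sampling a depth map along a predicted per-pixel 2D displacement field, can be sketched as follows. This is a minimal NumPy illustration, not the paper's implementation: the function name `resample_depth` and the array layouts are assumptions, and in practice the bilinear sampling would be done with a differentiable op such as `torch.nn.functional.grid_sample` so the refinement network can be trained end-to-end.

```python
import numpy as np

def resample_depth(depth, disp):
    """Re-sample a depth map along a per-pixel 2D displacement field.

    depth: (H, W) array of depth values.
    disp:  (H, W, 2) array of (dx, dy) offsets in pixels; each output
           pixel reads the input depth at (x + dx, y + dy).

    Bilinear interpolation keeps the sampling differentiable when
    re-implemented in an autodiff framework (illustrative sketch only).
    """
    H, W = depth.shape
    ys, xs = np.mgrid[0:H, 0:W].astype(np.float64)
    # Displaced sampling coordinates, clamped to the image bounds.
    x = np.clip(xs + disp[..., 0], 0, W - 1)
    y = np.clip(ys + disp[..., 1], 0, H - 1)
    # Bilinear interpolation from the four neighboring pixels.
    x0, y0 = np.floor(x).astype(int), np.floor(y).astype(int)
    x1, y1 = np.minimum(x0 + 1, W - 1), np.minimum(y0 + 1, H - 1)
    wx, wy = x - x0, y - y0
    top = depth[y0, x0] * (1 - wx) + depth[y0, x1] * wx
    bot = depth[y1, x0] * (1 - wx) + depth[y1, x1] * wx
    return top * (1 - wy) + bot * wy
```

Intuitively, near an occlusion boundary the predicted displacements point away from the blurred contour, so pixels are pulled from the correct side of the edge and the smooth depth transition is replaced by a sharp step.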
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition |
| Pages | 14636-14645 |
| Number of pages | 10 |
| DOIs | |
| Publication status | Published - 5 Aug 2020 |
| Externally published | Yes |
| Event | 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition: CVPR 2020 - Virtual, United States. Duration: 14 Jun 2020 → 19 Jun 2020 |
Conference

| Conference | 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition |
|---|---|
| Abbreviated title | CVPR 2020 |
| Country/Territory | United States |
| City | Virtual |
| Period | 14/06/20 → 19/06/20 |
ASJC Scopus subject areas
- Software
- Computer Vision and Pattern Recognition