Learned Collaborative Stereo Refinement

Patrick Knöbelreiter; Thomas Pock

doi:10.1007/s11263-021-01485-5

Learned Collaborative Stereo Refinement

Patrick Knöbelreiter^*, Thomas Pock

^*Corresponding author for this work

Institute of Computer Graphics and Vision (7100)

Research output: Contribution to journal › Article › peer-review

Abstract

In this work, we propose a learning-based method to denoise and refine disparity maps. The proposed variational network arises naturally from unrolling the iterates of a proximal gradient method applied to a variational energy defined in a joint disparity, color, and confidence image space. Our method allows to learn a robust collaborative regularizer leveraging the joint statistics of the color image, the confidence map and the disparity map. Due to the variational structure of our method, the individual steps can be easily visualized, thus enabling interpretability of the method. We can therefore provide interesting insights into how our method refines and denoises disparity maps. To this end, we can visualize and interpret the learned filters and activation functions and prove the increased reliability of the predicted pixel-wise confidence maps. Furthermore, the optimization based structure of our refinement module allows us to compute eigen disparity maps, which reveal structural properties of our refinement module. The efficiency of our method is demonstrated on the publicly available stereo benchmarks Middlebury 2014 and Kitti 2015.

Original language	English
Pages (from-to)	2565-2582
Number of pages	18
Journal	International Journal of Computer Vision
Volume	129
Issue number	9
DOIs	https://doi.org/10.1007/s11263-021-01485-5
Publication status	Published - Sept 2021

Keywords

Deep learning
Interpretable AI
Optimization
Refinement
Stereo

ASJC Scopus subject areas

Software
Computer Vision and Pattern Recognition
Artificial Intelligence

Access to Document

10.1007/s11263-021-01485-5Licence: CC BY 4.0

Cite this

@article{3e1d517f94dd43288d9fdf1248a947b2,

title = "Learned Collaborative Stereo Refinement",

abstract = "In this work, we propose a learning-based method to denoise and refine disparity maps. The proposed variational network arises naturally from unrolling the iterates of a proximal gradient method applied to a variational energy defined in a joint disparity, color, and confidence image space. Our method allows to learn a robust collaborative regularizer leveraging the joint statistics of the color image, the confidence map and the disparity map. Due to the variational structure of our method, the individual steps can be easily visualized, thus enabling interpretability of the method. We can therefore provide interesting insights into how our method refines and denoises disparity maps. To this end, we can visualize and interpret the learned filters and activation functions and prove the increased reliability of the predicted pixel-wise confidence maps. Furthermore, the optimization based structure of our refinement module allows us to compute eigen disparity maps, which reveal structural properties of our refinement module. The efficiency of our method is demonstrated on the publicly available stereo benchmarks Middlebury 2014 and Kitti 2015.",

keywords = "Deep learning, Interpretable AI, Optimization, Refinement, Stereo",

author = "Patrick Kn{\"o}belreiter and Thomas Pock",

note = "Publisher Copyright: {\textcopyright} 2021, The Author(s).",

year = "2021",

month = sep,

doi = "10.1007/s11263-021-01485-5",

language = "English",

volume = "129",

pages = "2565--2582",

journal = "International Journal of Computer Vision",

issn = "0920-5691",

publisher = "Springer Vieweg",

number = "9",

}

TY - JOUR

T1 - Learned Collaborative Stereo Refinement

AU - Knöbelreiter, Patrick

AU - Pock, Thomas

PY - 2021/9

Y1 - 2021/9

N2 - In this work, we propose a learning-based method to denoise and refine disparity maps. The proposed variational network arises naturally from unrolling the iterates of a proximal gradient method applied to a variational energy defined in a joint disparity, color, and confidence image space. Our method allows to learn a robust collaborative regularizer leveraging the joint statistics of the color image, the confidence map and the disparity map. Due to the variational structure of our method, the individual steps can be easily visualized, thus enabling interpretability of the method. We can therefore provide interesting insights into how our method refines and denoises disparity maps. To this end, we can visualize and interpret the learned filters and activation functions and prove the increased reliability of the predicted pixel-wise confidence maps. Furthermore, the optimization based structure of our refinement module allows us to compute eigen disparity maps, which reveal structural properties of our refinement module. The efficiency of our method is demonstrated on the publicly available stereo benchmarks Middlebury 2014 and Kitti 2015.

AB - In this work, we propose a learning-based method to denoise and refine disparity maps. The proposed variational network arises naturally from unrolling the iterates of a proximal gradient method applied to a variational energy defined in a joint disparity, color, and confidence image space. Our method allows to learn a robust collaborative regularizer leveraging the joint statistics of the color image, the confidence map and the disparity map. Due to the variational structure of our method, the individual steps can be easily visualized, thus enabling interpretability of the method. We can therefore provide interesting insights into how our method refines and denoises disparity maps. To this end, we can visualize and interpret the learned filters and activation functions and prove the increased reliability of the predicted pixel-wise confidence maps. Furthermore, the optimization based structure of our refinement module allows us to compute eigen disparity maps, which reveal structural properties of our refinement module. The efficiency of our method is demonstrated on the publicly available stereo benchmarks Middlebury 2014 and Kitti 2015.

KW - Deep learning

KW - Interpretable AI

KW - Optimization

KW - Refinement

KW - Stereo

UR - http://www.scopus.com/inward/record.url?scp=85108292037&partnerID=8YFLogxK

U2 - 10.1007/s11263-021-01485-5

DO - 10.1007/s11263-021-01485-5

M3 - Article

AN - SCOPUS:85108292037

SN - 0920-5691

VL - 129

SP - 2565

EP - 2582

JO - International Journal of Computer Vision

JF - International Journal of Computer Vision

IS - 9

ER -

Learned Collaborative Stereo Refinement

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this