End-to-End Training of Hybrid CNN-CRF Models for Stereo

Patrick Knöbelreiter; Christian Reinbacher; Alexander Shekhovtsov; Thomas Pock

Institute of Computer Graphics and Vision (7100)

Research output: Working paper › Preprint

Abstract

We propose a novel method for stereo estimation that combines the advantages of convolutional neural networks (CNNs) and optimization-based approaches. The optimization, posed as a conditional random field (CRF), takes local matching costs and consistency-enforcing (smoothness) costs as inputs, both estimated by CNN blocks. To perform inference in the CRF, we use an approach based on linear programming relaxation with a fixed number of iterations. We address the challenging problem of training this hybrid model end-to-end and show that training is practically feasible in the discriminative formulation (structured support vector machine). The trained hybrid model with shallow CNNs is comparable to state-of-the-art deep models in both time and performance. The optimization part efficiently replaces the sophisticated, commonly applied post-processing steps, which are not jointly trainable, with a trainable, well-understood model.
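The ingredients named in the abstract (unary matching costs, pairwise smoothness costs, energy minimization, and a structured SVM loss) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the CRF is a 1-D chain rather than an image grid, the costs are hand-written rather than produced by CNN blocks, exact dynamic programming stands in for the LP-relaxation inference, and the task loss is a plain Hamming loss. All function names are hypothetical.

```python
# Illustrative sketch only: a 1-D chain CRF over disparity labels.
# Assumptions: hand-written costs (no CNN), exact dynamic programming
# instead of LP-relaxation inference, Hamming task loss.

def truncated_linear(a, b, weight=1.0, cap=2.0):
    """Pairwise smoothness cost between neighbouring disparity labels."""
    return weight * min(abs(a - b), cap)

def crf_energy(labels, unary, pairwise):
    """Sum of unary matching costs and pairwise smoothness costs on a chain."""
    energy = sum(unary[i][d] for i, d in enumerate(labels))
    energy += sum(pairwise(labels[i], labels[i + 1])
                  for i in range(len(labels) - 1))
    return energy

def chain_map(unary, pairwise):
    """Exact minimum-energy labeling of a chain via dynamic programming."""
    num_labels = len(unary[0])
    cost = list(unary[0])          # best energy of a prefix ending in each label
    back = []                      # back-pointers, one list per later node
    for i in range(1, len(unary)):
        new_cost, ptr = [], []
        for d in range(num_labels):
            best = min(range(num_labels),
                       key=lambda p: cost[p] + pairwise(p, d))
            new_cost.append(cost[best] + pairwise(best, d) + unary[i][d])
            ptr.append(best)
        cost = new_cost
        back.append(ptr)
    d = min(range(num_labels), key=lambda p: cost[p])
    labels = [d]
    for ptr in reversed(back):     # trace the optimal path backwards
        d = ptr[d]
        labels.append(d)
    return labels[::-1]

def structured_hinge(unary, pairwise, truth):
    """Margin-rescaled structured SVM loss with a Hamming task loss."""
    # Loss-augmented inference: lower the cost of every wrong label by 1,
    # so minimizing finds the labeling that most violates the margin.
    augmented = [[c - (0.0 if d == truth[i] else 1.0)
                  for d, c in enumerate(row)]
                 for i, row in enumerate(unary)]
    violator = chain_map(augmented, pairwise)
    hamming = sum(a != b for a, b in zip(violator, truth))
    return max(0.0, crf_energy(truth, unary, pairwise)
               - crf_energy(violator, unary, pairwise) + hamming)

# Four pixels, three disparity labels; pixel 2 has a noisy matching cost.
unary = [[0, 3, 3], [0, 3, 3], [1, 0.5, 3], [0, 3, 3]]
disparities = chain_map(unary, truncated_linear)
```

In this toy input, the smoothness term overrides the noisy matching cost at the third pixel, which is the role the abstract assigns to the CRF. In an end-to-end setting, a subgradient of a loss like `structured_hinge` with respect to the cost tables would be propagated back into the networks that produce them.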

Original language	English
Publication status	Published - 30 Nov 2016

Keywords

cs.CV

Access to Document

1611.10229v1 (Submitted manuscript, 9.24 MB), Licence: CC BY 4.0

https://arxiv.org/abs/1611.10229v1 (Licence: CC BY 4.0)

End-to-End Training of Hybrid CNN-CRF Models for Stereo
Patrick Knöbelreiter (Speaker)
21 Jul 2017
Activity: Talk or presentation › Poster presentation › Science to science

Cite this

@techreport{9031cfdd54654eea9d0ab5b18ade286c,

title = "End-to-End Training of Hybrid CNN-CRF Models for Stereo",

abstract = "We propose a novel method for stereo estimation, combining advantages of convolutional neural networks (CNNs) and optimization-based approaches. The optimization, posed as a conditional random field (CRF), takes local matching costs and consistency-enforcing (smoothness) costs as inputs, both estimated by CNN blocks. To perform the inference in the CRF we use an approach based on linear programming relaxation with a fixed number of iterations. We address the challenging problem of training this hybrid model end-to-end. We show that in the discriminative formulation (structured support vector machine) the training is practically feasible. The trained hybrid model with shallow CNNs is comparable to state-of-the-art deep models in both time and performance. The optimization part efficiently replaces sophisticated and not jointly trainable (but commonly applied) post-processing steps by a trainable, well-understood model.",

keywords = "cs.CV",

author = "Patrick Kn{\"o}belreiter and Christian Reinbacher and Alexander Shekhovtsov and Thomas Pock",

year = "2016",

month = nov,

day = "30",

language = "English",

type = "WorkingPaper",

}

TY - UNPB

T1 - End-to-End Training of Hybrid CNN-CRF Models for Stereo

AU - Knöbelreiter, Patrick

AU - Reinbacher, Christian

AU - Shekhovtsov, Alexander

AU - Pock, Thomas

PY - 2016/11/30

Y1 - 2016/11/30

N2 - We propose a novel method for stereo estimation, combining advantages of convolutional neural networks (CNNs) and optimization-based approaches. The optimization, posed as a conditional random field (CRF), takes local matching costs and consistency-enforcing (smoothness) costs as inputs, both estimated by CNN blocks. To perform the inference in the CRF we use an approach based on linear programming relaxation with a fixed number of iterations. We address the challenging problem of training this hybrid model end-to-end. We show that in the discriminative formulation (structured support vector machine) the training is practically feasible. The trained hybrid model with shallow CNNs is comparable to state-of-the-art deep models in both time and performance. The optimization part efficiently replaces sophisticated and not jointly trainable (but commonly applied) post-processing steps by a trainable, well-understood model.

KW - cs.CV

M3 - Preprint

BT - End-to-End Training of Hybrid CNN-CRF Models for Stereo

ER -
