Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild

Alexander Grabner; Yaming Wang; Peizhao Zhang; Peihong Guo; Tong Xiao; Peter Vajda; Peter M. Roth; Vincent Lepetit

doi:10.1007/978-3-030-58517-4_7

Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild

Alexander Grabner^*, Yaming Wang, Peizhao Zhang, Peihong Guo, Tong Xiao, Peter Vajda, Peter M. Roth, Vincent Lepetit

^*Korrespondierende/r Autor/-in für diese Arbeit

Institut für Maschinelles Sehen und Darstellen (7100)

Publikation: Beitrag in Buch/Bericht/Konferenzband › Beitrag in einem Konferenzband › Begutachtung

Abstract

We present a novel 3D pose refinement approach based ondifferentiable rendering for objects of arbitrary categories in the wild. Incontrast to previous methods, we make two main contributions: First,instead of comparing real-world images and synthetic renderings in theRGB or mask space, we compare them in a feature space optimized for3D pose refinement. Second, we introduce a novel differentiable rendererthat learns to approximate the rasterization backward pass from data in-stead of relying on a hand-crafted algorithm. For this purpose, we predictdeep cross-domain correspondences between RGB images and 3D modelrenderings in the form of what we call geometric correspondence fields.These correspondence fields serve as pixel-level gradients which are ana-lytically propagated backward through the rendering pipeline to performa gradient-based optimization directly on the 3D pose. In this way, weprecisely align 3D models to objects in RGB images which results in sig-nificantly improved 3D pose estimates. We evaluate our approach on thechallenging Pix3D dataset and achieve up to 55% relative improvementcompared to state-of-the-art refinement methods in multiple metrics.

Originalsprache	englisch
Titel	Computer Vision – ECCV 2020
Erscheinungsort	Cham
Herausgeber (Verlag)	Springer
Seiten	102-119
Seitenumfang	14
ISBN (Print)	978-3-030-58516-7
DOIs	https://doi.org/10.1007/978-3-030-58517-4_7
Publikationsstatus	Veröffentlicht - 2020
Veranstaltung	16th European Conference on Computer Vision: ECCV 2020 - Virtual, Glasgow, Großbritannien / Vereinigtes Königreich Dauer: 23 Aug. 2020 → 28 Aug. 2020

Publikationsreihe

Name	Lecture Notes in Computer Science
Band	12361

Konferenz

Konferenz	16th European Conference on Computer Vision
Kurztitel	ECCV 2020
Land/Gebiet	Großbritannien / Vereinigtes Königreich
Ort	Virtual, Glasgow
Zeitraum	23/08/20 → 28/08/20

Zugriff auf Dokument

10.1007/978-3-030-58517-4_7

Dieses zitieren

Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild. / Grabner, Alexander; Wang, Yaming; Zhang, Peizhao et al.
Computer Vision – ECCV 2020. Cham: Springer, 2020. S. 102-119 (Lecture Notes in Computer Science; Band 12361).

Publikation: Beitrag in Buch/Bericht/Konferenzband › Beitrag in einem Konferenzband › Begutachtung

Grabner, A, Wang, Y, Zhang, P, Guo, P, Xiao, T, Vajda, P, Roth, PM & Lepetit, V 2020, Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild. in Computer Vision – ECCV 2020. Lecture Notes in Computer Science, Bd. 12361, Springer, Cham, S. 102-119, 16th European Conference on Computer Vision, Virtual, Glasgow, Großbritannien / Vereinigtes Königreich, 23/08/20. https://doi.org/10.1007/978-3-030-58517-4_7

@inproceedings{48a9c9f97cfa4882817aa79a9e7f6ec1,

title = "Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild",

abstract = "We present a novel 3D pose refinement approach based ondifferentiable rendering for objects of arbitrary categories in the wild. Incontrast to previous methods, we make two main contributions: First,instead of comparing real-world images and synthetic renderings in theRGB or mask space, we compare them in a feature space optimized for3D pose refinement. Second, we introduce a novel differentiable rendererthat learns to approximate the rasterization backward pass from data in-stead of relying on a hand-crafted algorithm. For this purpose, we predictdeep cross-domain correspondences between RGB images and 3D modelrenderings in the form of what we call geometric correspondence fields.These correspondence fields serve as pixel-level gradients which are ana-lytically propagated backward through the rendering pipeline to performa gradient-based optimization directly on the 3D pose. In this way, weprecisely align 3D models to objects in RGB images which results in sig-nificantly improved 3D pose estimates. We evaluate our approach on thechallenging Pix3D dataset and achieve up to 55% relative improvementcompared to state-of-the-art refinement methods in multiple metrics.",

author = "Alexander Grabner and Yaming Wang and Peizhao Zhang and Peihong Guo and Tong Xiao and Peter Vajda and Roth, {Peter M.} and Vincent Lepetit",

year = "2020",

doi = "10.1007/978-3-030-58517-4_7",

language = "English",

isbn = "978-3-030-58516-7",

series = "Lecture Notes in Computer Science",

publisher = "Springer",

pages = "102--119",

booktitle = "Computer Vision – ECCV 2020",

note = "16th European Conference on Computer Vision : ECCV 2020, ECCV 2020 ; Conference date: 23-08-2020 Through 28-08-2020",

}

TY - GEN

T1 - Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild

AU - Grabner, Alexander

AU - Wang, Yaming

AU - Zhang, Peizhao

AU - Guo, Peihong

AU - Xiao, Tong

AU - Vajda, Peter

AU - Roth, Peter M.

AU - Lepetit, Vincent

PY - 2020

Y1 - 2020

N2 - We present a novel 3D pose refinement approach based ondifferentiable rendering for objects of arbitrary categories in the wild. Incontrast to previous methods, we make two main contributions: First,instead of comparing real-world images and synthetic renderings in theRGB or mask space, we compare them in a feature space optimized for3D pose refinement. Second, we introduce a novel differentiable rendererthat learns to approximate the rasterization backward pass from data in-stead of relying on a hand-crafted algorithm. For this purpose, we predictdeep cross-domain correspondences between RGB images and 3D modelrenderings in the form of what we call geometric correspondence fields.These correspondence fields serve as pixel-level gradients which are ana-lytically propagated backward through the rendering pipeline to performa gradient-based optimization directly on the 3D pose. In this way, weprecisely align 3D models to objects in RGB images which results in sig-nificantly improved 3D pose estimates. We evaluate our approach on thechallenging Pix3D dataset and achieve up to 55% relative improvementcompared to state-of-the-art refinement methods in multiple metrics.

AB - We present a novel 3D pose refinement approach based ondifferentiable rendering for objects of arbitrary categories in the wild. Incontrast to previous methods, we make two main contributions: First,instead of comparing real-world images and synthetic renderings in theRGB or mask space, we compare them in a feature space optimized for3D pose refinement. Second, we introduce a novel differentiable rendererthat learns to approximate the rasterization backward pass from data in-stead of relying on a hand-crafted algorithm. For this purpose, we predictdeep cross-domain correspondences between RGB images and 3D modelrenderings in the form of what we call geometric correspondence fields.These correspondence fields serve as pixel-level gradients which are ana-lytically propagated backward through the rendering pipeline to performa gradient-based optimization directly on the 3D pose. In this way, weprecisely align 3D models to objects in RGB images which results in sig-nificantly improved 3D pose estimates. We evaluate our approach on thechallenging Pix3D dataset and achieve up to 55% relative improvementcompared to state-of-the-art refinement methods in multiple metrics.

U2 - 10.1007/978-3-030-58517-4_7

DO - 10.1007/978-3-030-58517-4_7

M3 - Conference paper

SN - 978-3-030-58516-7

T3 - Lecture Notes in Computer Science

SP - 102

EP - 119

BT - Computer Vision – ECCV 2020

PB - Springer

CY - Cham

T2 - 16th European Conference on Computer Vision

Y2 - 23 August 2020 through 28 August 2020

ER -

Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild

Abstract

Publikationsreihe

Konferenz

Zugriff auf Dokument

Fingerprint

Dieses zitieren