Casting Geometric Constraints in Semantic Segmentation as Semi-Supervised Learning

Sinisa Stekovic; Friedrich Fraundorfer; Vincent Lepetit

doi:10.1109/WACV45572.2020.9093571

Casting Geometric Constraints in Semantic Segmentation as Semi-Supervised Learning

Sinisa Stekovic, Friedrich Fraundorfer, Vincent Lepetit

Institut für Maschinelles Sehen und Darstellen (7100)

Publikation: Beitrag in Buch/Bericht/Konferenzband › Beitrag in einem Konferenzband › Begutachtung

Abstract

We propose a simple yet effective method to learn to segment new indoor scenes from video frames: State-of- the-art methods trained on one dataset, even as large as the SUNRGB-D dataset, can perform poorly when applied to images that are not part of the dataset, because of the dataset bias, a common phenomenon in computer vision. To make semantic segmentation more useful in practice, one can exploit geometric constraints. Our main contribution is to show that these constraints can be cast conveniently as semi-supervised terms, which enforce the fact that the same class should be predicted for the projections of the same 3D location in different images. This is interesting as we can exploit general existing techniques de- veloped for semi-supervised learning to efficiently incorporate the constraints. We show that this approach can efficiently and accurately learn to segment target sequences of ScanNet and our own target sequences using only annotations from SUNRGB-D, and geometric relations between the video frames of target sequences.

Originalsprache	englisch
Titel	Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020
Seiten	1843-1852
Seitenumfang	10
ISBN (elektronisch)	9781728165530
DOIs	https://doi.org/10.1109/WACV45572.2020.9093571
Publikationsstatus	Veröffentlicht - März 2020
Veranstaltung	2020 IEEE/CVF Winter Conference on Applications of Computer Vision: WACV 2020 - Snowmass Village, USA / Vereinigte Staaten Dauer: 1 März 2020 → 5 März 2020

Konferenz

Konferenz	2020 IEEE/CVF Winter Conference on Applications of Computer Vision
Kurztitel	WACV 2020
Land/Gebiet	USA / Vereinigte Staaten
Ort	Snowmass Village
Zeitraum	1/03/20 → 5/03/20

ASJC Scopus subject areas

Maschinelles Sehen und Mustererkennung
Angewandte Informatik

Zugriff auf Dokument

10.1109/WACV45572.2020.9093571

Andere Dateien und Links

http://www.scopus.com/inward/record.url?scp=85085484647&partnerID=8YFLogxK

Dieses zitieren

Stekovic, S , Fraundorfer, F & Lepetit, V 2020, Casting Geometric Constraints in Semantic Segmentation as Semi-Supervised Learning. in Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020., 9093571, S. 1843-1852, 2020 IEEE/CVF Winter Conference on Applications of Computer Vision, Snowmass Village, Colorado, USA / Vereinigte Staaten, 1/03/20. https://doi.org/10.1109/WACV45572.2020.9093571

@inproceedings{bed0cf16654d44bf99f7e39b447cc577,

title = "Casting Geometric Constraints in Semantic Segmentation as Semi-Supervised Learning",

abstract = "We propose a simple yet effective method to learn to segment new indoor scenes from video frames: State-of- the-art methods trained on one dataset, even as large as the SUNRGB-D dataset, can perform poorly when applied to images that are not part of the dataset, because of the dataset bias, a common phenomenon in computer vision. To make semantic segmentation more useful in practice, one can exploit geometric constraints. Our main contribution is to show that these constraints can be cast conveniently as semi-supervised terms, which enforce the fact that the same class should be predicted for the projections of the same 3D location in different images. This is interesting as we can exploit general existing techniques de- veloped for semi-supervised learning to efficiently incorporate the constraints. We show that this approach can efficiently and accurately learn to segment target sequences of ScanNet and our own target sequences using only annotations from SUNRGB-D, and geometric relations between the video frames of target sequences.",

author = "Sinisa Stekovic and Friedrich Fraundorfer and Vincent Lepetit",

year = "2020",

month = mar,

doi = "10.1109/WACV45572.2020.9093571",

language = "English",

pages = "1843--1852",

booktitle = "Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020",

note = "wacv2020 : WACV 2020, WACV 2020 ; Conference date: 01-03-2020 Through 05-03-2020",

}

TY - GEN

T1 - Casting Geometric Constraints in Semantic Segmentation as Semi-Supervised Learning

AU - Stekovic, Sinisa

AU - Fraundorfer, Friedrich

AU - Lepetit, Vincent

PY - 2020/3

Y1 - 2020/3

N2 - We propose a simple yet effective method to learn to segment new indoor scenes from video frames: State-of- the-art methods trained on one dataset, even as large as the SUNRGB-D dataset, can perform poorly when applied to images that are not part of the dataset, because of the dataset bias, a common phenomenon in computer vision. To make semantic segmentation more useful in practice, one can exploit geometric constraints. Our main contribution is to show that these constraints can be cast conveniently as semi-supervised terms, which enforce the fact that the same class should be predicted for the projections of the same 3D location in different images. This is interesting as we can exploit general existing techniques de- veloped for semi-supervised learning to efficiently incorporate the constraints. We show that this approach can efficiently and accurately learn to segment target sequences of ScanNet and our own target sequences using only annotations from SUNRGB-D, and geometric relations between the video frames of target sequences.

AB - We propose a simple yet effective method to learn to segment new indoor scenes from video frames: State-of- the-art methods trained on one dataset, even as large as the SUNRGB-D dataset, can perform poorly when applied to images that are not part of the dataset, because of the dataset bias, a common phenomenon in computer vision. To make semantic segmentation more useful in practice, one can exploit geometric constraints. Our main contribution is to show that these constraints can be cast conveniently as semi-supervised terms, which enforce the fact that the same class should be predicted for the projections of the same 3D location in different images. This is interesting as we can exploit general existing techniques de- veloped for semi-supervised learning to efficiently incorporate the constraints. We show that this approach can efficiently and accurately learn to segment target sequences of ScanNet and our own target sequences using only annotations from SUNRGB-D, and geometric relations between the video frames of target sequences.

UR - http://www.scopus.com/inward/record.url?scp=85085484647&partnerID=8YFLogxK

U2 - 10.1109/WACV45572.2020.9093571

DO - 10.1109/WACV45572.2020.9093571

M3 - Conference paper

SP - 1843

EP - 1852

BT - Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020

T2 - wacv2020

Y2 - 1 March 2020 through 5 March 2020

ER -

Casting Geometric Constraints in Semantic Segmentation as Semi-Supervised Learning

Abstract

Konferenz

ASJC Scopus subject areas

Zugriff auf Dokument

Andere Dateien und Links

Fingerprint

Dieses zitieren