Improving Optical Flow on a Pyramid Level

Markus Hofinger, Samuel Rota Bulò, Lorenzo Porzi, Arno Knapitsch, Thomas Pock, Peter Kontschieder

Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandBegutachtung

Abstract

In this work we review the coarse-to-fine spatial feature pyramid concept, which is used in state-of-the-art optical flow estimation networks to make exploration of the pixel flow search space computationally tractable and efficient. Within an individual pyramid level, we improve the cost volume construction process by departing from a warping- to a sampling-based strategy, which avoids ghosting and hence enables us to better preserve fine flow details. We further amplify the positive effects through a level-specific, loss max-pooling strategy that adaptively shifts the focus of the learning process on under-performing predictions. Our second contribution revises the gradient flow across pyramid levels. The typical operations performed at each pyramid level can lead to noisy, or even contradicting gradients across levels. We show and discuss how properly blocking some of these gradient components leads to improved convergence and ultimately better performance. Finally, we introduce a distillation concept to counteract the issue of catastrophic forgetting during finetuning and thus preserving knowledge over models sequentially trained on multiple datasets. Our findings are conceptually simple and easy to implement, yet result in compelling improvements on relevant error measures that we demonstrate via exhaustive ablations on datasets like Flying Chairs2, Flying Things, Sintel and KITTI. We establish new state-of-the-art results on the challenging Sintel and KITTI 2012 test datasets, and even show the portability of our findings to different optical flow and depth from stereo approaches.
Originalspracheenglisch
TitelComputer Vision – ECCV 2020 - 16th European Conference, 2020, Proceedings
Redakteure/-innenAndrea Vedaldi, Horst Bischof, Thomas Brox, Jan-Michael Frahm
Seiten770-786
Seitenumfang17
Band12373
ISBN (elektronisch)978-3-030-58604-1
DOIs
PublikationsstatusVeröffentlicht - 1 Jan. 2020
Veranstaltung16th European Conference on Computer Vision: ECCV 2020 - Virtual, Glasgow, Großbritannien / Vereinigtes Königreich
Dauer: 23 Aug. 202028 Aug. 2020

Publikationsreihe

NameLecture Notes in Computer Science
Band12373
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

Konferenz

Konferenz16th European Conference on Computer Vision
KurztitelECCV 2020
Land/GebietGroßbritannien / Vereinigtes Königreich
OrtVirtual, Glasgow
Zeitraum23/08/2028/08/20

ASJC Scopus subject areas

  • Theoretische Informatik
  • Informatik (insg.)

Fingerprint

Untersuchen Sie die Forschungsthemen von „Improving Optical Flow on a Pyramid Level“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren