Is Appearance Free Action Recognition Possible?

Filip Ilic*, Thomas Pock, Richard P. Wildes

*Korrespondierende/r Autor/-in für diese Arbeit

Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandBegutachtung

Abstract

Intuition might suggest that motion and dynamic information are key to video-based action recognition. In contrast, there is evidence that state-of-the-art deep-learning video understanding architectures are biased toward static information available in single frames. Presently, a methodology and corresponding dataset to isolate the effects of dynamic information in video are missing. Their absence makes it difficult to understand how well contemporary architectures capitalize on dynamic vs. static information. We respond with a novel Appearance Free Dataset (AFD) for action recognition. AFD is devoid of static information relevant to action recognition in a single frame. Modeling of the dynamics is necessary for solving the task, as the action is only apparent through consideration of the temporal dimension. We evaluated 11 contemporary action recognition architectures on AFD as well as its related RGB video. Our results show a notable decrease in performance for all architectures on AFD compared to RGB. We also conducted a complimentary study with humans that shows their recognition accuracy on AFD and RGB is very similar and much better than the evaluated architectures on AFD. Our results motivate a novel architecture that revives explicit recovery of optical flow, within a contemporary design for best performance on AFD and RGB.

Originalspracheenglisch
TitelComputer Vision – ECCV 2022
Untertitel17th European Conference, 2022, Proceedings
Redakteure/-innenShai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
Herausgeber (Verlag)Springer Science and Business Media Deutschland GmbH
Seiten156-173
Band4
ISBN (Print)9783031197710
DOIs
PublikationsstatusVeröffentlicht - 2022
Veranstaltung2022 European Conference on Computer Vision: ECCV 2022 - Hybrider Event, Tel Aviv, Israel
Dauer: 23 Okt. 202227 Okt. 2022

Publikationsreihe

NameLecture Notes in Computer Science
Band13664
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

Konferenz

Konferenz2022 European Conference on Computer Vision
KurztitelECCV 2022
Land/GebietIsrael
OrtHybrider Event, Tel Aviv
Zeitraum23/10/2227/10/22

ASJC Scopus subject areas

  • Theoretische Informatik
  • Allgemeine Computerwissenschaft

Fingerprint

Untersuchen Sie die Forschungsthemen von „Is Appearance Free Action Recognition Possible?“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren