Collaborative Multi-agent Reinforcement Learning for Landmark Localization Using Continuous Action Space

Klemens Kasseroller, Franz Thaler, Christian Payer, Darko Štern*

*Korrespondierende/r Autor/-in für diese Arbeit

Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandBegutachtung

Abstract

We propose a reinforcement learning (RL) based approach for anatomical landmark localization in medical images, where the agent can move in arbitrary directions with a variable step size. Using a continuous action space reduces the average number of steps required to locate a landmark by more than 30 times compared to localization using discrete actions. Our approach outperforms a state-of-the-art RL method based on a discrete action space and is inline with state-of-the-art supervised regression based methods. Furthermore, we extend our approach to a multi-agent setting, where we allow collaboration between agents to enable learning of the landmarks’ spatial configuration. The results of the multi-agent RL based approach show that the position of occluded landmarks can be successfully estimated based on the relative position predicted for the visible landmarks.

Originalspracheenglisch
TitelInformation Processing in Medical Imaging - 27th International Conference, IPMI 2021, Proceedings
Redakteure/-innenAasa Feragen, Stefan Sommer, Julia Schnabel, Mads Nielsen
Herausgeber (Verlag)Springer Science and Business Media Deutschland GmbH
Seiten767-778
Seitenumfang12
ISBN (Print)9783030781903
DOIs
PublikationsstatusVeröffentlicht - 2021
Veranstaltung27th International Conference on Information Processing in Medical Imaging, IPMI 2021 - Virtual, Online
Dauer: 28 Juni 202130 Juni 2021

Publikationsreihe

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band12729 LNCS
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

Konferenz

Konferenz27th International Conference on Information Processing in Medical Imaging, IPMI 2021
OrtVirtual, Online
Zeitraum28/06/2130/06/21

ASJC Scopus subject areas

  • Theoretische Informatik
  • Allgemeine Computerwissenschaft

Fingerprint

Untersuchen Sie die Forschungsthemen von „Collaborative Multi-agent Reinforcement Learning for Landmark Localization Using Continuous Action Space“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren