Collaborative Multi-agent Reinforcement Learning for Landmark Localization Using Continuous Action Space

Klemens Kasseroller; Franz Thaler; Christian Payer; Darko Štern

doi:10.1007/978-3-030-78191-0_59

Klemens Kasseroller, Franz Thaler, Christian Payer, Darko Štern^*

^*Corresponding author for this work

Institute of Computer Graphics and Vision (7100)

Publication: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

We propose a reinforcement learning (RL) based approach for anatomical landmark localization in medical images, where the agent can move in arbitrary directions with a variable step size. Using a continuous action space reduces the average number of steps required to locate a landmark by a factor of more than 30 compared to localization using discrete actions. Our approach outperforms a state-of-the-art RL method based on a discrete action space and is in line with state-of-the-art supervised regression-based methods. Furthermore, we extend our approach to a multi-agent setting, where we allow collaboration between agents to enable learning of the landmarks’ spatial configuration. The results of the multi-agent RL-based approach show that the position of occluded landmarks can be successfully estimated based on the relative position predicted for the visible landmarks.
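The step-count argument in the abstract can be illustrated with a toy sketch. It is not the authors' method: it assumes an idealized agent that always moves straight toward a known 2D target, and the function names, the unit-step discrete moves, and the `max_step` cap are all hypothetical choices made for illustration. The numbers it produces are unrelated to the results reported in the paper.

```python
import math


def discrete_steps(start, target):
    """Steps for an agent that moves one pixel along one axis per action."""
    return sum(abs(t - s) for s, t in zip(start, target))


def continuous_steps(start, target, max_step, tol=1e-9):
    """Steps for an agent whose action is a 2D vector of length <= max_step.

    Assumption: the agent always heads straight for the target (an oracle
    policy), which a learned policy would only approximate.
    """
    pos = [float(c) for c in start]
    steps = 0
    while True:
        delta = [t - p for p, t in zip(pos, target)]
        dist = math.hypot(delta[0], delta[1])
        if dist <= tol:
            return steps
        # Move the full remaining distance if it fits, else a capped step.
        scale = min(1.0, max_step / dist)
        pos = [p + scale * d for p, d in zip(pos, delta)]
        steps += 1


if __name__ == "__main__":
    start, target = (0, 0), (30, 40)
    print("discrete:", discrete_steps(start, target))
    print("continuous:", continuous_steps(start, target, max_step=10.0))
```

In this toy setting the continuous agent needs fewer actions because each action can cover a diagonal displacement of variable length, whereas the discrete agent pays one action per pixel per axis.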

Original language	English
Title of host publication	Information Processing in Medical Imaging - 27th International Conference, IPMI 2021, Proceedings
Editors	Aasa Feragen, Stefan Sommer, Julia Schnabel, Mads Nielsen
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	767-778
Number of pages	12
ISBN (Print)	9783030781903
DOIs	https://doi.org/10.1007/978-3-030-78191-0_59
Publication status	Published - 2021
Event	27th International Conference on Information Processing in Medical Imaging, IPMI 2021 - Virtual, Online. Duration: 28 June 2021 → 30 June 2021

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	12729 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	27th International Conference on Information Processing in Medical Imaging, IPMI 2021
Location	Virtual, Online
Period	28/06/21 → 30/06/21

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to document

10.1007/978-3-030-78191-0_59

Cite this

Kasseroller, K., Thaler, F., Payer, C., & Štern, D. (2021). Collaborative Multi-agent Reinforcement Learning for Landmark Localization Using Continuous Action Space. In A. Feragen, S. Sommer, J. Schnabel, & M. Nielsen (Eds.), Information Processing in Medical Imaging - 27th International Conference, IPMI 2021, Proceedings (pp. 767-778). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12729 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-78191-0_59

Collaborative Multi-agent Reinforcement Learning for Landmark Localization Using Continuous Action Space. / Kasseroller, Klemens; Thaler, Franz; Payer, Christian et al.
Information Processing in Medical Imaging - 27th International Conference, IPMI 2021, Proceedings. Ed. / Aasa Feragen; Stefan Sommer; Julia Schnabel; Mads Nielsen. Springer Science and Business Media Deutschland GmbH, 2021. pp. 767-778 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12729 LNCS).

Kasseroller, K, Thaler, F, Payer, C & Štern, D 2021, Collaborative Multi-agent Reinforcement Learning for Landmark Localization Using Continuous Action Space. in A Feragen, S Sommer, J Schnabel & M Nielsen (eds), Information Processing in Medical Imaging - 27th International Conference, IPMI 2021, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12729 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 767-778, 27th International Conference on Information Processing in Medical Imaging, IPMI 2021, Virtual, Online, 28/06/21. https://doi.org/10.1007/978-3-030-78191-0_59

Kasseroller K, Thaler F, Payer C, Štern D. Collaborative Multi-agent Reinforcement Learning for Landmark Localization Using Continuous Action Space. In Feragen A, Sommer S, Schnabel J, Nielsen M, editors, Information Processing in Medical Imaging - 27th International Conference, IPMI 2021, Proceedings. Springer Science and Business Media Deutschland GmbH. 2021. p. 767-778. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-78191-0_59

Kasseroller, Klemens ; Thaler, Franz ; Payer, Christian et al. / Collaborative Multi-agent Reinforcement Learning for Landmark Localization Using Continuous Action Space. Information Processing in Medical Imaging - 27th International Conference, IPMI 2021, Proceedings. Ed. / Aasa Feragen ; Stefan Sommer ; Julia Schnabel ; Mads Nielsen. Springer Science and Business Media Deutschland GmbH, 2021. pp. 767-778 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{9693b8b01f254328a6e3412367fac6eb,
title = "Collaborative Multi-agent Reinforcement Learning for Landmark Localization Using Continuous Action Space",
abstract = "We propose a reinforcement learning (RL) based approach for anatomical landmark localization in medical images, where the agent can move in arbitrary directions with a variable step size. Using a continuous action space reduces the average number of steps required to locate a landmark by more than 30 times compared to localization using discrete actions. Our approach outperforms a state-of-the-art RL method based on a discrete action space and is inline with state-of-the-art supervised regression based methods. Furthermore, we extend our approach to a multi-agent setting, where we allow collaboration between agents to enable learning of the landmarks{\textquoteright} spatial configuration. The results of the multi-agent RL based approach show that the position of occluded landmarks can be successfully estimated based on the relative position predicted for the visible landmarks.",
keywords = "Collaborative multi-agent, Continuous action space, Landmark localization, Reinforcement learning",
author = "Klemens Kasseroller and Franz Thaler and Christian Payer and Darko {\v S}tern",
note = "Publisher Copyright: {\textcopyright} 2021, Springer Nature Switzerland AG.; 27th International Conference on Information Processing in Medical Imaging, IPMI 2021 ; Conference date: 28-06-2021 Through 30-06-2021",
year = "2021",
doi = "10.1007/978-3-030-78191-0_59",
language = "English",
isbn = "9783030781903",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "767--778",
editor = "Aasa Feragen and Stefan Sommer and Julia Schnabel and Mads Nielsen",
booktitle = "Information Processing in Medical Imaging - 27th International Conference, IPMI 2021, Proceedings",
address = "Germany",
}

TY - GEN
T1 - Collaborative Multi-agent Reinforcement Learning for Landmark Localization Using Continuous Action Space
AU - Kasseroller, Klemens
AU - Thaler, Franz
AU - Payer, Christian
AU - Štern, Darko
PY - 2021
Y1 - 2021
N2 - We propose a reinforcement learning (RL) based approach for anatomical landmark localization in medical images, where the agent can move in arbitrary directions with a variable step size. Using a continuous action space reduces the average number of steps required to locate a landmark by more than 30 times compared to localization using discrete actions. Our approach outperforms a state-of-the-art RL method based on a discrete action space and is inline with state-of-the-art supervised regression based methods. Furthermore, we extend our approach to a multi-agent setting, where we allow collaboration between agents to enable learning of the landmarks’ spatial configuration. The results of the multi-agent RL based approach show that the position of occluded landmarks can be successfully estimated based on the relative position predicted for the visible landmarks.
AB - We propose a reinforcement learning (RL) based approach for anatomical landmark localization in medical images, where the agent can move in arbitrary directions with a variable step size. Using a continuous action space reduces the average number of steps required to locate a landmark by more than 30 times compared to localization using discrete actions. Our approach outperforms a state-of-the-art RL method based on a discrete action space and is inline with state-of-the-art supervised regression based methods. Furthermore, we extend our approach to a multi-agent setting, where we allow collaboration between agents to enable learning of the landmarks’ spatial configuration. The results of the multi-agent RL based approach show that the position of occluded landmarks can be successfully estimated based on the relative position predicted for the visible landmarks.
KW - Collaborative multi-agent
KW - Continuous action space
KW - Landmark localization
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85111455725&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-78191-0_59
DO - 10.1007/978-3-030-78191-0_59
M3 - Conference paper
AN - SCOPUS:85111455725
SN - 9783030781903
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 767
EP - 778
BT - Information Processing in Medical Imaging - 27th International Conference, IPMI 2021, Proceedings
A2 - Feragen, Aasa
A2 - Sommer, Stefan
A2 - Schnabel, Julia
A2 - Nielsen, Mads
PB - Springer Science and Business Media Deutschland GmbH
T2 - 27th International Conference on Information Processing in Medical Imaging, IPMI 2021
Y2 - 28 June 2021 through 30 June 2021
ER -
