Reinforcement learning based process optimization and strategy development in conventional tunneling

Georg H. Erharter; Tom F. Hansen; Zhongqiang Liu; Thomas Marcher

doi:10.1016/j.autcon.2021.103701

Reinforcement learning based process optimization and strategy development in conventional tunneling

Georg H. Erharter^*, Tom F. Hansen, Zhongqiang Liu, Thomas Marcher

^*Korrespondierende/r Autor/-in für diese Arbeit

Institut für Felsmechanik und Tunnelbau (2200)

Publikation: Beitrag in einer Fachzeitschrift › Artikel › Begutachtung

Abstract

Reinforcement learning (RL) - a branch of machine learning - refers to the process of an agent learning to achieve a certain goal by interaction with its environment. The process of conventional tunneling shows many similarities, where a geotechnician (agent) tries to achieve a breakthrough (goal) by excavating the rockmass (environment) in an optimum way. In this paper we present a novel RL based framework for strategy development for conventional tunneling. We developed a virtual environment with the goal of a tunnel breakthrough and with a deep Q-network as the agent's architecture. It can choose from different excavation sequences to reach that goal and learns to do so in an economical and safe way by getting feedback from a specially designed reward system. Result analyses show that the optimal policies have great similarities to current practices of sequential tunneling and the framework has the potential to discover new tunneling strategies.

Originalsprache	englisch
Aufsatznummer	103701
Fachzeitschrift	Automation in Construction
Jahrgang	127
DOIs	https://doi.org/10.1016/j.autcon.2021.103701
Publikationsstatus	Veröffentlicht - Juli 2021

ASJC Scopus subject areas

Steuerungs- und Systemtechnik
Tief- und Ingenieurbau
Bauwesen

Zugriff auf Dokument

10.1016/j.autcon.2021.103701Lizenz: CC BY 4.0

Andere Dateien und Links

http://www.scopus.com/inward/record.url?scp=85104426966&partnerID=8YFLogxK

Dieses zitieren

@article{006e659838c64d45a59d4c5f1b856bc4,

title = "Reinforcement learning based process optimization and strategy development in conventional tunneling",

abstract = "Reinforcement learning (RL) - a branch of machine learning - refers to the process of an agent learning to achieve a certain goal by interaction with its environment. The process of conventional tunneling shows many similarities, where a geotechnician (agent) tries to achieve a breakthrough (goal) by excavating the rockmass (environment) in an optimum way. In this paper we present a novel RL based framework for strategy development for conventional tunneling. We developed a virtual environment with the goal of a tunnel breakthrough and with a deep Q-network as the agent's architecture. It can choose from different excavation sequences to reach that goal and learns to do so in an economical and safe way by getting feedback from a specially designed reward system. Result analyses show that the optimal policies have great similarities to current practices of sequential tunneling and the framework has the potential to discover new tunneling strategies.",

keywords = "Conventional tunneling, Excavation sequences, Machine learning, Reinforcement learning, Tunnel excavation strategy",

author = "Erharter, {Georg H.} and Hansen, {Tom F.} and Zhongqiang Liu and Thomas Marcher",

year = "2021",

month = jul,

doi = "10.1016/j.autcon.2021.103701",

language = "English",

volume = "127",

journal = "Automation in Construction",

issn = "0926-5805",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Reinforcement learning based process optimization and strategy development in conventional tunneling

AU - Erharter, Georg H.

AU - Hansen, Tom F.

AU - Liu, Zhongqiang

AU - Marcher, Thomas

PY - 2021/7

Y1 - 2021/7

N2 - Reinforcement learning (RL) - a branch of machine learning - refers to the process of an agent learning to achieve a certain goal by interaction with its environment. The process of conventional tunneling shows many similarities, where a geotechnician (agent) tries to achieve a breakthrough (goal) by excavating the rockmass (environment) in an optimum way. In this paper we present a novel RL based framework for strategy development for conventional tunneling. We developed a virtual environment with the goal of a tunnel breakthrough and with a deep Q-network as the agent's architecture. It can choose from different excavation sequences to reach that goal and learns to do so in an economical and safe way by getting feedback from a specially designed reward system. Result analyses show that the optimal policies have great similarities to current practices of sequential tunneling and the framework has the potential to discover new tunneling strategies.

AB - Reinforcement learning (RL) - a branch of machine learning - refers to the process of an agent learning to achieve a certain goal by interaction with its environment. The process of conventional tunneling shows many similarities, where a geotechnician (agent) tries to achieve a breakthrough (goal) by excavating the rockmass (environment) in an optimum way. In this paper we present a novel RL based framework for strategy development for conventional tunneling. We developed a virtual environment with the goal of a tunnel breakthrough and with a deep Q-network as the agent's architecture. It can choose from different excavation sequences to reach that goal and learns to do so in an economical and safe way by getting feedback from a specially designed reward system. Result analyses show that the optimal policies have great similarities to current practices of sequential tunneling and the framework has the potential to discover new tunneling strategies.

KW - Conventional tunneling

KW - Excavation sequences

KW - Machine learning

KW - Reinforcement learning

KW - Tunnel excavation strategy

UR - http://www.scopus.com/inward/record.url?scp=85104426966&partnerID=8YFLogxK

U2 - 10.1016/j.autcon.2021.103701

DO - 10.1016/j.autcon.2021.103701

M3 - Article

AN - SCOPUS:85104426966

SN - 0926-5805

VL - 127

JO - Automation in Construction

JF - Automation in Construction

M1 - 103701

ER -

Reinforcement learning based process optimization and strategy development in conventional tunneling

Abstract

ASJC Scopus subject areas

Zugriff auf Dokument

Andere Dateien und Links

Fingerprint

Dieses zitieren