Reinforcement Learning of Dispatching Strategies for Large-Scale Industrial Scheduling

Pierre Tassel; Benjamin Kovács; Martin Gebser; Konstantin Schekotihin; Wolfgang Kohlenbrein; Philipp Schrott-Kostwein

doi:10.1609/icaps.v32i1.19852

Institute of Software Technology (7160)

Publication: Chapter in Book/Report/Conference proceeding › Conference paper › peer-reviewed

Abstract

Scheduling is an important problem for many applications, including manufacturing, transportation, or cloud computing. Unfortunately, most of the scheduling problems occurring in practice are intractable and, therefore, solving large industrial instances is very time-consuming. Heuristic-based dispatching methods can compute schedules in an acceptable time, but the construction of a heuristic providing satisfactory solution quality is a tedious process. This work introduces a method to automatically learn dispatching strategies from just a few training instances using reinforcement learning. Evaluation results obtained on real-world, large-scale instances of a resource-constrained project scheduling problem taken from the literature show that the learned dispatching heuristic generalizes to unseen instances and produces high-quality schedules within seconds. As a result, our approach significantly outperforms state-of-the-art combinatorial optimization techniques in terms of solution quality and computation time.

Original language	English
Title	Proceedings of the 32nd International Conference on Automated Planning and Scheduling, ICAPS 2022
Editors	Akshat Kumar, Sylvie Thiebaux, Pradeep Varakantham, William Yeoh
Publisher	Association for the Advancement of Artificial Intelligence (AAAI)
Pages	638-646
Number of pages	9
ISBN (electronic)	9781577358749
DOIs	https://doi.org/10.1609/icaps.v32i1.19852
Publication status	Published - 13 June 2022
Event	32nd International Conference on Automated Planning and Scheduling: ICAPS 2022 - Virtual, Online, Singapore. Duration: 13 June 2022 → 24 June 2022

Publication series

Name	Proceedings International Conference on Automated Planning and Scheduling, ICAPS
Volume	32
ISSN (Print)	2334-0835
ISSN (electronic)	2334-0843

Conference

Conference	32nd International Conference on Automated Planning and Scheduling
Short title	ICAPS 2022
Country/Territory	Singapore
City	Virtual, Online
Period	13/06/22 → 24/06/22

ASJC Scopus subject areas

Artificial intelligence
Computer Science Applications
Information Systems and Management

Access to document

10.1609/icaps.v32i1.19852 (License: Other)

Other files and links

Link to publication in Scopus

Cite this

Tassel, P., Kovács, B., Gebser, M., Schekotihin, K., Kohlenbrein, W., & Schrott-Kostwein, P. (2022). Reinforcement Learning of Dispatching Strategies for Large-Scale Industrial Scheduling. In A. Kumar, S. Thiebaux, P. Varakantham, & W. Yeoh (Eds.), Proceedings of the 32nd International Conference on Automated Planning and Scheduling, ICAPS 2022 (pp. 638-646). (Proceedings International Conference on Automated Planning and Scheduling, ICAPS; Vol. 32). Association for the Advancement of Artificial Intelligence (AAAI). https://doi.org/10.1609/icaps.v32i1.19852

Reinforcement Learning of Dispatching Strategies for Large-Scale Industrial Scheduling. / Tassel, Pierre; Kovács, Benjamin; Gebser, Martin et al.
Proceedings of the 32nd International Conference on Automated Planning and Scheduling, ICAPS 2022. ed. / Akshat Kumar; Sylvie Thiebaux; Pradeep Varakantham; William Yeoh. Association for the Advancement of Artificial Intelligence (AAAI), 2022. p. 638-646 (Proceedings International Conference on Automated Planning and Scheduling, ICAPS; Vol. 32).

Tassel, P, Kovács, B, Gebser, M, Schekotihin, K, Kohlenbrein, W & Schrott-Kostwein, P 2022, Reinforcement Learning of Dispatching Strategies for Large-Scale Industrial Scheduling. in A Kumar, S Thiebaux, P Varakantham & W Yeoh (eds), Proceedings of the 32nd International Conference on Automated Planning and Scheduling, ICAPS 2022. Proceedings International Conference on Automated Planning and Scheduling, ICAPS, vol. 32, Association for the Advancement of Artificial Intelligence (AAAI), pp. 638-646, 32nd International Conference on Automated Planning and Scheduling, Virtual, Online, Singapore, 13/06/22. https://doi.org/10.1609/icaps.v32i1.19852

Tassel P, Kovács B, Gebser M, Schekotihin K, Kohlenbrein W, Schrott-Kostwein P. Reinforcement Learning of Dispatching Strategies for Large-Scale Industrial Scheduling. In Kumar A, Thiebaux S, Varakantham P, Yeoh W, editors, Proceedings of the 32nd International Conference on Automated Planning and Scheduling, ICAPS 2022. Association for the Advancement of Artificial Intelligence (AAAI). 2022. p. 638-646. (Proceedings International Conference on Automated Planning and Scheduling, ICAPS). doi: 10.1609/icaps.v32i1.19852

Tassel, Pierre; Kovács, Benjamin; Gebser, Martin et al. / Reinforcement Learning of Dispatching Strategies for Large-Scale Industrial Scheduling. Proceedings of the 32nd International Conference on Automated Planning and Scheduling, ICAPS 2022. ed. / Akshat Kumar; Sylvie Thiebaux; Pradeep Varakantham; William Yeoh. Association for the Advancement of Artificial Intelligence (AAAI), 2022. p. 638-646 (Proceedings International Conference on Automated Planning and Scheduling, ICAPS).

@inproceedings{6c04b71716464e8992309331bbf24aa1,

title = "Reinforcement Learning of Dispatching Strategies for Large-Scale Industrial Scheduling",

abstract = "Scheduling is an important problem for many applications, including manufacturing, transportation, or cloud computing. Unfortunately, most of the scheduling problems occurring in practice are intractable and, therefore, solving large industrial instances is very time-consuming. Heuristic-based dispatching methods can compute schedules in an acceptable time, but the construction of a heuristic providing satisfactory solution quality is a tedious process. This work introduces a method to automatically learn dispatching strategies from just a few training instances using reinforcement learning. Evaluation results obtained on real-world, large-scale instances of a resource-constrained project scheduling problem taken from the literature show that the learned dispatching heuristic generalizes to unseen instances and produces high-quality schedules within seconds. As a result, our approach significantly outperforms state-of-the-art combinatorial optimization techniques in terms of solution quality and computation time.",

author = "Pierre Tassel and Benjamin Kov{\'a}cs and Martin Gebser and Konstantin Schekotihin and Wolfgang Kohlenbrein and Philipp Schrott-Kostwein",

note = "Funding Information: This work was partially funded by KWF project 28472, cms electronics GmbH, FunderMax GmbH, Hirsch Armb{\"a}nder GmbH, incubed IT GmbH, Infineon Technologies Austria AG, Isovolta AG, Kostwein Holding GmbH, and Privatstiftung K{\"a}rntner Sparkasse. We are grateful to the anonymous reviewers for their constructive and helpful comments. Publisher Copyright: {\textcopyright} 2022, Association for the Advancement of Artificial Intelligence. 32nd International Conference on Automated Planning and Scheduling, ICAPS 2022; Conference date: 13-06-2022 through 24-06-2022",

year = "2022",

month = jun,

day = "13",

doi = "10.1609/icaps.v32i1.19852",

language = "English",

series = "Proceedings International Conference on Automated Planning and Scheduling, ICAPS",

publisher = "Association for the Advancement of Artificial Intelligence (AAAI)",

pages = "638--646",

editor = "Akshat Kumar and Sylvie Thiebaux and Pradeep Varakantham and William Yeoh",

booktitle = "Proceedings of the 32nd International Conference on Automated Planning and Scheduling, ICAPS 2022",

}

TY - GEN

T1 - Reinforcement Learning of Dispatching Strategies for Large-Scale Industrial Scheduling

AU - Tassel, Pierre

AU - Kovács, Benjamin

AU - Gebser, Martin

AU - Schekotihin, Konstantin

AU - Kohlenbrein, Wolfgang

AU - Schrott-Kostwein, Philipp

N1 - Funding Information: This work was partially funded by KWF project 28472, cms electronics GmbH, FunderMax GmbH, Hirsch Armbänder GmbH, incubed IT GmbH, Infineon Technologies Austria AG, Isovolta AG, Kostwein Holding GmbH, and Privatstiftung Kärntner Sparkasse. We are grateful to the anonymous reviewers for their constructive and helpful comments. Publisher Copyright: © 2022, Association for the Advancement of Artificial Intelligence.

PY - 2022/6/13

Y1 - 2022/6/13

AB - Scheduling is an important problem for many applications, including manufacturing, transportation, or cloud computing. Unfortunately, most of the scheduling problems occurring in practice are intractable and, therefore, solving large industrial instances is very time-consuming. Heuristic-based dispatching methods can compute schedules in an acceptable time, but the construction of a heuristic providing satisfactory solution quality is a tedious process. This work introduces a method to automatically learn dispatching strategies from just a few training instances using reinforcement learning. Evaluation results obtained on real-world, large-scale instances of a resource-constrained project scheduling problem taken from the literature show that the learned dispatching heuristic generalizes to unseen instances and produces high-quality schedules within seconds. As a result, our approach significantly outperforms state-of-the-art combinatorial optimization techniques in terms of solution quality and computation time.

UR - http://www.scopus.com/inward/record.url?scp=85137896939&partnerID=8YFLogxK

U2 - 10.1609/icaps.v32i1.19852

DO - 10.1609/icaps.v32i1.19852

M3 - Conference paper

AN - SCOPUS:85137896939

T3 - Proceedings International Conference on Automated Planning and Scheduling, ICAPS

SP - 638

EP - 646

BT - Proceedings of the 32nd International Conference on Automated Planning and Scheduling, ICAPS 2022

A2 - Kumar, Akshat

A2 - Thiebaux, Sylvie

A2 - Varakantham, Pradeep

A2 - Yeoh, William

PB - Association for the Advancement of Artificial Intelligence (AAAI)

T2 - 32nd International Conference on Automated Planning and Scheduling

Y2 - 13 June 2022 through 24 June 2022

ER -