Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World

Florian Laurent; Manuel Schneider; Christian Scheller; Jeremy Watson; Jiaoyang Li; Zhe Chen; Yi Zheng; Shao-Hung Chan; Konstantin Makhnev; Oleg Svidchenko; Vladimir Egorov; Dmitry Ivanov; Aleksei Shpilman; Evgenija Spirovska; Oliver Tanevski; Aleksandar Nikov; Ramon Grunder; David Galevski; Jakov Mitrovski; Guillaume Sartoretti; Zhiyao Luo; Mehul Damani; Nilabha Bhattacharya; Shivam Agarwal; Adrian Egli; Erik Nygren; Sharada Mohanty

Proceedings of the NeurIPS 2020 Competition and Demonstration Track, PMLR 133:275-301, 2021.

Abstract

The Flatland competition aimed at finding novel approaches to solve the vehicle re-scheduling problem (VRSP). The VRSP is concerned with scheduling trips in traffic networks and the re-scheduling of vehicles when disruptions occur, for example the breakdown of a vehicle. While solving the VRSP in various settings has been an active area in operations research (OR) for decades, the ever-growing complexity of modern railway networks makes dynamic real-time scheduling of traffic virtually impossible. Recently, multi-agent reinforcement learning (MARL) has successfully tackled challenging tasks where many agents need to be coordinated, such as multiplayer video games. However, the coordination of hundreds of agents in a real-life setting like a railway network remains challenging and the Flatland environment used for the competition models these real-world properties in a simplified manner. Submissions had to bring as many trains (agents) to their target stations in as little time as possible. While the best submissions were in the OR category, participants found many promising MARL approaches. Using both centralized and decentralized learning based approaches, top submissions used graph representations of the environment to construct tree-based observations. Further, different coordination mechanisms were implemented, such as communication and prioritization between agents. This paper presents the competition setup, four outstanding solutions to the competition, and a cross-comparison between them.

Cite this Paper

BibTeX

@InProceedings{pmlr-v133-laurent21a,
  title = 	 {Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World},
  author =       {Laurent, Florian and Schneider, Manuel and Scheller, Christian and Watson, Jeremy and Li, Jiaoyang and Chen, Zhe and Zheng, Yi and Chan, Shao-Hung and Makhnev, Konstantin and Svidchenko, Oleg and Egorov, Vladimir and Ivanov, Dmitry and Shpilman, Aleksei and Spirovska, Evgenija and Tanevski, Oliver and Nikov, Aleksandar and Grunder, Ramon and Galevski, David and Mitrovski, Jakov and Sartoretti, Guillaume and Luo, Zhiyao and Damani, Mehul and Bhattacharya, Nilabha and Agarwal, Shivam and Egli, Adrian and Nygren, Erik and Mohanty, Sharada},
  booktitle = 	 {Proceedings of the NeurIPS 2020 Competition and Demonstration Track},
  pages = 	 {275--301},
  year = 	 {2021},
  editor = 	 {Escalante, Hugo Jair and Hofmann, Katja},
  volume = 	 {133},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {06--12 Dec},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v133/laurent21a/laurent21a.pdf},
  url = 	 {https://proceedings.mlr.press/v133/laurent21a.html},
  abstract = 	 {The Flatland competition aimed at finding novel approaches to solve the vehicle re-scheduling problem (VRSP). The VRSP is concerned with scheduling trips in traffic networks and the re-scheduling of vehicles when disruptions occur, for example the breakdown of a vehicle. While solving the VRSP in various settings has been an active area in operations research (OR) for decades, the ever-growing complexity of modern railway networks makes dynamic real-time scheduling of traffic virtually impossible. Recently, multi-agent reinforcement learning (MARL) has successfully tackled challenging tasks where many agents need to be coordinated, such as multiplayer video games. However, the coordination of hundreds of agents in a real-life setting like a railway network remains challenging and the Flatland environment used for the competition models these real-world properties in a simplified manner. Submissions had to bring as many trains (agents) to their target stations in as little time as possible. While the best submissions were in the OR category, participants found many promising MARL approaches. Using both centralized and decentralized learning based approaches, top submissions used graph representations of the environment to construct tree-based observations. Further, different coordination mechanisms were implemented, such as communication and prioritization between agents. This paper presents the competition setup, four outstanding solutions to the competition, and a cross-comparison between them.	}
}

Endnote

%0 Conference Paper
%T Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
%A Florian Laurent
%A Manuel Schneider
%A Christian Scheller
%A Jeremy Watson
%A Jiaoyang Li
%A Zhe Chen
%A Yi Zheng
%A Shao-Hung Chan
%A Konstantin Makhnev
%A Oleg Svidchenko
%A Vladimir Egorov
%A Dmitry Ivanov
%A Aleksei Shpilman
%A Evgenija Spirovska
%A Oliver Tanevski
%A Aleksandar Nikov
%A Ramon Grunder
%A David Galevski
%A Jakov Mitrovski
%A Guillaume Sartoretti
%A Zhiyao Luo
%A Mehul Damani
%A Nilabha Bhattacharya
%A Shivam Agarwal
%A Adrian Egli
%A Erik Nygren
%A Sharada Mohanty
%B Proceedings of the NeurIPS 2020 Competition and Demonstration Track
%C Proceedings of Machine Learning Research
%D 2021
%E Hugo Jair Escalante
%E Katja Hofmann
%F pmlr-v133-laurent21a
%I PMLR
%P 275--301
%U https://proceedings.mlr.press/v133/laurent21a.html
%V 133
%X The Flatland competition aimed at finding novel approaches to solve the vehicle re-scheduling problem (VRSP). The VRSP is concerned with scheduling trips in traffic networks and the re-scheduling of vehicles when disruptions occur, for example the breakdown of a vehicle. While solving the VRSP in various settings has been an active area in operations research (OR) for decades, the ever-growing complexity of modern railway networks makes dynamic real-time scheduling of traffic virtually impossible. Recently, multi-agent reinforcement learning (MARL) has successfully tackled challenging tasks where many agents need to be coordinated, such as multiplayer video games. However, the coordination of hundreds of agents in a real-life setting like a railway network remains challenging and the Flatland environment used for the competition models these real-world properties in a simplified manner. Submissions had to bring as many trains (agents) to their target stations in as little time as possible. While the best submissions were in the OR category, participants found many promising MARL approaches. Using both centralized and decentralized learning based approaches, top submissions used graph representations of the environment to construct tree-based observations. Further, different coordination mechanisms were implemented, such as communication and prioritization between agents. This paper presents the competition setup, four outstanding solutions to the competition, and a cross-comparison between them.

APA

Laurent, F., Schneider, M., Scheller, C., Watson, J., Li, J., Chen, Z., Zheng, Y., Chan, S.-H., Makhnev, K., Svidchenko, O., Egorov, V., Ivanov, D., Shpilman, A., Spirovska, E., Tanevski, O., Nikov, A., Grunder, R., Galevski, D., Mitrovski, J., Sartoretti, G., Luo, Z., Damani, M., Bhattacharya, N., Agarwal, S., Egli, A., Nygren, E., & Mohanty, S. (2021). Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World. Proceedings of the NeurIPS 2020 Competition and Demonstration Track, in Proceedings of Machine Learning Research 133:275-301. Available from https://proceedings.mlr.press/v133/laurent21a.html.
