Multi-agent pathfinding explained
The problem of Multi-Agent Pathfinding (MAPF) is an instance of multi-agent planning and consists in the computation of collision-free paths for a group of agents from their location to an assigned target. It is an optimization problem, since the aim is to find those paths that optimize a given objective function, usually defined as the number of time steps until all agents reach their goal cells. MAPF is the multi-agent generalization of the pathfinding problem, and it is closely related to the shortest path problem in the context of graph theory.
Several algorithms have been proposed to solve the MAPF problem. Due to its complexity, it happens that optimal approaches are infeasible on big environments and with a high number of agents. However, given the applications in which MAPF is involved such as automated warehouses and airport management, it is important to reach a trade-off between the efficiency of the solution and its effectiveness.
Problem Formalization
The elements of a classical MAPF [1] problem are the following:
of
agents;
, where
is the node set, and
is the edge set. The nodes represent the possible locations of the agents, while the arcs are the possible connections between such positions;
that associates each agent with its starting point;
that associates each agent with its target point.
It is assumed that time is discrete, and that each agent can perform one action at each time step. There are two possible types of actions: the wait action, in which the agent remains in its node, and the move action, that allows the agent to move to an adjacent node. An action is formalized as a function
, meaning that
represents the action of moving from
to
if
is adjacent to
and different than
, or to stay in node
if
.
The agents perform sequences of actions to go from their starting point to their target location. A sequence of action performed by agent
is denoted by
and is called a plan. If agent
starts from its location
and arrives to its target location
performing plan
, then
is called single-agent plan for agent
. A valid solution for the MAPF problem is a set of
single-agent plans (one for each agent), such that the plans do not collide one another. Once an agent has reached its target, it can either remain in the target location or disappear.
Types of Collisions
In order to have a valid solution for a MAPF problem, it is necessary that the single-agent plans of the
agents do not collide one another. Given plan
, the expression
denotes the position of agent
after having performed
steps of plan
. It is possible to distinguish five different types of collisions between two plans
and
.
- Vertex conflict: there is a vertex conflict between plans
and
when the two agents occupy the same location at the same time. Formally, a vertex conflict happens when
.
- Edge conflict: an edge conflict occurs whenever two agents cross the same edge in the same direction at the same time, that is
and
. If vertex conflicts are not allowed, then edge conflicts cannot exist.
- Following conflict: a following conflict happens when at a certain time step an agent occupies a location that was occupied by another agent in the previous time step. Mathematically, a following conflict is described as
.
- Cycle conflict: a cycle conflict happens whenever a set of agents (at least three) move as if they are spinning in a cycle. It means that each agent takes the position that was previously occupied by the other agent one step-ahead in the cycle. If following conflicts are forbidden, then cycle conflicts cannot happen.
- Swapping conflict: a swapping conflict is the case in which two agents exchange their position, passing on the same edge at the same time in two different directions. It is expressed as
and
.
When formalizing a MAPF problem, it is possible to decide which conflicts are allowed and which are forbidden. A unified standard about permitted and denied conflicts does not exist, however vertex and edge conflicts are usually not allowed.
Objective Functions
When computing single-agent plans, the aim is to maximize a user-defined objective function. There is not a standard objective function to adopt, however the most common are:
- flowtime: this measure is obtained by summing the time steps employed by each agent to reach their target location. Formally, it is equal to
, where the plans
are single-agent plans without collisions;
- makespan: the number of time steps necessary so that all the agents complete theirs tasks, defined as
, where
are part of a valid solution;
- maximization of reached targets given a deadline: the aim is to find a valid solution that maximizes the number of agents that reach their target given a time deadline.[2]
Algorithms
Several algorithms have been proposed to solve the MAPF problem. The issue is that it is NP-hard to find optimal makespan or flow-time solutions,[3] also when considering planar graphs,[4] or graphs similar to grids.[5] For what concerns bounded suboptimal solutions, it is shown that it is NP-hard to find a makespan-optimal solution with a factor of suboptimality smaller than
.
[6] Optimal MAPF solvers return high quality solutions, but their efficiency is low. Instead, bounded-suboptimal and suboptimal solvers are more efficient, but their solutions are less effective. Also
machine learning approaches have been proposed to solve the MAPF problem.
[7] Prioritized Planning
One possible approach to face the computational complexity is prioritized planning.[8] It consists in decoupling the MAPF problem into
single-agent pathfinding problems.
The first step is to assign to each agent a unique number
that corresponds to the priority given to the agent. Then, following the priority order, for each agent a plan is computed to reach the target location. When planning, agents have to avoid collisions with paths of other agents with a higher priority that have already computed their plans.
Finding a solution for the MAPF problem in such setting corresponds to the shortest path problem in a time-expansion graph.[9] A time-expansion graph is a graph that takes into account the passing of time. Each node is composed by two entries
, where
is the node name and
is the time step. Each node
is linked to the nodes
such that
is adjacent to
and
is not occupied at time step
.
The drawback of prioritized planning is that, even if it is a sound approach (it returns valid solutions), it is neither optimal nor complete.[10] This means that it is not assured that the algorithm will return a solution and, even in that case, the solution may not be optimal.
Optimal MAPF Solvers
It is possible to distinguish four different categories of optimal MAPF solvers:[10]
- Extensions of A*: algorithms in this category employ modified versions of the A* approach.
- Increasing Cost Tree Search:[11] a novel formalization of the MAPF problem is proposed, that comprehends an increasing search tree and the corresponding algorithm. The algorithm is composed by two levels and relies on the assumption that a valid solution for the MAPF problem is composed by a set of solutions for the single agents.
- Conflict-Based Search:[12] this algorithm computes paths as when solving single-agent pathfinding problems, and then it adds constraints in an incremental way in order to avoid collisions.
- Constraints programming:[13] with this kind of approach, MAPF problems are transformed into a set of constraints and then solved using specific constraint solvers such as SAT and Mixed Integer Programming (MIP) solvers.
Bounded Suboptimal MAPF Solvers
Bounded suboptimal algorithms offer a trade-off between the optimality and the cost of the solution. They are said to be bounded by a certain factor because they return solutions with a cost at most equal to the optimal solution cost times the factor. MAPF bounded suboptimal solvers can be divided following the same categorization presented for optimal MAPF solvers.[10]
Variations
The way in which MAPF problems are defined allows to change various aspects, for example the fact of being in a grid environment or the assumption that time is discrete. This section reports some variations of the classical MAPF problem.
Anonymous MAPF
It is a version of MAPF in which there is a set of target locations but agents are not assigned a specific target.[14] It does not matter whether the agent that reaches the target, the important thing is that targets are completed. A slight modification of this version is the one in which agents are divided into groups and each group has to perform a set of targets.[15]
Multi-Agent Pick-up and Delivery
MAPF problem is not able to capture some aspects relative to real world applications. For example, in automated warehouses it happens that robots have to complete several tasks one after the other. For this reason, an extended MAPF version is proposed, called Multi-Agent Pick-up and Delivery (MAPD).[16] In a MAPD setting, agents have to complete a stream of tasks, where each task is composed by a pick-up a location and a delivery location. When planning for the completion of a task, the path has to start from the current position of the robot and to end in the delivery position of the task, passing through the pick-up point. MAPD is considered a "lifelong" version of MAPF in which tasks arrive incrementally.[16]
Beyond Classical MAPF
The assumptions that the agents are in a grid environment, that their speed is constant and that time is discrete are simplifying hypotheses. Many works take into account the kinematic constraints of agents,[17] such as velocity and orientation, or go past the assumption that the weights of the arcs are all equal to 1.[18] Other works focus on eliminating the discrete time assumptions and that the duration of actions is exactly equal to one time step.[19] Another assumption that does not reflect reality is that agents occupy exactly one cell of the environment in which they are: some studies have been conducted to overcome this hypothesis.[20] It is interesting to note that the shape and geometry of agents may introduce new types of conflicts, since agents may crash with one another even if they are not completely overlapped.
Applications
MAPF can be applied in several real case scenarios:
- Automated warehouses: warehouse logistics represents the main industrial application of MAPF. It has been shown that automation in warehouses manages to increase the productivity level.[21]
- Airport operations: MAPF algorithms can be employed in crowded airports to coordinate towing vehicles that transport aircraft.[22] Being able to optimize this kind of problems provides also a benefit to the environment.
- Autonomous mobile service robots: service robots are automated agents that perform dangerous and repetitive tasks for humans in a non-industrial environment.[23] Their main goal is to help human beings.
- Video games: the usefulness of MAPF in such settings can be found when the player has to move a team of agents in a congested video game environment.[24]
See also
References
- Stern . Roni . Sturtevant . Nathan R. . Felner . Ariel . Koenig . Sven . Ma . Hang . Walker . Thayne . Li . Jiaoyang . Atzmon . Dor . Cohen . Liron . Kumar . T. K. Satish . Boyarski . Eli . Barták . Roman . Multi-Agent Pathfinding: Definitions, Variants, and Benchmarks . 2019 . 1906.08291 .
- Ma . Hang . Wagner . Glenn . Felner . Ariel . Li . Jiaoyang . Kumar . T. K. Satish . Koenig . Sven . Multi-Agent Path Finding with Deadlines . 2018 . cs.AI . 1806.04216.
- Book: Yu . Jingjin . LaValle . Steven M. . Structure and Intractability of Optimal Multi-Robot Path Planning on Graphs . Proceedings of the AAAI Conference on Artificial Intelligence . 2013 . 27 . 1443–1449 . 10.1609/aaai.v27i1.8541 . 11655439 . https://dl.acm.org/doi/10.5555/2891460.2891662.
- Yu . Jingjin . Intractability of Optimal Multirobot Path Planning on Planar Graphs . IEEE Robotics and Automation Letters . January 2016 . 1 . 1 . 33–40 . 10.1109/LRA.2015.2503143. 1504.02072 . 10275469 .
- Banfi . Jacopo . Basilico . Nicola . Amigoni . Francesco . Intractability of Time-Optimal Multirobot Path Planning on 2D Grid Graphs with Holes . IEEE Robotics and Automation Letters . October 2017 . 2 . 4 . 1941–1947 . 10.1109/LRA.2017.2715406. 36801258 .
- Book: Ma . Hang . Tovey . Craig . Sharon . Guni . Kumar . T. K. . Koenig . Sven . Multi-Agent Path Finding with Payload Transfers and the Package-Exchange Robot-Routing Problem . Proceedings of the AAAI Conference on Artificial Intelligence . 5 March 2016 . 30 . 1 . 10.1609/aaai.v30i1.10409. 1585005 .
- Sartoretti . Guillaume . Kerr . Justin . Shi . Yunfei . Wagner . Glenn . Kumar . T. K. Satish . Koenig . Sven . Choset . Howie . PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning . IEEE Robotics and Automation Letters . July 2019 . 4 . 3 . 2378–2385 . 10.1109/LRA.2019.2903261. 52191178 . free . 1809.03531 .
- Cap . Michal . Novak . Peter . Kleiner . Alexander . Selecky . Martin . Prioritized Planning Algorithms for Trajectory Coordination of Multiple Mobile Robots . IEEE Transactions on Automation Science and Engineering . July 2015 . 12 . 3 . 835–849 . 10.1109/TASE.2015.2445780. 347488 . 1409.2399 .
- Book: Silver . David . Cooperative Pathfinding . Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment . 2021 . 1 . 1 . 117–122 . 10.1609/aiide.v1i1.18726 . 17714238 . https://ojs.aaai.org/index.php/AIIDE/article/view/18726/18503.
- Book: Stern . Roni . Artificial Intelligence . Multi-Agent Path Finding – an Overview . Lecture Notes in Computer Science . 2019 . 11866 . 96–115 . 10.1007/978-3-030-33274-7_6. 978-3-030-33273-0 . 204832267 .
- Sharon . Guni . Stern . Roni . Goldenberg . Meir . Felner . Ariel . The increasing cost tree search for optimal multi-agent pathfinding . Artificial Intelligence . February 2013 . 195 . 470–495 . 10.1016/j.artint.2012.11.006. free .
- Sharon . Guni . Stern . Roni . Felner . Ariel . Sturtevant . Nathan R. . Conflict-based search for optimal multi-agent pathfinding . Artificial Intelligence . February 2015 . 219 . 40–66 . 10.1016/j.artint.2014.11.006. 3809045 .
- Book: Bartak . Roman . Zhou . Neng-Fa . Stern . Roni . Boyarski . Eli . Surynek . Pavel . 2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI) . Modeling and Solving the Multi-agent Pathfinding Problem in Picat . November 2017 . 959–966 . 10.1109/ICTAI.2017.00147. 978-1-5386-3876-7 . 7819470 .
- Kloder . S. . Hutchinson . S. . Path planning for permutation-invariant multirobot formations . IEEE Transactions on Robotics . August 2006 . 22 . 4 . 650–665 . 10.1109/TRO.2006.878952. 62805494 .
- Ma . Hang . Koenig . Sven . Optimal Target Assignment and Path Finding for Teams of Agents . 2016 . cs.AI . 1612.05693.
- Ma . Hang . Li . Jiaoyang . Kumar . T. K. Satish . Koenig . Sven . Lifelong Multi-Agent Path Finding for Online Pickup and Delivery Tasks . 30 May 2017 . cs.AI . 1705.10868.
- Book: Hoenig . Wolfgang . Kumar . T. K. Satish . Cohen . Liron . Ma . Hang . Xu . Hong . Ayanian . Nora . Koenig . Sven . Multi-Agent Path Finding with Kinematic Constraints . 2016 . https://www.aaai.org/ocs/index.php/ICAPS/ICAPS16/paper/view/13183/12711 . Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-17).
- Book: Barták . Roman . Švancara . Jiří . Vlk . Marek . A scheduling-based approach to multi-agent path finding with weighted and capacitated arcs . Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems . 2018 . 748–756 . http://svancara.net/files/aamas2018.pdf.
- Andreychuk . Anton . Yakovlev . Konstantin . Surynek . Pavel . Atzmon . Dor . Stern . Roni . Multi-agent pathfinding with continuous time . Artificial Intelligence . April 2022 . 305 . 103662 . 10.1016/j.artint.2022.103662. 1901.05506 . 207791641 .
- Book: Li . Jiaoyang . Surynek . Pavel . Felner . Ariel . Ma . Hang . Kumar . T. K. Satish . Koenig . Sven . Multi-Agent Path Finding for Large Agents . Proceedings of the AAAI Conference on Artificial Intelligence . 17 July 2019 . 33 . 7627–7634 . 10.1609/aaai.v33i01.33017627. 53687939 .
- Wurman . Peter R. . D'Andrea . Raffaello . Mountz . Mick . Coordinating Hundreds of Cooperative, Autonomous Vehicles in Warehouses . AI Magazine . 20 March 2008 . 29 . 1 . 9 . 10.1609/aimag.v29i1.2082. 10475273 .
- Book: Morris . Robert . Pasareanu . Corina S. . Corina Păsăreanu . Luckow . Kasper . Malik . Waqar . Ma . Hang . Kumar . T. K. Satish . Koenig . Sven . Planning, Scheduling and Monitoring for Airport Surface Operations . AAAI Workshop: Planning for Hybrid Systems . 2016 . https://www.aaai.org/ocs/index.php/WS/AAAIW16/paper/view/12611/12430.
- Book: Veloso . Manuela . Biswas . Joydeep . Coltin . Brian . Rosenthal . Stephanie . CoBots: Robust Symbiotic Autonomous Mobile Service Robots . 2015 . https://dl.acm.org/doi/10.5555/2832747.2832901 . IJCAI'15: Proceedings of the 24th International Conference on Artificial Intelligence. 4423–4429 . 978-1-57735-738-4 .
- Ma . Hang . Yang . Jingxing . Cohen . Liron . Kumar . T. K. Satish . Koenig . Sven . 2017 . 1710.01447 . Feasibility Study: Moving Non-Homogeneous Teams in Congested Video Game Environments. Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment . 13 . 1 . 270–272 . 10.1609/aiide.v13i1.12919 .
External links