Who Plays First?
Optimizing the Order of Play in Stackelberg Games with Many Robots
Coordinating Delivery Mini-trucks
We consider the multi-agent spatial navigation problem of computing the socially optimal order of play, i.e., the sequence in which the agents commit to their decisions, and its associated equilibrium in an N-player Stackelberg trajectory game. We model this problem as a mixed-integer optimization problem over the space of all possible Stackelberg games associated with the order of play’s permutations.
To solve the problem, we introduce Branch and Play (B&P), an efficient and exact algorithm that provably converges to a socially optimal order of play and its Stackelberg equilibrium. As a subroutine for B&P, we employ and extend sequential trajectory planning, i.e., a popular multi-agent control approach, to scalably compute valid local Stackelberg equilibria for any given order of play.
We demonstrate the practical utility of B&P to coordinate air traffic control, swarm formation, and delivery vehicle fleets. We find that B&P consistently outperforms various baselines, and computes the socially optimal equilibrium.
Air Traffic Management
Quadrotor Formation Control
@inproceedings{hu2024plays,
author = {Haimin Hu and Gabriele Dragotto and Zixu Zhang and Kaiqu Liang and Bartolomeo Stellato and Jaime F. Fisac},
title = {Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots},
booktitle = {Proceedings of Robotics: Science and Systems},
year = {2024}}
Citation
Authors
*H. Hu and G. Dragotto contributed equally.
This work is supported in part by the Princeton DataX Grant.