GitHub - abj247/maicbf: [ICRA'25] Decentralized Safe and Scalable Multi-Agent Control Under Limited Actuation

Decentralized Safe and Scalable Multi-Agent Control Under Limited Actuation

Vrushabh Zinage¹ · Abhishek Jha² · Rohan Chandra³ · Efstathios Bakolas¹

¹University of Texas at Austin · ²Delhi Technological University · ³University of Virginia

[ICRA'25] Decentralized Safe and Scalable Multi-Agent Control Under Limited Actuation

Abstract:

To deploy safe and agile robots in cluttered environments, there is a need to develop fully decentralized controllers that guarantee safety, respect actuation limits, prevent deadlocks, and scale to thousands of agents. Current approaches fall short of meeting all these goals: optimization-based methods ensure safety but lack scalability, while learning-based methods scale but do not guarantee safety. We propose a novel algorithm to achieve safe and scalable control for multiple agents under limited actuation. Specifically, our approach includes: (i) learning a decentralized neural Integral Control Barrier function (neural ICBF) for scalable, input-constrained control, (ii) embedding a lightweight decentralized Model Predictive Control-based Integral Control Barrier Function (MPC-ICBF) into the neural network policy to ensure safety while maintaining scalability, and (iii) introducing a novel method to minimize deadlocks based on gradient-based optimization techniques from machine learning to address local minima in deadlocks. Our numerical simulations show that this approach outperforms state-of-the-art multi-agent control algorithms in terms of safety, input constraint satisfaction, and minimizing deadlocks. Additionally, we demonstrate strong generalization across scenarios with varying agent counts, scaling up to 1000 agents.

Installation

Create a virtual environment with Anaconda:

conda create -n maicbf python=3.6

Activate the virtual environment:

source activate maicbf

Clone this repository:

git clone https://github.com/abj247/maicbf.git

Enter the main folder and install the dependencies:

cd maicbf
pip install -r requirements.txt

Training

To train the MA-ICBF model with a specified number of agents (e.g. 4), use this command:

python train.py --num_agents 4

This will train the model with 4 agents.
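To train models for several team sizes in one go, the command above can be wrapped in a small script. The following is a minimal sketch and not part of the repository: the helper names `build_train_command` and `run_sweep` are hypothetical, the agent counts are illustrative, and the only flag assumed to exist is the `--num_agents` option shown above.

```python
import subprocess
import sys


def build_train_command(num_agents):
    """Return the argv list for one training run (uses only --num_agents)."""
    if num_agents < 1:
        raise ValueError("num_agents must be a positive integer")
    return [sys.executable, "train.py", "--num_agents", str(num_agents)]


def run_sweep(agent_counts, dry_run=True):
    """Build, and optionally launch, one training run per agent count."""
    commands = [build_train_command(n) for n in agent_counts]
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the sweep if a run exits with an error.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Dry run by default: prints the commands without starting training.
    run_sweep([4, 8, 16])
```

Run it from the repository root so that `train.py` resolves, and pass `dry_run=False` to actually start training. Whether successive runs overwrite each other's checkpoints depends on how `train.py` names its output, so check that before launching a full sweep.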

Evaluation

To evaluate the MA-ICBF model, use this command:

python eval.py --num_agents 1024 --model_path models/agile_u_max_0.2/model_ours_weight_1.0_agents_4_v_max_0.2_u_max_0.2_sigma_0.05_default_iter_69999 --env Maze --vis 1

This evaluates the model with 1024 agents, using pretrained weights from a model trained with 4 agents (see models for the pretrained weights). During the run, the script detects and resolves deadlocks, tracks collisions and resolves them for all agents using the decentralized MPC-ICBF controller, and saves the CBF data and each agent's time to goal in a CSV file. The evaluation reports the safety rate, the number of deadlocks, and the time taken to complete the simulation. When the simulation ends, the output GIF is saved. The results are printed in the format shown below:

All collisions resolved.
MPC-ICBF was triggered 2 times to resolve collisions
Steps taken by agents saved to 'steps_taken_by_agents.csv'
GIF saved at: trajectory\ma-icbf_trajectory_16_agents.gif
Evaluation Step: 1 | 1, Time: 5.7216, Deadlocked Agents: 2.0000
Total Number of Collisions : 0.0
collision tracking data saved!!!
Accuracy: [0.97641134, 0.9999202, 0.9374962, 0.9967269, 0.9942482, 2.5885205, 14.096791]
Distance Error (MA-ICBF): 0.6942
Mean Safety Rate (MA-ICBF): 1.0000
Deadlocked agents (MA-ICBF): 2.0000
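To compare runs, the summary lines of this log can be collected into a dictionary. The sketch below is an assumption-laden example rather than a utility shipped with the repository: `parse_metrics` and `METRIC_LINE` are hypothetical names, and the parser relies only on the `label: number` layout visible in the sample output above, so it may need adjusting if the log format differs.

```python
import re

# A few lines copied from the sample output above.
SAMPLE_OUTPUT = """\
Evaluation Step: 1 | 1, Time: 5.7216, Deadlocked Agents: 2.0000
Total Number of Collisions : 0.0
Distance Error (MA-ICBF): 0.6942
Mean Safety Rate (MA-ICBF): 1.0000
Deadlocked agents (MA-ICBF): 2.0000
"""

# Matches "<label> : <number>" lines, with an optional "(MA-ICBF)" tag
# between the label and the colon. Lines holding several values, lists,
# or file paths do not match and are skipped.
METRIC_LINE = re.compile(
    r"^(?P<label>[A-Za-z ]+?)\s*(?:\(MA-ICBF\))?\s*:"
    r"\s*(?P<value>-?\d+(?:\.\d+)?)\s*$"
)


def parse_metrics(text):
    """Return {label: float} for every single-value summary line in text."""
    metrics = {}
    for line in text.splitlines():
        match = METRIC_LINE.match(line.strip())
        if match:
            metrics[match.group("label").strip()] = float(match.group("value"))
    return metrics


if __name__ == "__main__":
    for label, value in parse_metrics(SAMPLE_OUTPUT).items():
        print(f"{label}: {value}")
```

In practice the text would come from the captured standard output of `eval.py` instead of the embedded sample. The per-step `Evaluation Step` line is skipped on purpose because it carries several values on one line.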

Running Baselines

The baselines folder contains the baselines used in the paper for comparison. These include the CBF-based learning approaches and the MARL algorithms used in the comparative studies.

If you find our work useful, please cite us

@article{zinage2024decentralized,
  title={Decentralized Safe and Scalable Multi-Agent Control under Limited Actuation},
  author={Zinage, Vrushabh and Jha, Abhishek and Chandra, Rohan and Bakolas, Efstathios},
  journal={arXiv preprint arXiv:2409.09573},
  year={2024}
}

Acknowledgement

This work is inspired by and builds upon macbf, the implementation of Learning Safe Multi-Agent Control with Decentralized Neural Barrier Certificates. The computational resources for this work were provided by the University of Virginia, Department of Computer Science.

Reference Links

Z. Qin, K. Zhang, Y. Chen, J. Chen, and C. Fan, “Learning safe multi-agent control with decentralized neural barrier certificates,” 2021.
I.-J. Liu, R. A. Yeh, and A. G. Schwing, “Pic: permutation invariant critic for multi-agent deep reinforcement learning,” in Conference on Robot Learning. PMLR, 2020, pp. 590–602.
S. Nayak, K. Choi, W. Ding, S. Dolan, K. Gopalakrishnan, and H. Balakrishnan, “Scalable multi-agent reinforcement learning through intelligent information aggregation,” in International Conference on Machine Learning. PMLR, 2023, pp. 25817–25833.
S. Zhang, K. Garg, and C. Fan, “Neural graph control barrier functions guided distributed collision-avoidance multi-agent control,” in Conference on Robot Learning. PMLR, 2023, pp. 2373–2392.
