This is the official code repository for our NeurIPS 2025 paper "Prompt Tuning Decision Transformers with Structured and Scalable Bandits". Please cite this work as
@inproceedings{rietz2025neurips,
author = {Rietz, Finn and Smirnov, Oleg and Karimi, Sara and Cao, Lele},
booktitle = {Advances in Neural Information Processing Systems},
title = {Prompt Tuning Decision Transformers with Structured and Scalable Bandits},
volume = {38},
year = {2025}
}Please see the instructions for installation and running each experiment in the corresponding directory:
bandit_regret_experimentfor the synthethic regret comparison of standard and our structured bandit for prompt tuningprompt_tune_2d_exerimentfor the 2D experimentsprompt_tune_mujoco_experimentfor the MuJoCo experiments