Prompt Tuning Decision Transformers with Structured and Scalable Bandits

This is the official code repository for our NeurIPS 2025 paper "Prompt Tuning Decision Transformers with Structured and Scalable Bandits". Please cite this work as

@inproceedings{rietz2025neurips,
 author = {Rietz, Finn and Smirnov, Oleg and Karimi, Sara and Cao, Lele},
 booktitle = {Advances in Neural Information Processing Systems},
 title = {Prompt Tuning Decision Transformers with Structured and Scalable Bandits},
 volume = {38},
 year = {2025}
}

Instructions

Please see the instructions for installation and running each experiment in the corresponding directory:

bandit_regret_experiment for the synthethic regret comparison of standard and our structured bandit for prompt tuning
prompt_tune_2d_exeriment for the 2D experiments
prompt_tune_mujoco_experiment for the MuJoCo experiments

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
bandit_regret_experiment		bandit_regret_experiment
prompt_tune_2d_experiment		prompt_tune_2d_experiment
prompt_tune_mujoco_experiment		prompt_tune_mujoco_experiment
AUTHORS		AUTHORS
CONTRIBUTORS		CONTRIBUTORS
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Prompt Tuning Decision Transformers with Structured and Scalable Bandits

Instructions

About

Uh oh!

Releases

Packages

Languages

License

king/pdt-bandits

Folders and files

Latest commit

History

Repository files navigation

Prompt Tuning Decision Transformers with Structured and Scalable Bandits

Instructions

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages