R4P-Mini-SE

This repository contains the official implementation of the paper: "Scalable Supervising Software Agents with Patch Reasoner".

In this paper, we explore a reasoning-based patch verification strategy to provide scalable supervision for software engineering agents. This approach (1) mitigates data scarcity caused by test quality requirements in open-source codebases, (2) removes the need for environment setup and makes data expansion costless, and (3) greatly reduces computational overhead compared to heavy test execution. We aim to leverage such imperfect yet easily scalable supervision to enhance model capability even after high-quality test data is exhausted.

Setup

Our framework is based on Verl. To install our environment, please refer to the Verl repo.

Data

You can find our training and testing data here. Please create the datasets directory and save them to datasets.

info_xxx.parquet: Original data without prompt.
data_xxx.parquet: Data with prompt. To convert an info parquet into a data parquet, use verl_utils/data/data_proc.py.
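
The naming convention above pairs each info_xxx.parquet with a data_xxx.parquet. As a minimal sketch, and not part of the repository, the helper below creates the datasets directory if needed and lists the info files that do not yet have a data counterpart. The function name `unprocessed_info_files` is hypothetical, and the assumption that the two files share the same `xxx` suffix is inferred only from the names listed above.

```python
from pathlib import Path


def unprocessed_info_files(datasets_dir="datasets"):
    """Return names of info_*.parquet files lacking a data_*.parquet twin.

    Hypothetical helper: it only inspects file names and never reads
    or writes parquet content.
    """
    root = Path(datasets_dir)
    # Create the directory if it is missing, as the instructions above ask.
    root.mkdir(parents=True, exist_ok=True)

    missing = []
    for info_path in sorted(root.glob("info_*.parquet")):
        # Assumed pairing: info_<suffix>.parquet <-> data_<suffix>.parquet
        suffix = info_path.name[len("info_"):]
        data_path = info_path.with_name("data_" + suffix)
        if not data_path.exists():
            missing.append(info_path.name)
    return missing
```

Calling `unprocessed_info_files()` from the repository root returns the info files that still need to be passed through verl_utils/data/data_proc.py. How that script is invoked is not documented here, so check the script itself for its arguments.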

Run

You can find our scripts in verl_utils/scripts:

r4p.sh: Training and testing scripts of R4P.
minise.sh: Training and testing scripts of Mini-SE.
eval.sh: Standalone evaluation script for R4P.
tts.sh: Evaluation scripts of Mini-SE (test-time patch selection).
setup.sh: Script for serving R4P. You may need to adjust MODEL_PATH in verl_utils/model_server.py and SERVER_URL in verl_utils/reward/model_client.py.

Before training Mini-SE, please run verl_utils/data/env_init.py to check out the repositories first.
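
To illustrate what test-time patch selection (the purpose listed for tts.sh) can look like in general, here is a generic best-of-N sketch. It is not the repository's implementation: the function `select_patch` and the `score` callable are hypothetical, and `score` merely stands in for whatever query is sent to the served R4P model, whose actual interface is defined in verl_utils/reward/model_client.py and is not reproduced here.

```python
from typing import Callable, Sequence


def select_patch(issue: str, patches: Sequence[str],
                 score: Callable[[str, str], float]) -> str:
    """Return the candidate patch with the highest verifier score.

    `score(issue, patch)` is a hypothetical stand-in for a call to a
    patch verifier; its signature is an assumption for illustration.
    """
    if not patches:
        raise ValueError("need at least one candidate patch")
    # max() returns the first maximal item, so ties resolve to the
    # earliest candidate and selection stays deterministic.
    return max(patches, key=lambda patch: score(issue, patch))
```

With a toy scorer such as `lambda issue, patch: float(len(patch))`, the function simply returns the longest candidate. A real scorer would return the verifier's judgment of whether the patch resolves the issue.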

Citation

@article{xu2025scalable,
  title={Scalable Supervising Software Agents with Patch Reasoner},
  author={Xu, Junjielong and Tan, Boyin and Liu, Xiaoyuan and Peng, Chao and Gao, Pengfei and He, Pinjia},
  journal={arXiv preprint arXiv:2510.22775},
  year={2025}
}
