Pull requests: radixark/miles
Pull requests list
Use new rollout function by default when corresponding flag is on
#491
opened Jan 18, 2026 by
fzyzcjy
Change max_num_tokens according to rollout_max_context_len
#487
opened Jan 17, 2026 by
fzyzcjy
Add environment variable to guard enabling the new rollout
#484
opened Jan 17, 2026 by
fzyzcjy
Add rollout-level integration test for (multi-turn, agentic) x (single-sample, multi-sample)
#483
opened Jan 17, 2026 by
fzyzcjy
Add three turn integration testing and refactor related stubs
#482
opened Jan 17, 2026 by
fzyzcjy
Support agentic rollout to generate one single sample for the whole trajectory
#481
opened Jan 17, 2026 by
fzyzcjy
Support tracing OpenAI endpoint and converting to Sample
#477
opened Jan 17, 2026 by
fzyzcjy
Support multiple output samples in addition to single sample in multi-turn
#469
opened Jan 17, 2026 by
fzyzcjy