yangzhch6

Deep Thinking

PhD Student | Reasoning with LLMs

41 followers · 65 following

Beijing, China
18:52 (UTC +08:00)
https://yangzhch6.github.io/

Achievements

Highlights

Pro

Pinned

Accordion-Thinking Public

Training LLMs to fold the reasoning process

Python 2
DARS Public

The official implementation of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"

Python 24
ReSocratic Public

OptiBench and the ReSocratic synthesis method

Python 30 1
Mirror-Critique Public

The official implementation of "Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers"

Python 8 1