WangZilin's picture

2

WangZilin

terr1ble

·

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 4 months ago

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration

Paper • 2508.13755 • Published Aug 19, 2025 • 14

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 118