Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models Paper • 2603.07777 • Published 11 days ago • 5
Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity Paper • 2603.05168 • Published 14 days ago • 4
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective Jan 27 • 67