Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 264
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 26 days ago • 46
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published 26 days ago • 42