Speculative Pipeline Decoding: Higher-Accruacy and Zero-Bubble Speculation via Pipeline Parallelism Paper • 2605.30852 • Published 4 days ago • 3
ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models Paper • 2605.18879 • Published 13 days ago • 6
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 6 days ago • 81
Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling Paper • 2605.27030 • Published 7 days ago • 29
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 11 days ago • 212
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories Paper • 2605.21468 • Published 13 days ago • 49
The Unlearnability Phenomenon in RLVR for Language Models Paper • 2605.16787 • Published 17 days ago • 6
G-Zero: Self-Play for Open-Ended Generation from Zero Data Paper • 2605.09959 • Published 22 days ago • 17
G-Zero: Self-Play for Open-Ended Generation from Zero Data Paper • 2605.09959 • Published 22 days ago • 17
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 25 days ago • 69
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 25 days ago • 69
Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation Paper • 2602.03689 • Published Feb 3
Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration Paper • 2605.05566 • Published 26 days ago • 37
SkillOS: Learning Skill Curation for Self-Evolving Agents Paper • 2605.06614 • Published 26 days ago • 46
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction Paper • 2605.05242 • Published about 1 month ago • 119
Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration Paper • 2605.05566 • Published 26 days ago • 37
Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration Paper • 2605.05566 • Published 26 days ago • 37