LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 5 days ago • 59
SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment Paper • 2605.04012 • Published 8 days ago • 11
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 10 days ago • 152
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 13 days ago • 212
Medical Triage as Pairwise Ranking Collection A Benchmark for Urgency in Patient Portal Messages • 6 items • Updated Mar 2 • 3
CityRAG: Stepping Into a City via Spatially-Grounded Video Generation Paper • 2604.19741 • Published 22 days ago • 17
CutClaw: Agentic Hours-Long Video Editing via Music Synchronization Paper • 2603.29664 • Published Mar 31 • 48
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published Feb 20 • 30
EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots Paper • 2602.18071 • Published Feb 20 • 22
VideoWorld 2: Learning Transferable Knowledge from Real-world Videos Paper • 2602.10102 • Published Feb 10 • 14