Yao

distant-yuan

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

upvoted a paper 9 days ago

Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments

upvoted a paper 9 days ago

VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

View all activity

Organizations

None yet

upvoted a paper 8 days ago

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

Paper • 2605.28424 • Published 10 days ago • 31

upvoted 2 papers 9 days ago

Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments

Paper • 2605.27209 • Published 11 days ago • 16

VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

Paper • 2605.27141 • Published 11 days ago • 19

upvoted a paper 15 days ago

Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles

Paper • 2605.22177 • Published 16 days ago • 21

upvoted a paper 22 days ago

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 23 days ago • 111

New activity in ChilleD/WebHarbor 23 days ago

feat(phys_org): add Phys.org mirror tarball

#7 opened 23 days ago by

distant-yuan

feat(phys_org): add Phys.org mirror tarball

#6 opened 23 days ago by

distant-yuan

updated a dataset 23 days ago

distant-yuan/WebHarbor

Updated 23 days ago • 70

upvoted a paper 25 days ago

Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning

Paper • 2605.10923 • Published 26 days ago • 13

upvoted a paper about 2 months ago

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published Apr 9 • 47

upvoted 2 papers 2 months ago

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published Apr 2 • 101

V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts

Paper • 2603.10848 • Published Mar 11 • 16

upvoted a paper 4 months ago

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published Feb 2 • 140

authored a paper 4 months ago

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

upvoted 6 papers 4 months ago

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

Paper • 2601.21468 • Published Jan 29 • 25

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 181

Yao

AI & ML interests

Recent Activity

Organizations

distant-yuan's activity

feat(phys_org): add Phys.org mirror tarball

feat(phys_org): add Phys.org mirror tarball