6 1

liuyixiu

liuyx0903

AI & ML interests

None yet

Recent Activity

published a model about 6 hours ago

SII-GAIR-NLP/davinci-llm-model

updated a model about 7 hours ago

SII-GAIR-NLP/davinci-llm-model

upvoted a paper 3 days ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

View all activity

Organizations

published a model about 6 hours ago

SII-GAIR-NLP/davinci-llm-model

3B • Updated about 4 hours ago

updated a model about 7 hours ago

SII-GAIR-NLP/davinci-llm-model

3B • Updated about 4 hours ago

upvoted a paper 3 days ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 3 days ago • 106

upvoted a paper about 2 months ago

daVinci-Dev: Agent-native Mid-training for Software Engineering

Paper • 2601.18418 • Published Jan 26 • 126

upvoted a paper 3 months ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published Dec 29, 2025 • 66

liked a Space 5 months ago

The Smol Training Playbook

📚

3.06k

The secrets to building world-class LLMs

upvoted a paper 5 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 85

upvoted a paper 9 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25, 2025 • 47

updated a model 10 months ago

liuyx0903/xf

8B • Updated May 24, 2025

published a model 10 months ago

liuyx0903/xf

8B • Updated May 24, 2025

upvoted a paper 10 months ago

Efficient Agent Training for Computer Use

Paper • 2505.13909 • Published May 20, 2025 • 44

updated 2 models over 1 year ago

GAIR/Safety-J-v5

Feature Extraction • 8B • Updated Jul 15, 2024 • 3 • 1

GAIR/Safety-J-v1

Feature Extraction • 8B • Updated Jul 15, 2024 • 1

liuyixiu

AI & ML interests

Recent Activity

Organizations

liuyx0903's activity

The Smol Training Playbook