Li Dong

unilm

AI & ML interests

Language Model Pre-Training

Recent Activity

authored a paper about 24 hours ago

VIBEVOICE-ASR Technical Report

authored a paper about 24 hours ago

On-Policy Context Distillation for Language Models

authored a paper about 24 hours ago

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

View all activity

Organizations

authored 5 papers about 24 hours ago

VIBEVOICE-ASR Technical Report

Paper • 2601.18184 • Published Jan 26 • 23

On-Policy Context Distillation for Language Models

Paper • 2602.12275 • Published Feb 12 • 2

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

Paper • 2603.07777 • Published 11 days ago • 5

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

Paper • 2603.05168 • Published 14 days ago • 4

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 1 day ago • 43

upvoted 2 papers 1 day ago

On-Policy Context Distillation for Language Models

Paper • 2602.12275 • Published Feb 12 • 2

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 1 day ago • 43

commented a paper 1 day ago

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 1 day ago • 43 •

submitted a paper to Daily Papers 1 day ago

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 1 day ago • 43

upvoted an article 19 days ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

•

liked 2 models about 1 month ago

microsoft/VibeVoice-AcousticTokenizer

Feature Extraction • Updated Feb 6 • 514 • 12

kugelaudio/kugelaudio-0-open

Text-to-Speech • Updated Feb 6 • 34k • 178

updated a Space about 2 months ago

VibeVoice ASR

🌍

Official Playground of Microsoft VibeVoice-ASR

liked a Space about 2 months ago

VibeVoice ASR

🌍

Official Playground of Microsoft VibeVoice-ASR

published a Space about 2 months ago

VibeVoice ASR

🌍

Official Playground of Microsoft VibeVoice-ASR

upvoted a paper about 2 months ago

VIBEVOICE-ASR Technical Report

Paper • 2601.18184 • Published Jan 26 • 23

authored a paper about 2 months ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 86

liked a model about 2 months ago

microsoft/VibeVoice-ASR

Automatic Speech Recognition • 9B • Updated Jan 27 • 598k • 918

upvoted a paper about 2 months ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 86

New activity in microsoft/VibeVoice-ASR about 2 months ago

Can this model be run on a Turing GPU (No Flash Attention support)?

#1 opened about 2 months ago by

rsbdev

Li Dong

AI & ML interests

Recent Activity

Organizations

unilm's activity

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

VibeVoice ASR

VibeVoice ASR

VibeVoice ASR

Can this model be run on a Turing GPU (No Flash Attention support)?