Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2602.22661

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28, 2025 • 45
DINGO: Constrained Inference for Diffusion LLMs

Paper • 2505.23061 • Published May 29, 2025 • 31
Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16, 2025 • 43
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Paper • 2506.14429 • Published Jun 17, 2025 • 44

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published Mar 16 • 149
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

Paper • 2603.13398 • Published Mar 11 • 154
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 119

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published Jan 30 • 39
PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 227
Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 204
BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 201

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 139
Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 183
MOSS-TTS Technical Report

Paper • 2603.18090 • Published Mar 18 • 12
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 49

Interesting Papers

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153
SkillNet: Create, Evaluate, and Connect AI Skills

Paper • 2603.04448 • Published Feb 26 • 93
Interactive Benchmarks

Paper • 2603.04737 • Published Mar 5 • 19

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153
Fara-7B: An Efficient Agentic Model for Computer Use

Paper • 2511.19663 • Published Nov 24, 2025 • 17
LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference

Paper • 2510.09665 • Published Oct 8, 2025 • 5
PersonaLive! Expressive Portrait Image Animation for Live Streaming

Paper • 2512.11253 • Published Dec 12, 2025 • 40

IndustryShapes: An RGB-D Benchmark dataset for 6D object pose estimation of industrial assembly components and tools

Paper • 2602.05555 • Published Feb 5
MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations

Paper • 2410.13790 • Published Oct 17, 2024
dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153
VGG-T^3: Offline Feed-Forward 3D Reconstruction at Scale

Paper • 2602.23361 • Published Feb 26 • 15

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published Dec 15, 2025 • 106
MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published Dec 16, 2025 • 121
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published Dec 29, 2025 • 99
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published Dec 29, 2025 • 66

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28, 2025 • 45
DINGO: Constrained Inference for Diffusion LLMs

Paper • 2505.23061 • Published May 29, 2025 • 31
Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16, 2025 • 43
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Paper • 2506.14429 • Published Jun 17, 2025 • 44

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 139
Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 183
MOSS-TTS Technical Report

Paper • 2603.18090 • Published Mar 18 • 12
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 49

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data

Paper • 2603.15594 • Published Mar 16 • 149
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

Paper • 2603.13398 • Published Mar 11 • 154
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 119

Interesting Papers

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153
SkillNet: Create, Evaluate, and Connect AI Skills

Paper • 2603.04448 • Published Feb 26 • 93
Interactive Benchmarks

Paper • 2603.04737 • Published Mar 5 • 19

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153
Fara-7B: An Efficient Agentic Model for Computer Use

Paper • 2511.19663 • Published Nov 24, 2025 • 17
LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference

Paper • 2510.09665 • Published Oct 8, 2025 • 5
PersonaLive! Expressive Portrait Image Animation for Live Streaming

Paper • 2512.11253 • Published Dec 12, 2025 • 40

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

IndustryShapes: An RGB-D Benchmark dataset for 6D object pose estimation of industrial assembly components and tools

Paper • 2602.05555 • Published Feb 5
MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations

Paper • 2410.13790 • Published Oct 17, 2024
dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153
VGG-T^3: Offline Feed-Forward 3D Reconstruction at Scale

Paper • 2602.23361 • Published Feb 26 • 15

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published Jan 30 • 39
PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 227
Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 204
BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 201

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published Dec 15, 2025 • 106
MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published Dec 16, 2025 • 121
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published Dec 29, 2025 • 99
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published Dec 29, 2025 • 66

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs