CASA - a kyutai Collection

Updated Mar 9

CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion on long-context streaming inputs

CASA Gallery

Space • Running • 3
Video Gallery for CASA: Cross-Attention over Self-Attention

CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion

Paper • 2512.19535 • Published Dec 22, 2025 • 12

kyutai/CASA-Helium1-VL-2B

Image-Text-to-Text • 3B • Updated Mar 9 • 28 • 8

kyutai/CASA-Qwen2_5-VL-3B

Image-Text-to-Text • 4B • Updated Dec 23, 2025 • 127 • 2

kyutai/CASA-Qwen2_5-VL-3B-LiveCC

Video-Text-to-Text • 4B • Updated Dec 23, 2025 • 81 • 4

kyutai/Helium1-VL-2B

Image-Text-to-Text • 3B • Updated Dec 23, 2025 • 22 • 1