Article Introducing Daggr: Chain apps programmatically, inspect visually • 3 days ago • 66
Waypoint-1 Collection The first real time diffusion world model designed for consumer hardware • 3 items • Updated 2 days ago • 7
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 9 items • Updated 10 days ago • 206
Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face • Jul 30, 2025 • 201
Article Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio • Jul 31, 2025 • 60
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 138
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality • Mar 4, 2025 • 78
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 254
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model Paper • 2402.07827 • Published Feb 12, 2024 • 48
Naijaweb datasets 🇳🇬 Collection A recreation of the fineweb collection for Nigerians • 3 items • Updated Oct 24, 2024 • 6
OpenCulture Collection A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 132
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 56