Article Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR 14 days ago • 63
microsoft/Phi-3-mini-4k-instruct-gguf Text Generation • 4B • Updated Dec 10, 2025 • 40.5k • 552
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails Paper • 2502.05163 • Published Feb 7, 2025 • 22
intfloat/e5-mistral-7b-instruct Feature Extraction • 7B • Updated Apr 23, 2024 • 93.9k • 554
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 Jul 5, 2024 • 310
meta-llama/Llama-3.2-11B-Vision Image-Text-to-Text • 11B • Updated Sep 27, 2024 • 9.09k • 578