TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 8 days ago • 104
bartowski/google_gemma-4-26B-A4B-it-GGUF Image-Text-to-Text • 25B • Updated 3 days ago • 171k • 85
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 3 days ago • 1.07M • 229