RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic • Text Generation • 561B • Updated about 21 hours ago • 9.57k • 1
RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 • Text Generation • 67B • Updated about 22 hours ago • 2.03k • 2
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 • Text Generation • 8B • Updated 8 days ago • 40.5k • 30
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic • Text Generation • 8B • Updated 8 days ago • 35.2k • 9
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 • Text Generation • 8B • Updated 8 days ago • 18.3k • 20
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16 • Text Generation • 24B • Updated 8 days ago • 308 • 1
RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16 • Text Generation • 71B • Updated 8 days ago • 2.21k • 3