-
-
-
-
-
-
Inference Providers
Active filters: 8-bit
nightmedia/Qwen3.5-122B-A10B-Text-qx85-mlx
Text Generation
• 122B • Updated
• 108
• 2
mlx-community/Qwen3.5-27B-heretic-8bit
Image-Text-to-Text
• 8B • Updated
• 2
MaziyarPanahi/TinyLlama-1.1B-Chat-v1.0-GGUF
Text Generation
• 1B • Updated
• 341
• 2
LoneStriker/Meta-Llama-3-8B-Instruct-8.0bpw-h8-exl2
Text Generation
• Updated
• 5
• 15
MaziyarPanahi/Hermes-2-Pro-Llama-3-13B-GGUF
Text Generation
• 12B • Updated
• 100
• 1
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Text Generation
• 8B • Updated
• 17.9k
• 20
MaziyarPanahi/solar-pro-preview-instruct-GGUF
Text Generation
• 22B • Updated
• 107k
• 28
Qwen/Qwen2.5-Coder-14B-Instruct-GPTQ-Int8
Text Generation
• 15B • Updated
• 8.28k
• 6
tiiuae/Falcon3-10B-Instruct-1.58bit
Text Generation
• 3B • Updated
• 848
• 22
Text Generation
• 15B • Updated
• 108k
• 6
roleplaiapp/oh-dcft-v3.1-claude-3-5-haiku-20241022-Q8_0-GGUF
Text Generation
• 8B • Updated
• 33
• 1
MaziyarPanahi/Mistral-Small-24B-Instruct-2501-GGUF
Text Generation
• 24B • Updated
• 109k
• 9
meituan/DeepSeek-R1-Channel-INT8
Text Generation
• 685B • Updated
• 614
• 32
MaziyarPanahi/gemma-3-12b-it-GGUF
Text Generation
• 12B • Updated
• 111k
• 16
MaziyarPanahi/Qwen3-0.6B-GGUF
Text Generation
• 0.8B • Updated
• 180k
• 10
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated
• 1.31k
• 4
nvidia/DeepSeek-R1-0528-NVFP4
Text Generation
• 397B • Updated
• 16.2k
• 42
darkshapes/comma-v0.1-2t-MLX-Q8
Text Generation
• 7B • Updated
• 7
• 1
NVFP4/Polaris-4B-Preview-FP4
2B • Updated
• 13
• 1
Text Generation
• 33B • Updated
• 335
• 2
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
• 16B • Updated
• 57.9k
• 24
nvidia/DeepSeek-R1-0528-NVFP4-v2
Text Generation
• 394B • Updated
• 122k
• 15
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4
Text Generation
• 16B • Updated
• 1.31k
• 12
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-GPTQ-Int8
Text Generation
• 31B • Updated
• 4.24k
• 8
yasserrmd/gpt-oss-coder-20b
Text Generation
• 12B • Updated
• 194
• 12
Bellesteck/Qwen3-30B-A3B-NVFP4-vLLM
17B • Updated
• 5
• 2
nightmedia/LIMI-Air-qx86-hi-mlx
Text Generation
• 107B • Updated
• 18
• 3
AngelSlim/Qwen3-32B_nvfp4
19B • Updated
• 3
• 2
Text Generation
• 199B • Updated
• 364
• 5
nightmedia/UIGEN-FX-Agentic-32B-qx86-hi-mlx
Text Generation
• 33B • Updated
• 15
• 1