Inference Providers
Active filters: vLLM
mistralai/Mistral-Medium-3.5-128B
128B • Updated • 50.4k
• 322
QuantTrio/Qwen3.6-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 713k
• 22
mistralai/Mistral-Medium-3.5-128B-EAGLE
Updated • 512
• 40
mistralai/Mistral-Small-4-119B-2603
119B • Updated • 67.3k
• 376
mistralai/Mistral-Small-4-119B-2603-NVFP4
Updated • 1.08k
• 88
Text Generation
• 229B • Updated • 2.95k
• 9
Image-Text-to-Text
• 10B • Updated • 234k
• 14
unsloth/Mistral-Small-4-119B-2603-GGUF
119B • Updated • 27.1k
• 67
QuantTrio/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-AWQ
Image-Text-to-Text
• 28B • Updated • 143k
• 14
QuantTrio/gemma-4-31B-it-AWQ-6Bit
Image-Text-to-Text
• 31B • Updated • 9.09k
• 9
QuantTrio/gemma-4-31B-it-AWQ
Image-Text-to-Text
• 31B • Updated • 623k
• 11
QuantTrio/MiniMax-M2.7-AWQ
Text Generation
• 229B • Updated • 32.4k
• 8
QuantTrio/Qwen3.6-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 711k
• 10
bartowski/mistralai_Mistral-Medium-3.5-128B-GGUF
Image-Text-to-Text
• 125B • Updated • 15.4k
• 7
RecViking/Mistral-Medium-3.5-128B-NVFP4
74B • Updated • 9.19k
• 4
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
• 9B • Updated • 33
• 6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
• 9B • Updated • 9
• 2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 80
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 71
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
• 15B • Updated • 7
• 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B • Updated • 185
• 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B • Updated • 387
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
• 0.6B • Updated • 357
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 21
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 2.69k
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 23
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 3.89k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 507
• 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
• 5B • Updated • 25
• 1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
• 15B • Updated • 111
• 1