Inference Providers
Active filters: vLLM
mistralai/Mistral-Small-4-119B-2603
119B • Updated • 29.2k
• 389
mistralai/Mistral-Medium-3.5-128B
128B • Updated • 315k
• 351
mistralai/Mistral-Medium-3.5-128B-EAGLE
Updated • 208
• 45
unsloth/Mistral-Small-4-119B-2603-GGUF
119B • Updated • 6.73k
• 71
bartowski/mistralai_Mistral-Small-4-119B-2603-GGUF
Image-Text-to-Text
• 119B • Updated • 750
• 12
Text Generation
• 754B • Updated • 1.42k
• 9
QuantTrio/Qwen3.6-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 386k
• 14
QuantTrio/Qwen3.6-27B-AWQ-6Bit
Image-Text-to-Text
• 28B • Updated • 11.8k
• 11
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
• 9B • Updated • 52
• 6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
• 9B • Updated • 2
• 2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 57
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 54
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
• 15B • Updated • 1
• 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B • Updated • 104
• 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B • Updated • 285
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
• 0.6B • Updated • 179
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 4
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 109
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 5
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 17.1k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 86
• 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
• 5B • Updated • 15
• 1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
• 15B • Updated • 57
• 1
JunHowie/Qwen3-14B-GPTQ-Int4
Text Generation
• 15B • Updated • 2.67k
• 4
JunHowie/Qwen3-8B-GPTQ-Int8
Text Generation
• 8B • Updated • 1.61k
JunHowie/Qwen3-8B-GPTQ-Int4
Text Generation
• 8B • Updated • 131
• 4
JunHowie/Qwen3-4B-GPTQ-Int4
Text Generation
• 4B • Updated • 1.66k
• 1
JunHowie/Qwen3-4B-GPTQ-Int8
Text Generation
• 4B • Updated • 13
JunHowie/Qwen3-30B-A3B-GPTQ-Int8
Text Generation
• 8B • Updated • 31
QuantTrio/Qwen3-235B-A22B-GPTQ-Int8
Text Generation
• 235B • Updated • 12