Active filters: gptq
openbmb/MiniCPM-V-4.6-Thinking-GPTQ
Image-Text-to-Text
• 1B • Updated • 423
• 4
LordNeel/DeepSeek-V4-Flash-Acti-MTP-W4A16-FP8
Text Generation
• 44B • Updated • 1.3k
• 4
openbmb/MiniCPM-V-4.6-GPTQ
Image-Text-to-Text
• 1B • Updated • 477
• 3
shieldstar/Qwen3.5-122B-A10B-int4-AutoRound-EC
Image-Text-to-Text
• 21B • Updated • 4.52k
• 5
pastapaul/DeepSeek-V4-Flash-W4A16-FP8
44B • Updated • 6.34k
• 8
AxisQuant/Qwen3.6-27b-gptq-int4
Text Generation
• 27B • Updated • 65
• 2
openbmb/MiniCPM3-4B-GPTQ-Int4
Text Generation
• 4B • Updated • 154
• 12
openbmb/MiniCPM-o-2_6-int4
Any-to-Any
• Updated • 781
• 53
empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit
Text Generation
• 33B • Updated • 1.18k
• 13
Qwen/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 1.49k
• 9
openbmb/MiniCPM4-8B-Eagle-FRSpec-QAT-cpmcu
Text Generation
• Updated • 56
• 11
openbmb/MiniCPM4-8B-marlin-Eagle-vLLM
Text Generation
• Updated • 42
• 7
openbmb/MiniCPM4-8B-marlin-vLLM
Text Generation
• 8B • Updated • 44
• 7
openbmb/MiniCPM4-8B-marlin-cpmcu
Text Generation
• Updated • 38
• 8
openbmb/MiniCPM4-0.5B-QAT-Int4-GPTQ-format
Text Generation
• 0.5B • Updated • 500
• 3
openbmb/MiniCPM4.1-8B-GPTQ
Text Generation
• 8B • Updated • 953
• 1
openbmb/MiniCPM4.1-8B-Marlin
Text Generation
• Updated • 62
• 2
Qwen/Qwen3.5-27B-GPTQ-Int4
Image-Text-to-Text
• 28B • Updated • 445k
• 54
Qwen/Qwen3.5-35B-A3B-GPTQ-Int4
Image-Text-to-Text
• 36B • Updated • 1.32M
• 83
RafaDom/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-v2-GPTQ-Int4-HQ
4B • Updated • 100
• 1
bbarn4/medgemma-27b-text-it-GPTQ
Text Generation
• 5B • Updated • 2.88k
• 1
ebircak/gemma-4-31B-it-4bit-W4A16-GPTQ
Text Generation
• 32B • Updated • 22.2k
• 3
palmfuture/Qwen3.6-35B-A3B-GPTQ-Int4
Image-Text-to-Text
• 36B • Updated • 153k
• 17
groxaxo/Qwen3.6-27B-GPTQ-Pro-4bit
Image-Text-to-Text
• 27B • Updated • 156k
• 35
raydelossantos/Qwen3.6-27B-GPTQ-Int4
Text Generation
• 28B • Updated • 19.3k
• 3
Sociopacific/Qwen3.6-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-GPTQ-Int4
Text Generation
• 35B • Updated • 173
• 1
onnx-community/PaddleOCR-VL-1.5-ONNX
Updated • 140
• 2
llmfan46/Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved-GPTQ-Int4
Image-Text-to-Text
• 36B • Updated • 2.05k
• 2
elinas/alpaca-13b-lora-int4
Text Generation
• Updated • 11
• 40
elinas/alpaca-30b-lora-int4
Text Generation
• Updated • 12
• 68