Inference Providers
Active filters: chat
microsoft/bitnet-b1.58-2B-4T
Text Generation
• Updated • 14.6k
• 1.39k
Text Generation
• 8B • Updated • 21.3M
• • 1.14k
Qwen/Qwen2.5-1.5B-Instruct
Text Generation
• 2B • Updated • 8.89M
• • 643
Qwen/Qwen2.5-Coder-7B-Instruct-GGUF
Text Generation
• 8B • Updated • 99.2k
• 212
NousResearch/Hermes-3-Llama-3.1-8B
Text Generation
• 8B • Updated • 382k
• • 398
Qwen/Qwen2.5-Coder-14B-Instruct-GGUF
Text Generation
• 15B • Updated • 56.5k
• 103
DataPilot/ArrowCanaria-Llama-8B-SFT-v0.1
Text Generation
• 8B • Updated • 6
• 5
Qwen/Qwen2.5-72B-Instruct
Text Generation
• 73B • Updated • 772k
• • 920
Qwen/Qwen2.5-Coder-3B-Instruct-GGUF
Text Generation
• 3B • Updated • 37.5k
• 65
unsloth/Phi-4-mini-instruct-GGUF
Text Generation
• 4B • Updated • 22.8k
• 84
microsoft/bitnet-b1.58-2B-4T-gguf
Text Generation
• 2B • Updated • 28.8k
• 248
ValiantLabs/Qwen3.5-27B-Guardpoint
Image-Text-to-Text
• 28B • Updated • 191
• 9
NousResearch/Hermes-3-Llama-3.1-405B
Text Generation
• Updated • 152
• 263
Qwen/Qwen2.5-0.5B-Instruct
Text Generation
• 0.5B • Updated • 7.08M
• 485
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation
• 8B • Updated • 2.48M
• • 669
NousResearch/Hermes-4-14B
Text Generation
• 425k • Updated • 3.82k
• 124
DavidAU/OpenAi-GPT-oss-20b-HERETIC-uncensored-NEO-Imatrix-gguf
Text Generation
• 21B • Updated • 35k
• 118
NousResearch/Hermes-3-Llama-3.1-8B-GGUF
8B • Updated • 8.26k
• 138
Qwen/Qwen2.5-14B-Instruct
Text Generation
• Updated • 2.01M
• • 323
bartowski/Qwen2.5-7B-Instruct-GGUF
Text Generation
• 8B • Updated • 75.6k
• 48
Qwen/Qwen2.5-7B-Instruct-AWQ
Text Generation
• Updated • 667k
• 39
Qwen/Qwen2.5-0.5B-Instruct-GGUF
Text Generation
• 0.6B • Updated • 66.9k
• 81
Qwen/Qwen2.5-3B-Instruct-GGUF
Text Generation
• 3B • Updated • 378k
• 90
Qwen/Qwen2.5-7B-Instruct-GGUF
Text Generation
• 8B • Updated • 53.2k
• 132
Text Generation
• 3B • Updated • 7.7M
• 417
bartowski/Qwen2.5-14B-Instruct-GGUF
Text Generation
• 15B • Updated • 29.9k
• 53
bartowski/Qwen2.5-0.5B-Instruct-GGUF
Text Generation
• 0.5B • Updated • 7.76k
• 13
QuantFactory/Qwen2.5-7B-Instruct-abliterated-v2-GGUF
Text Generation
• 8B • Updated • 4.83k
• 7
bartowski/Qwen2.5.1-Coder-7B-Instruct-GGUF
Text Generation
• 8B • Updated • 5.91k
• 103
mlx-community/Qwen2.5-Coder-32B-Instruct-4bit
Text Generation
• Updated • 1.39k
• 12