Active filters: modelopt
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4
Text Generation
• 17B • Updated • 46
• 1
mdavidson83/Qwen3-Embedding-4B_nvfp4_hf
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K
Text Generation
• 17B • Updated • 305
• 1
Image-Text-to-Text
• 13B • Updated • 23
lukealonso/MiniMax-M2-NVFP4
115B • Updated • 48
• 14
Text Generation
• 7B • Updated • 2
• 1
leatan95/Tongyi-DeepResearch-30B-A3B-NVFP4
16B • Updated • 1
DataSnake/Wayfarer-12B-NVFP4
Text Generation
• 7B • Updated • 2
• 1
DataSnake/Wayfarer-2-12B-NVFP4
Text Generation
• 7B • Updated • 5
• 2
Ex0bit/OLMo-3-7B-Instruct-NVFP4-1M
Text Generation
• 4B • Updated • 4
• 2
wangqia0309/Captain-Eris_Violet-V0.420-12B-FP8-KV-modelopt
12B • Updated • 3
rahtml/Qwen3-Coder-30B-A3B-Instruct-NVFP4
16B • Updated • 1
nvidia/Kimi-K2-Thinking-NVFP4
Text Generation
• Updated • 11.9k
• 30
zhuyksir/qwen3_30b_a3b_nvfp4_baseline
16B • Updated
zhuyksir/qwen3_30b_a3b_nvfp4_qat
16B • Updated
alphatozeta/sglang_glm_4_6_fp4_modelopt
177B • Updated • 1
ericlewis/Nemotron-Orchestrator-8B-NVFP4
Text Generation
• 5B • Updated • 7
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
• Updated • 12.9k
• 39
trithemius/Velvet-14B-nvfp4
8B • Updated • 11
OPENZEKA/Qwen3-4B-Instruct-2507-NVFP4
2B • Updated • 238
Z841973620/Qwen3-30B-A3B-NVFP4
Text Generation
• 16B • Updated
Z841973620/Qwen3-30B-A3B-FP8
Text Generation
• 31B • Updated • 2
OPENZEKA/Qwen3-Coder-30B-A3B-Instruct-NVFP4
Text Generation
• 16B • Updated • 565
josephdowling10/Mixtral-8x7B-Instruct-v0.1-NVFP4
Text Generation
• 23B • Updated • 50
taharmasmaliyev07/Llama-2-7b-hf-fp8
7B • Updated • 1
OPENZEKA/Qwen3-Coder-480B-A35B-Instruct-NVFP4
241B • Updated • 6
Shifusen/Llama-3.3-70B-Instruct-abliterated-NVFP4-modelopt
36B • Updated • 64
taharmasmaliyev07/Mistral-7B-v0.1-fp8
7B • Updated
taharmasmaliyev07/Llama-3.1-8B-fp8
8B • Updated
taharmasmaliyev07/gemma-2-9b-it-fp8
9B • Updated • 1