Active filters: redhat
BCCard/Qwen3-32B-FP8-Dynamic • Text Generation • 33B • Updated • 2 • 1
BCCard/Qwen3-30B-A3B-FP8-Dynamic • Text Generation • 31B • Updated • 133
Text Generation • 15B • Updated • 190 • 1
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8 • Image-Text-to-Text • 402B • Updated • 65 • 2
RedHatTraining/AI296-m3diterraneo-hotels • 8B • Updated • 6 • 1
RedHatAI/DeepSeek-R1-0528-quantized.w4a16 • Text Generation • 104B • Updated • 152 • 12
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16 • Image-Text-to-Text • 59B • Updated • 359 • 1
Image-Text-to-Text • 109B • Updated • 4
RedHatAI/Kimi-K2-Instruct-quantized.w4a16 • Text Generation • 146B • Updated • 194 • 12
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted • Text Generation • 1.0B • Updated • 1
RedHatAI/SmolLM3-3B-quantized.w4a16 • 0.9B • Updated • 22 • 1
Text-to-Image • Updated • 2
RedHatAI/Devstral-Small-2507-FP8-Dynamic • Text Generation • 24B • Updated • 9 • 4
RedHatAI/Devstral-Small-2507-quantized.w8a8 • Text Generation • 24B • Updated • 26 • 1
RedHatAI/Devstral-Small-2507-quantized.w4a16 • Text Generation • 4B • Updated • 61 • 1
RedHatAI/Qwen3-14B-speculator.eagle3 • Text Generation • 1B • Updated • 174
RedHatAI/Qwen3-32B-speculator.eagle3 • Text Generation • 2B • Updated • 786 • 4
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3 • Text Generation • 2B • Updated • 743 • 1
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3 • Text Generation • 1.0B • Updated • 9.07k • 1
RedHatAI/Qwen3-8B-speculator.eagle3 • Text Generation • 1B • Updated • 57.7k
RedHatAI/Qwen3-235B-A22B-Instruct-2507-speculator.eagle3 • Text Generation • 1B • Updated • 638
ChibuUkachi/Qwen3-4B-Instruct-2507.w4a16 • Text Generation • 1B • Updated • 82
inference-optimization/Qwen3-4B-Thinking-2507.w4a16 • Text Generation • 1B • Updated • 9
inference-optimization/Qwen3-4B-Instruct-2507.w4a16 • Text Generation • 1B • Updated • 3
inference-optimization/Qwen3-30B-A3B-Thinking-2507.w4a16 • Text Generation • 5B • Updated • 3
RedHatAI/Qwen3-30B-A3B-Instruct-2507.w4a16 • Text Generation • 5B • Updated • 781
RedHatAI/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16 • Text Generation • Updated • 343 • 1
RedHatAI/Qwen3-30B-A3B-Instruct-2507-speculator.eagle3 • Text Generation • 0.5B • Updated • 94.3k • 1
RedHatAI/Qwen3-Next-80B-A3B-Thinking-quantized.w4a16 • Text Generation • Updated • 107
inference-optimization/Ministral-3-14B-Instruct-2512-FP8-dynamic • Text Generation • 14B • Updated • 179