Active filters: RL
nvidia/Nemotron-Cascade-2-30B-A3B
Text Generation
• 32B • Updated • 318k
• 476
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos
Reinforcement Learning
• 8B • Updated • 44
• 16
mlx-community/Nemotron-Cascade-2-30B-A3B-4bit
Text Generation
• 32B • Updated • 4.75k
• 17
mlx-community/Nemotron-Cascade-2-30B-A3B-8bit
Text Generation
• 32B • Updated • 2.22k
• 6
mlx-community/Nemotron-Cascade-2-30B-A3B-mlx-6bit
Text Generation
• 32B • Updated • 758
• 2
stanfordnlp/SteamSHP-flan-t5-xl
Updated • 15
• 43
stanfordnlp/SteamSHP-flan-t5-large
Updated • 101
• 33
SultanR/SmolTulu-1.7b-Reinforced
Text Generation
• 2B • Updated • 11
• 5
mradermacher/SmolTulu-1.7b-Reinforced-GGUF
2B • Updated • 55
Daemontatox/Llama3.3-70B-CogniLink
Text Generation
• 71B • Updated • 63
• 3
mradermacher/Llama3.3-70B-CogniLink-GGUF
Text Generation
• 71B • Updated • 164
mradermacher/Llama3.3-70B-CogniLink-i1-GGUF
Text Generation
• 71B • Updated • 215
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora
Reinforcement Learning
• Updated
JHuel/Mistral-Nemo-Instruct-2407_ORPO
Text Generation
• Updated
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
• 0.5B • Updated • 79
• 24
Reinforcement Learning
• Updated • 1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
0.5B • Updated • 219
• 1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
0.5B • Updated • 279
• 1
mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF
0.5B • Updated • 98
Text Generation
• 684B • Updated • 143
• 1
mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF
0.5B • Updated • 43
Lyte/QuadConnect2.5-0.5B-v0.0.9b
Text Generation
• 0.5B • Updated • 19
mradermacher/QuadConnect2.5-0.5B-v0.0.9b-GGUF
0.5B • Updated • 190
Lyte/QuadConnect2.5-1.5B-v0.1.0b
Text Generation
• 2B • Updated • 31
• 1
mradermacher/QuadConnect2.5-1.5B-v0.1.0b-GGUF
2B • Updated • 82
• 1
mradermacher/Zireal-0-GGUF
mradermacher/Magellanic-Qwen-25B-R999-GGUF
25B • Updated • 222
• 1
mradermacher/Magellanic-Qwen-25B-R999-i1-GGUF
25B • Updated • 77
• 1
VaidikML0508/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-SFT-DPO-4bits-V1
Text Generation
• 3B • Updated • 2
Teen-Different/squiral_maze
Reinforcement Learning
• Updated