Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

239

Base only

Active filters: RL

Jackrong/Qwopus3.5-4B-Coder-MTP-GGUF

Image-Text-to-Text • Updated 21 days ago • 27.6k • 51

Jackrong/Qwopus3.5-4B-Coder-GGUF

Image-Text-to-Text • 4B • Updated 21 days ago • 21.9k • 24

NousResearch/DeepHermes-ToolCalling-Specialist-Atropos

Reinforcement Learning • 8B • Updated Apr 28, 2025 • 9 • 19

nvidia/Nemotron-Cascade-2-30B-A3B

Text Generation • 32B • Updated May 1 • 50.7k • 504

bartowski/nvidia_Nemotron-Cascade-2-30B-A3B-GGUF

Text Generation • 32B • Updated Mar 22 • 9.14k • 36

erreursyntax/DeepHermes-Egregore-v1-RLAIF-8b-Atropos

Reinforcement Learning • 8B • Updated 22 days ago • 20 • 1

mradermacher/DeepHermes-Egregore-v1-RLAIF-8b-Atropos-GGUF

Reinforcement Learning • 8B • Updated 21 days ago • 743 • 1

mradermacher/DeepHermes-Egregore-v1-RLAIF-8b-Atropos-i1-GGUF

Reinforcement Learning • 8B • Updated 21 days ago • 2.29k • 1

stanfordnlp/SteamSHP-flan-t5-xl

Updated Oct 10, 2023 • 7 • 43

stanfordnlp/SteamSHP-flan-t5-large

Updated Oct 10, 2023 • 313 • 33

SultanR/SmolTulu-1.7b-Reinforced

Text Generation • 2B • Updated Dec 17, 2024 • 7 • 5

mradermacher/SmolTulu-1.7b-Reinforced-GGUF

2B • Updated Dec 18, 2024 • 165

Daemontatox/Llama3.3-70B-CogniLink

Text Generation • 71B • Updated Jun 21, 2025 • 24 • • 3

mradermacher/Llama3.3-70B-CogniLink-GGUF

Text Generation • 71B • Updated Jun 22, 2025 • 51

mradermacher/Llama3.3-70B-CogniLink-i1-GGUF

Text Generation • 71B • Updated Jun 22, 2025 • 191

JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora

Reinforcement Learning • Updated Jan 22, 2025

JHuel/Mistral-Nemo-Instruct-2407_ORPO

Text Generation • Updated Jan 22, 2025

Ihor/Text2Graph-R1-Qwen2.5-0.5b

Text Generation • 0.5B • Updated Aug 18, 2025 • 116 • • 24

tecosys/Nutaan-RL1

Reinforcement Learning • Updated Feb 7, 2025 • 2

mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF

0.5B • Updated Aug 18, 2025 • 64 • 1

mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF

0.5B • Updated Aug 18, 2025 • 360 • 1

mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF

0.5B • Updated Feb 22, 2025 • 71

Daemontatox/Zireal-0

Text Generation • 684B • Updated Jul 1, 2025 • 43 • 1

mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF

0.5B • Updated Jul 31, 2025 • 65

Lyte/QuadConnect2.5-0.5B-v0.0.9b

Text Generation • 0.5B • Updated Feb 27, 2025 • 89 •

mradermacher/QuadConnect2.5-0.5B-v0.0.9b-GGUF

0.5B • Updated Jul 31, 2025 • 87

Lyte/QuadConnect2.5-1.5B-v0.1.0b

Text Generation • 2B • Updated Feb 28, 2025 • 59 • • 1

mradermacher/QuadConnect2.5-1.5B-v0.1.0b-GGUF

2B • Updated Mar 1, 2025 • 61 • 1

mradermacher/Zireal-0-GGUF

Updated Jul 31, 2025 • 1

mradermacher/Magellanic-Qwen-25B-R999-GGUF

25B • Updated Mar 5, 2025 • 30 • 1