dataset-lang
HuggingFaceFW/fineweb • Updated Jul 11, 2025 • 52.5B • 187k • 2.6k
google/smol • Updated Oct 31, 2025 • 798k • 3.19k • 80
Language Models
deepseek-ai/DeepSeek-R1 • Text Generation • 685B • Updated Mar 27, 2025 • 439k • 12.9k
facebook/opt-125m • Text Generation • Updated Sep 15, 2023 • 3.52M • 229
meta-llama/Llama-3.3-70B-Instruct • Text Generation • 71B • Updated Dec 21, 2024 • 307k • 2.62k
meta-llama/Llama-3.1-8B-Instruct • Text Generation • 8B • Updated Sep 25, 2024 • 12.2M • 5.23k
dataset-math-reasoning
bethgelab/CuratedThoughts • Updated Feb 26, 2025 • 222k • 279 • 44
open-r1/OpenR1-Math-220k • Updated Feb 18, 2025 • 450k • 12.4k • 691
facebook/natural_reasoning • Updated Feb 21, 2025 • 1.15M • 1.37k • 546
open-thoughts/OpenThoughts-114k • Updated Aug 31, 2025 • 228k • 101k • 784
old-language-models
openai-community/gpt2 • Text Generation • 0.1B • Updated Feb 19, 2024 • 6.79M • 3.08k
Code Language Models
Models generating code or performing code completion
refactai/Refact-1_6B-fim • Text Generation • 2B • Updated Nov 9, 2023 • 163k • 141
bigcode/starcoder2-3b • Text Generation • 3B • Updated Mar 4, 2024 • 142k • 213
Kwaipilot/KwaiCoder-DS-V2-Lite-Base • Text Generation • 16B • Updated Jan 6, 2025 • 47 • 6