4 34 232

Koty KD

kotyKD

AI & ML interests

None yet

Recent Activity

liked a Space 5 days ago

gemma-challenge/gemma-dashboard

liked a Space about 1 month ago

HuggingFaceTB/trl-distillation-trainer

liked a model 2 months ago

unsloth/gemma-4-31B-it-GGUF

View all activity

Organizations

None yet

upvoted 2 collections 3 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 5 days ago • 162

Sutra Pedagogical Datasets

Collection

High-quality synthetic educational datasets designed for LLM pretraining with structured pedagogical content across 9 knowledge domains. • 7 items • Updated Mar 17 • 5

upvoted 2 articles 4 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 158

Article

Custom Kernels for All from Codex and Claude

burtenshaw, sayakpaul, ariG23498, evalstate

•

Feb 13

• 80

upvoted a paper 5 months ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 37

upvoted a collection 5 months ago

Falcon-H1-Tiny

Collection

A series of extremely small, yet powerful language models redefining capabilities at small scale • 19 items • Updated Mar 2 • 37

upvoted an article 6 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

mlabonne

•

Jul 29, 2024

• 372

upvoted 2 articles 7 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

codelion

•

Nov 3, 2025

• 65

Article

What makes good reasoning data

MiniMax-AI

•

Oct 30, 2025

• 45

upvoted 3 collections 8 months ago

upvoted a collection 9 months ago

Granite 4.0 Language Models

Collection

Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated Apr 29 • 221

upvoted 2 articles 11 months ago

Article

Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation

codelion

•

Aug 3, 2025

• 7

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 779

upvoted a paper about 1 year ago

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Paper • 2401.00788 • Published Jan 1, 2024 • 23

upvoted an article about 1 year ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

codelion

•

May 20, 2025

• 69

upvoted a collection about 1 year ago

RADLADS

Collection

7 items • Updated May 7, 2025 • 8

upvoted a paper about 1 year ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 99

upvoted a collection about 1 year ago

Unsloth Dynamic 2.0 Quants

Collection

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 105 items • Updated 1 day ago • 710

Koty KD

AI & ML interests

Recent Activity

Organizations

kotyKD's activity

We Got Claude to Build CUDA Kernels and teach open models!

Custom Kernels for All from Codex and Claude

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

What makes good reasoning data

Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation

SmolLM3: smol, multilingual, long-context reasoner

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve