Hugging Face

Team

company

Verified

https://huggingface.co

huggingface

AI & ML interests

The AI community building the future.

Recent Activity

akhaliq submitted a paper about 2 hours ago

Towards a Science of Scaling Agent Systems

sayakpaul updated a dataset about 4 hours ago

huggingface/diffusers-metadata

lhoestq updated a dataset about 4 hours ago

huggingface/documentation-images

Papers

FineVision: Open Data Is All You Need

SmolVLM: Redefining small and efficient multimodal models

Articles

sayakpaul

updated a dataset about 4 hours ago

huggingface/diffusers-metadata

Viewer • Updated about 4 hours ago • 81 • 1.21k • 13

lhoestq

updated a dataset about 4 hours ago

huggingface/documentation-images

Viewer • Updated about 4 hours ago • 55 • 2.02M • 93

pagezyhf

updated a dataset about 5 hours ago

huggingface/documentation-images

Viewer • Updated about 4 hours ago • 55 • 2.02M • 93

sergiopaniego

posted an update about 6 hours ago

Post

TRL now includes agent training support for GRPO‼️

Train 🕵️ agents with 🔧 tools, enabling interaction with external functions and APIs.

And of course, a new notebook and scripts to get you up to speed

📘 notebook tutorial: https://github.com/huggingface/trl/blob/main/examples/notebooks/grpo_agent.ipynb

📂 script examples: https://github.com/huggingface/trl/blob/main/examples/scripts/grpo_agent.py

📦 TRL v0.26.0 release: https://github.com/huggingface/trl/releases/tag/v0.26.0
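
For readers new to the idea of "tools" in agent training, here is a minimal sketch of the general pattern: the model emits a structured tool call, the loop runs the matching Python function, and the result goes back into the conversation. This is plain Python and not the TRL API. The names `TOOLS` and `run_tool_call`, and the JSON call format, are hypothetical and chosen only for illustration; the linked notebook shows how TRL actually wires this up.

```python
import json


def add(a: float, b: float) -> float:
    """Toy tool: return the sum of two numbers."""
    return a + b


def word_length(word: str) -> int:
    """Toy tool: return the number of characters in a word."""
    return len(word)


# Hypothetical registry mapping tool names to plain Python callables.
TOOLS = {"add": add, "word_length": word_length}


def run_tool_call(raw_call: str) -> dict:
    """Parse a JSON tool call and execute the matching function.

    Assumed (illustrative) format: {"name": ..., "arguments": {...}}.
    Errors are returned as messages instead of being raised, so the
    model could see them and try again.
    """
    try:
        call = json.loads(raw_call)
        func = TOOLS[call["name"]]
        result = func(**call["arguments"])
        return {"role": "tool", "name": call["name"], "content": str(result)}
    except (json.JSONDecodeError, KeyError, TypeError) as exc:
        return {"role": "tool", "name": "error", "content": f"{type(exc).__name__}: {exc}"}
```

Returning errors as tool messages is one possible design choice: it keeps a rollout alive when the model produces a malformed call, so the reward can still reflect the mistake.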

lysandre

updated a dataset about 6 hours ago

huggingface/transformers-metadata

Viewer • Updated about 6 hours ago • 1.95k • 1.87k • 31

nielsr

updated a dataset about 12 hours ago

huggingface/trending-papers-x

Viewer • Updated about 12 hours ago • 24 • 98 • 1

alvarobartt

updated a dataset about 21 hours ago

huggingface/DEH-image-scan-data

Viewer • Updated about 21 hours ago • 3 • 1.97k • 2

sergiopaniego

posted an update 1 day ago

Post

1402

ICYMI, you can fine-tune open LLMs using Claude Code

just tell it:
“Fine-tune Qwen3-0.6B on open-r1/codeforces-cots”

and Claude submits a real training job on HF GPUs using TRL.

it handles everything:
> dataset validation
> GPU selection
> training + Trackio monitoring
> job submission + cost estimation

when it’s done, your model is on the Hub, ready to use

read more about the process: https://huggingface.co/blog/hf-skills-training

sergiopaniego

posted an update 1 day ago

Post

1355

We just released TRL v0.26.0!

It comes packed with updates:
> Agent training with tools in GRPO
> New CISPO & SAPO losses + reasoning rewards
> vLLM quantization in colocate mode
> Dataset shuffling in SFT
> Lots of NEW examples
> Tons of fixes and documentation improvements

3 replies

sergiopaniego

posted an update 2 days ago

Post

2742

NEW: @EssentialAI just released Rnj-1, their first 8B model.

You can easily fine-tune it with GRPO using TRL to add reasoning capabilities to a compact model

Free Colab link: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_rnj_1_instruct.ipynb

More free TRL notebooks: https://huggingface.co/docs/trl/main/en/example_overview#notebooks
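
As background on what GRPO does with rewards, the core idea is to score several completions for the same prompt and normalize each reward against its group. The sketch below is a simplified illustration in plain Python, not TRL's implementation. The function name, the `eps` value, and the use of the population standard deviation are assumptions made for the example.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards: list[float], eps: float = 1e-4) -> list[float]:
    """Normalize rewards within one group of completions for the same prompt.

    Each advantage is (reward - group mean) / (group std + eps).
    eps (an assumed value) avoids division by zero when all rewards match.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std over the group (an illustrative choice)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Example: four completions for one prompt, two of them rewarded.
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Completions that beat their group average get a positive advantage and the rest get a negative one, which is why no separate value model is needed.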

alozowski

authored a paper 2 days ago

YourBench: Easy Custom Evaluation Sets for Everyone

Paper • 2504.01833 • Published Apr 2 • 22

sergiopaniego

posted an update 6 days ago

Post

2778

Want to get started with fine-tuning but don’t know where to begin? 🤓☝️

We’re expanding our collection of beginner-friendly free Colab notebooks so you can learn and fine-tune models using TRL at no cost

🔬 Check out the full list of free notebooks: https://huggingface.co/docs/trl/main/en/example_overview#notebooks

🔬 If you want more advanced content, we also have a lot to cover in the community tutorials: https://huggingface.co/docs/trl/community_tutorials

And now the obvious question: what would you like us to add next?

sergiopaniego

posted an update 8 days ago

Post

2320

NEW: @mistralai released a fantastic family of multimodal models, Ministral 3.

You can fine-tune them for free on Colab using TRL ⚡️, supporting both SFT and GRPO

Link to the notebooks:
- SFT: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/sft_ministral3_vl.ipynb
- GRPO: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_ministral3_vl.ipynb
- TRL and more examples: https://huggingface.co/docs/trl/index

2 replies

sergiopaniego

posted an update 10 days ago

Post

2146

ICYMI, transformers v5 is out!

Grab a coffee ☕ and go read the announcement blog https://huggingface.co/blog/transformers-v5

sergiopaniego

posted an update 10 days ago

Post

3080

want to use open models easily through an API?

Inference Providers might be exactly what you’re looking for sooo here’s a complete beginner-friendly walkthrough 🧐

https://www.youtube.com/watch?v=oxwsizy1Spw

2 replies

sergiopaniego

posted an update 14 days ago

Post

1728

nanochat is now in transformers!

The LLM by @karpathy is officially in the library, and we wrote a blog post covering how we ported the model, how it differs from the original, and how to run or train it.

go read it 🤓

nanochat-students/transformers

sergiopaniego

posted an update 16 days ago

Post

3948

you gotta go fast and go read the latest blog by @ror et al. explaining Continuous Batching in depth

https://huggingface.co/blog/continuous_batching

eustlb

authored a paper 17 days ago

Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation

Paper • 2510.06961 • Published Oct 8 • 10

sergiopaniego

posted an update 17 days ago

Post

1712

Interested in RL training environments?

We just released a beginner-friendly walkthrough notebook!

Train a model to play Wordle using TRL + OpenEnv (TextArena) + GRPO + vLLM.

happy learning! 🌱

Notebook: https://github.com/huggingface/trl/blob/main/examples/notebooks/openenv_wordle_grpo.ipynb

OpenEnv guide in TRL: https://huggingface.co/docs/trl/main/en/openenv
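
To give a feel for what a Wordle reward signal could look like, here is a small self-contained sketch. It is not taken from the notebook or from OpenEnv: the functions `wordle_feedback` and `wordle_reward`, and the scoring weights, are hypothetical and only illustrate how a guess might be turned into a number for RL training.

```python
from collections import Counter


def wordle_feedback(guess: str, target: str) -> str:
    """Return per-letter feedback: G = right spot, Y = wrong spot, X = absent."""
    if len(guess) != len(target):
        raise ValueError("guess and target must have the same length")
    feedback = ["X"] * len(guess)
    # Target letters that were not matched exactly stay available for Y marks.
    remaining = Counter(t for g, t in zip(guess, target) if g != t)
    # First pass: exact matches.
    for i, (g, t) in enumerate(zip(guess, target)):
        if g == t:
            feedback[i] = "G"
    # Second pass: misplaced letters, limited by how many copies remain.
    for i, g in enumerate(guess):
        if feedback[i] == "X" and remaining[g] > 0:
            feedback[i] = "Y"
            remaining[g] -= 1
    return "".join(feedback)


def wordle_reward(guess: str, target: str) -> float:
    """Hypothetical shaped reward in [0, 1]: 1 per G, 0.5 per Y, averaged."""
    fb = wordle_feedback(guess, target)
    return (fb.count("G") + 0.5 * fb.count("Y")) / len(target)
```

The two-pass structure handles repeated letters: a duplicate letter in the guess is only marked Y while unmatched copies of it remain in the target.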

Steveeeeeeen

authored a paper 20 days ago

Treble10: A high-quality dataset for far-field speech recognition, dereverberation, and enhancement

Paper • 2510.23141 • Published Oct 27 • 4

Articles

On the Shifting Global Compute Landscape

Announcing Hugging Face Fundamentals: A New Learning Track on DataCamp

Yay! Organizations can now publish blog Articles

Team members 190

huggingface's activity