Joao Gante

joaogante

https://github.com/gante

AI & ML interests

None yet

Recent Activity

upvoted an article 12 days ago

Mixture of Experts (MoEs) in Transformers

liked a model 4 months ago

deepseek-ai/DeepSeek-V3.2-Speciale

liked a Space 5 months ago

HuggingFaceTB/smol-training-playbook

View all activity

Organizations

liked a model 4 months ago

deepseek-ai/DeepSeek-V3.2-Speciale

Text Generation • Updated Dec 1, 2025 • 18.3k • 691

liked a Space 5 months ago

The Smol Training Playbook

📚

3.09k

The secrets to building world-class LLMs

liked a Space 6 months ago

Maintain the unmaintainable

📚

Explore the complex relationships between 400+ machine learning models

liked 2 models 8 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 5.74M • • 4.51k

transformers-community/sep_cache

8B • Updated Aug 4, 2025 • 10 • 9

liked a model 9 months ago

mistralai/Voxtral-Mini-3B-2507

5B • Updated Jul 28, 2025 • 550k • 637

liked a model 11 months ago

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated Jul 26, 2025 • 14.4M • 1.18k

liked a model about 1 year ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated Feb 24, 2025 • 619k • • 802

liked a model over 1 year ago

Qwen/Qwen2.5-0.5B-Instruct

Text Generation • 0.5B • Updated Sep 25, 2024 • 6.1M • 494

liked a Space over 1 year ago

SynthID Text

🏃

Watermarking LLM-generated text with SynthID Text

liked a model over 1 year ago

meta-llama/Llama-3.2-1B

Text Generation • 1B • Updated Oct 24, 2024 • 1.5M • 2.35k

liked a Space over 1 year ago

Repository statistics

📊

liked 2 models over 1 year ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 8.64M • • 5.67k

mattshumer/Reflection-Llama-3.1-70B

Text Generation • 71B • Updated Sep 24, 2024 • 282 • 1.71k

liked a Space over 1 year ago

FLUX.1 [dev]

🖥

9.42k

Generate images from text prompts with FLUX.1 diffusion model

liked a model over 1 year ago

google/gemma-2-2b-it

Text Generation • 3B • Updated Aug 27, 2024 • 380k • • 1.32k

liked a Space over 1 year ago

Hf Co Docs Chat

🚀

liked 3 Spaces almost 2 years ago

Open-LLM performances are plateauing, let’s make the leaderboard steep again

🏔

127

Explore and compare advanced language models on a new leaderboard

Omni-Zero

🧛

462

Restylize & repose person ID

FineWeb: decanting the web for the finest text data at scale

🍷

1.32k

Read a detailed overview of the FineWeb web‑scale text dataset

Joao Gante

AI & ML interests

Recent Activity

Organizations

joaogante's activity

The Smol Training Playbook

Maintain the unmaintainable

SynthID Text

Repository statistics

FLUX.1 [dev]

Hf Co Docs Chat

Open-LLM performances are plateauing, let’s make the leaderboard steep again

Omni-Zero

FineWeb: decanting the web for the finest text data at scale