Building on HF

Anurag

edwixx

https://anuragkanade.com/

AI & ML interests

ASR, TTS, Speech and Audio

Recent Activity

liked a Space 1 day ago

nanotron/ultrascale-playbook

liked a Space 1 day ago

HuggingFaceTB/smol-training-playbook

new activity 4 days ago

edwixx/aesthetic-image-captions-10k:[bot] Conversion to Parquet

View all activity

Organizations

upvoted an article about 1 month ago

Article

You could have designed state of the art positional encoding

FL33TW00D-HF

•

Nov 25, 2024

• 480

upvoted a changelog 2 months ago

Hugging Face Changelog

Introducing Buckets: S3-like storage on the Hub

Mar 10

• 187

upvoted 4 collections 4 months ago

upvoted an article 4 months ago

Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

nvidia

•

Jan 5

• 86

upvoted an article 6 months ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 389

upvoted a paper 6 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 134

upvoted 2 articles 7 months ago

Article

Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness

Steveeeeeeen

•

Nov 5, 2025

• 12

Article

G2P Shrinks Speech Models

hexgrad

•

Feb 5, 2025

• 94

upvoted a changelog 7 months ago

Hugging Face Changelog

Set Default Sorting in the Community Tab

Oct 28, 2025

• 70

upvoted 2 articles 7 months ago

Article

Large-scale Near-deduplication Behind BigCode

chenghao

•

May 16, 2023

• 37

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

spisakjo, darktex, zkwentz, mortimerp9, Sanyam, Hamid-Nazeri, Pankit01, emre0, lewtun, reach-vb

•

Oct 23, 2025

• 162

upvoted a collection 7 months ago

TTS

Collection

Collection of some of the TTS models i found cool • 6 items • Updated Oct 10, 2025 • 1

upvoted a paper 8 months ago

EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs

Paper • 2509.09174 • Published Sep 11, 2025 • 62

upvoted an article 9 months ago

Article

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

Wauplin, celinah, julien-c

•

Jul 25, 2025

• 84

upvoted an article 12 months ago

Article

KV Cache from scratch in nanoVLM

ariG23498, kashif, lusxvr, andito, pcuenq

•

Jun 4, 2025

• 119

upvoted a collection 12 months ago

MedGemma Release

Collection

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated Mar 12 • 492

upvoted a collection about 1 year ago

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.78k

Anurag

AI & ML interests

Recent Activity

Organizations

edwixx's activity

You could have designed state of the art positional encoding

Introducing Buckets: S3-like storage on the Hub

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Continuous batching from first principles

Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness

G2P Shrinks Speech Models

Set Default Sorting in the Community Tab

Large-scale Near-deduplication Behind BigCode

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

KV Cache from scratch in nanoVLM