Xuesong Yang

magicyoung8

http://goo.gl/86Fx5i

AI & ML interests

Speech AI

Recent Activity

liked a Space about 1 month ago

rc19477/Speech_Enhancement_Mamba

Speech Enhancement Mamba

Speech Enhancement using Mamba (SEMamba)

liked a dataset about 1 month ago

nvidia/hifitts-2

Viewer • Updated Nov 18, 2025 • 16.6M • 527 • 24

authored 4 papers 7 months ago

AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Paper • 1905.05879 • Published May 14, 2019 • 1

NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts

Paper • 2411.05945 • Published Nov 8, 2024 • 4

Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Guidance

Paper • 2502.05236 • Published Feb 7, 2025

HiFiTTS-2: A Large-Scale High Bandwidth Speech Dataset

Paper • 2506.04152 • Published Jun 4, 2025

New activity in nvidia/hifitts-2 7 months ago

update config of README to address schema mismatch issue

#3 opened 7 months ago by

added bandwidth estimation details in the README.md

#2 opened 7 months ago by

updated a model 11 months ago

nvidia/diar_sortformer_4spk-v1

Automatic Speech Recognition • 0.1B • Updated 19 days ago • 5.47k • 120