AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss • Paper • 1905.05879 • Published May 14, 2019 • 1
NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts • Paper • 2411.05945 • Published Nov 8, 2024 • 4
Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Guidance • Paper • 2502.05236 • Published Feb 7, 2025
HiFiTTS-2: A Large-Scale High Bandwidth Speech Dataset • Paper • 2506.04152 • Published Jun 4, 2025
nvidia/diar_sortformer_4spk-v1 • Automatic Speech Recognition • 0.1B • Updated 19 days ago • 5.47k • 120