EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer Paper • 2409.10819 • Published Sep 17, 2024 • 18
Audio Match Cutting: Finding and Creating Matching Audio Transitions in Movies and Videos Paper • 2408.10998 • Published Aug 20, 2024 • 9
Accelerating High-Fidelity Waveform Generation via Adversarial Flow Matching Optimization Paper • 2408.08019 • Published Aug 15, 2024 • 11
PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation Paper • 2408.07547 • Published Aug 14, 2024 • 8
DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion Paper • 2305.15816 • Published May 25, 2023
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis Paper • 2311.12454 • Published Nov 21, 2023 • 30
HierSpeech++ (Zero-shot TTS) • Runtime error • 398 • Generate high-quality speech from text using a prompt audio