Helin Wang

westbrook

WangHelin1997

AI & ML interests

Speech & Audio Processing

Recent Activity

liked a model about 2 months ago

maya-research/maya1

Text-to-Speech • 3B • Updated Nov 12 • 80.6k • 829

liked a Space 2 months ago

FlexSED

🎧

an open-vocabulary sound event detection model

liked a dataset 3 months ago

MBZUAI/ArVoice

Viewer • Updated Oct 31 • 46.2k • 1.26k • 26

liked a dataset 6 months ago

espnet/mms_ulab_v2

Viewer • Updated Feb 4 • 20.7k • 1.65k • 24

liked a dataset 7 months ago

google/fleurs

Updated Aug 25, 2024 • 51.1k • 361

liked a Space 7 months ago

EzAudio

🟣

275

Generate and edit audio from text prompts

New activity in OpenSound/CapSpeech 7 months ago

Improve dataset card with paper link and Github links

#2 opened 7 months ago by nielsr

authored a paper 7 months ago

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Paper • 2506.02863 • Published Jun 3 • 8

commented a paper 7 months ago

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Paper • 2506.02863 • Published Jun 3 • 8

upvoted a paper 7 months ago

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Paper • 2506.02863 • Published Jun 3 • 8

liked a Space 7 months ago

CapSpeech TTS

🧢

Stylized TTS – design voice, accent, and emotion your way

New activity in OpenSound/CapSpeech-PT-SEDB-Audio 7 months ago

Update README.md

#1 opened 7 months ago by westbrook

authored 3 papers 7 months ago

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Paper • 2505.19314 • Published May 25 • 4

Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits

Paper • 2505.14648 • Published May 20 • 9

Noise-robust Speech Separation with Fast Generative Correction

Paper • 2406.07461 • Published Jun 11, 2024

upvoted a paper 7 months ago

Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits

Paper • 2505.14648 • Published May 20 • 9

updated a collection 7 months ago

speech

Collection

2 items • Updated May 28

commented a paper 7 months ago

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Paper • 2505.19314 • Published May 25 • 4

upvoted a paper 7 months ago

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Paper • 2505.19314 • Published May 25 • 4

liked a dataset 7 months ago

OpenSound/CapSpeech

Viewer • Updated Jun 4 • 20.8M • 725 • 24
