|
|
--- |
|
|
title: Speech Resource Finder |
|
|
emoji: 🧭 |
|
|
colorFrom: gray |
|
|
colorTo: pink |
|
|
sdk: gradio |
|
|
sdk_version: 5.49.1 |
|
|
app_file: app.py |
|
|
pinned: false |
|
|
short_description: 'Discover ASR and TTS support and resources for any language ' |
|
|
--- |
|
|
|
|
|
# Speech Resource Finder |
|
|
|
|
|
## Description |
|
|
|
|
|
Almost 4 billion people speak languages with little or no speech technology support. This tool makes visible which languages have resources available and which communities are being left behind in the speech AI revolution. |
|
|
|
|
|
Built by CLEAR Global to support language inclusion and help close the digital language divide. |
|
|
|
|
|
## Data Sources |
|
|
|
|
|
### Commercial Speech Services |
|
|
|
|
|
Commercial service support is automatically pulled from the language support page of each service provider. |
|
|
|
|
|
- **Azure Speech Services** - [Speech-to-Text](https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support?tabs=stt) | [Text-to-Speech](https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support?tabs=tts) |
|
|
- **Google Cloud Speech** - [Speech-to-Text](https://cloud.google.com/speech-to-text/docs/speech-to-text-supported-languages) | [Text-to-Speech](https://cloud.google.com/text-to-speech/docs/voices) |
|
|
- **AWS** - [Transcribe](https://docs.aws.amazon.com/transcribe/latest/dg/supported-languages.html) | [Polly](https://docs.aws.amazon.com/polly/latest/dg/supported-languages.html) |
|
|
- **ElevenLabs** - [Multilingual v2](https://elevenlabs.io/docs/models#multilingual-v2) | [Turbo v3](https://elevenlabs.io/docs/models#eleven-v3-alpha) |
|
|
|
|
|
### Open Source Resources |
|
|
- **HuggingFace Models** - Pre-trained speech models sorted by downloads |
|
|
- [ASR Models](https://huggingface.co/models?pipeline_tag=automatic-speech-recognition) |
|
|
- [TTS Models](https://huggingface.co/models?pipeline_tag=text-to-speech) |
|
|
- **HuggingFace Datasets** - Speech corpora for training and evaluation |
|
|
- [ASR Datasets](https://huggingface.co/datasets?task_categories=task_categories:automatic-speech-recognition) |
|
|
- [TTS Datasets](https://huggingface.co/datasets?task_categories=task_categories:text-to-speech) |
|
|
|
|
|
## How to Use |
|
|
1. Select a language from the dropdown (type to search by name or ISO code) |
|
|
2. Toggle model deduplication if desired (enabled by default) |
|
|
3. Review results: commercial availability, models, and datasets |
|
|
4. Click model/dataset names to open on HuggingFace |
|
|
|
|
|
## Disclaimer |
|
|
|
|
|
- Currently lists only 487 languages and is taken from this [Github repository](https://github.com/datasets/language-codes). |
|
|
- Data fetched in real-time and can change. |
|
|
- This is not an exhaustive list. There are other commercial voice technology providers and dataset/model resources that this app doesn't cover. |
|
|
- Deduplication discards models with same name uploaded by others and keeps the most downloaded version in the list. |
|
|
|
|
|
## Feedback |
|
|
|
|
|
We would love to hear your feedback and suggestions. Please write us at [email protected]. |
|
|
|