Spaces:

CLEAR-Global
/

speech-resource-finder

Running

App Files Files Community

speech-resource-finder / README.md

Alp

first

87f167e about 1 month ago

preview code

raw

history blame contribute delete

2.91 kB

	---
	title: Speech Resource Finder
	emoji: 🧭
	colorFrom: gray
	colorTo: pink
	sdk: gradio
	sdk_version: 5.49.1
	app_file: app.py
	pinned: false
	short_description: 'Discover ASR and TTS support and resources for any language '
	---

	# Speech Resource Finder

	## Description

	Almost 4 billion people speak languages with little or no speech technology support. This tool makes visible which languages have resources available and which communities are being left behind in the speech AI revolution.

	Built by CLEAR Global to support language inclusion and help close the digital language divide.

	## Data Sources

	### Commercial Speech Services

	Commercial service support is automatically pulled from the language support page of each service provider.

	- Azure Speech Services - [Speech-to-Text](https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support?tabs=stt) \| [Text-to-Speech](https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support?tabs=tts)
	- Google Cloud Speech - [Speech-to-Text](https://cloud.google.com/speech-to-text/docs/speech-to-text-supported-languages) \| [Text-to-Speech](https://cloud.google.com/text-to-speech/docs/voices)
	- AWS - [Transcribe](https://docs.aws.amazon.com/transcribe/latest/dg/supported-languages.html) \| [Polly](https://docs.aws.amazon.com/polly/latest/dg/supported-languages.html)
	- ElevenLabs - [Multilingual v2](https://elevenlabs.io/docs/models#multilingual-v2) \| [Turbo v3](https://elevenlabs.io/docs/models#eleven-v3-alpha)

	### Open Source Resources
	- HuggingFace Models - Pre-trained speech models sorted by downloads
	- [ASR Models](https://huggingface.co/models?pipeline_tag=automatic-speech-recognition)
	- [TTS Models](https://huggingface.co/models?pipeline_tag=text-to-speech)
	- HuggingFace Datasets - Speech corpora for training and evaluation
	- [ASR Datasets](https://huggingface.co/datasets?task_categories=task_categories:automatic-speech-recognition)
	- [TTS Datasets](https://huggingface.co/datasets?task_categories=task_categories:text-to-speech)

	## How to Use
	1. Select a language from the dropdown (type to search by name or ISO code)
	2. Toggle model deduplication if desired (enabled by default)
	3. Review results: commercial availability, models, and datasets
	4. Click model/dataset names to open on HuggingFace

	## Disclaimer

	- Currently lists only 487 languages and is taken from this [Github repository](https://github.com/datasets/language-codes).
	- Data fetched in real-time and can change.
	- This is not an exhaustive list. There are other commercial voice technology providers and dataset/model resources that this app doesn't cover.
	- Deduplication discards models with same name uploaded by others and keeps the most downloaded version in the list.

	## Feedback

	We would love to hear your feedback and suggestions. Please write us at [email protected].