razhan/DOLMA-speech
Viewer • Updated • 41.7k • 121 • 1
How to use razhan/whisper-base-mzn with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="razhan/whisper-base-mzn") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("razhan/whisper-base-mzn")
model = AutoModelForSpeechSeq2Seq.from_pretrained("razhan/whisper-base-mzn")This model is a fine-tuned version of openai/whisper-base on the razhan/DOLMA-speech mazanderani dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| No log | 1.0 | 3 | 2.1575 | 1.3606 | 0.7428 |
| No log | 2.0 | 6 | 2.1575 | 1.3606 | 0.7428 |
| No log | 3.0 | 9 | 1.6346 | 0.9451 | 0.6618 |
| 2.0051 | 4.0 | 12 | 1.4320 | 0.8414 | 0.3594 |
| 2.0051 | 5.0 | 15 | 1.3368 | 0.8150 | 0.3173 |
Base model
openai/whisper-base