Speech AI models for Apple Silicon via MLX. ASR, TTS, VAD, diarization, speaker embedding.
-
aufklarer/WeSpeaker-ResNet34-LM-MLX
Audio Classification • Updated • 344k • 2 -
aufklarer/Qwen3-ASR-0.6B-MLX-4bit
0.3B • Updated • 53.8k • 2 -
aufklarer/Qwen3-ForcedAligner-0.6B-4bit
Audio Classification • Updated • 46.8k • 1 -
aufklarer/Pyannote-Segmentation-MLX
Voice Activity Detection • Updated • 6.53k