AgentDish directory
speech-recognition
Accepted listings with this tag.
| Listing | Category | Score | Trend | Checked | |
|---|---|---|---|---|---|
|
#31
↓ -20
VibeVoice
Open-source voice AI from Microsoft with both long-form text-to-speech and speech recognition models. The repo highlights 90-minute multi-speaker TTS, 60-minute single-pass ASR, multilingual support, hotwording, and links to docs, Hugging Face, playground, finetuning, and papers. |
Audio / Text-to-Speech / Speech Recognition | 90 | ↓ -20 | 45 days ago | Details |
|
#151
↑ +2
CrankGPT
CrankGPT is a fully offline voice assistant built on a hand-crank-powered Raspberry Pi setup. The page explains the hardware stack, local speech recognition, on-device LLMs, text-to-speech, power smoothing, and latency measurements for running the system without cloud dependency. |
AI Hardware / Offline Voice Assistant | 86 | ↑ +2 | 34 hours ago | Details |