AgentDish directory

speech-recognition

Accepted listings with this tag.

Listing Category Score Trend Checked
#31 ↓ -20
VibeVoice

Open-source voice AI from Microsoft with both long-form text-to-speech and speech recognition models. The repo highlights 90-minute multi-speaker TTS, 60-minute single-pass ASR, multilingual support, hotwording, and links to docs, Hugging Face, playground, finetuning, and papers.

Audio / Text-to-Speech / Speech Recognition 90 ↓ -20 45 days ago Details
#151 ↑ +2
CrankGPT

CrankGPT is a fully offline voice assistant built on a hand-crank-powered Raspberry Pi setup. The page explains the hardware stack, local speech recognition, on-device LLMs, text-to-speech, power smoothing, and latency measurements for running the system without cloud dependency.

AI Hardware / Offline Voice Assistant 86 ↑ +2 34 hours ago Details