Speech Recognition
by AaaS · open-source · Last verified 2026-03-17
Teaches integration and optimization of automatic speech recognition (ASR) systems — from Whisper to streaming cloud APIs — for agentic voice pipelines. Covers language identification, word error rate reduction, punctuation restoration, and handling noisy audio environments.
https://aaas.blog/skill/speech-recognition
Grade: B+ (Good)
Adoption: A · Quality: A · Freshness: A · Citations: B+ · Engagement: F
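Word error rate (WER), one of the metrics named in the description above, is conventionally defined as the word-level edit distance (substitutions + deletions + insertions) divided by the number of words in the reference transcript. The following is a minimal sketch of that calculation, not code from the skill itself; the function name `word_error_rate` and the normalization (lowercasing, whitespace tokenization) are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by reference length.

    Assumption: text is normalized only by lowercasing and splitting on
    whitespace; real evaluations often also strip punctuation.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion (reference word missing)
                curr[j - 1] + 1,                  # insertion (extra hypothesis word)
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.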
Specifications
- License: MIT
- Pricing: open-source
- Capabilities: transcription, language-identification, timestamp-alignment, noise-robustness, streaming-asr
- Integrations: openai-whisper, deepgram, assemblyai, google-speech
- Use Cases: meeting-transcription, voice-assistant, call-center-analytics, accessibility
- API Available: No
- Difficulty: beginner
- Prerequisites:
- Supported Agents: voice-agent
- Tags: asr, whisper, transcription, audio, speech-to-text
- Added: 2026-03-17
- Completeness: 100%
Index Score: 71.9
- Adoption: 88
- Quality: 86
- Freshness: 85
- Citations: 78
- Engagement: 0