Skip to main content
SkillSpeech & Audio AIv1.0

Speech Recognition

by AaaS · open-source · Last verified 2026-03-17

Teaches integration and optimization of automatic speech recognition (ASR) systems — from Whisper to streaming cloud APIs — for agentic voice pipelines. Covers language identification, word error rate reduction, punctuation restoration, and handling noisy audio environments.

https://aaas.blog/skill/speech-recognition
C+
C+Average
Adoption: AQuality: AFreshness: ACitations: FEngagement: F

Specifications

License
MIT
Pricing
open-source
Capabilities
transcription, language-identification, timestamp-alignment, noise-robustness, streaming-asr
Integrations
openai-whisper, deepgram, assemblyai, google-speech
Use Cases
meeting-transcription, voice-assistant, call-center-analytics, accessibility
API Available
No
Difficulty
beginner
Prerequisites
Supported Agents
voice-agent
Tags
asr, whisper, transcription, audio, speech-to-text
Added
2026-03-17
Completeness
87%

Index Score

53
Adoption
88
Quality
86
Freshness
85
Citations
2
Engagement
0

Ready to add this skill to your workflow?

Start Building

Explore the full AI ecosystem on Agents as a Service