stt-transcription
Speech-to-text transcription using multiple engines (Whisper, Google Speech, Azure, AssemblyAI). Supports audio recording, file transcription, real-time transcription, speaker diarization, timestamps, and multiple languages. Use it for meeting transcription, voice notes, audio file processing, or accessibility features.
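To make the timestamp and speaker-diarization features concrete, here is a minimal sketch of how diarized segments might be rendered as a readable transcript. The Segment type, its fields, and both helper functions are hypothetical illustrations, not part of this skill or of any engine's API; real engines return their own result structures that would need mapping into this shape.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical transcript segment; field names are assumptions."""
    start: float   # segment start, in seconds
    end: float     # segment end, in seconds
    speaker: str   # diarization label, e.g. "Speaker 1"
    text: str      # recognized text for this segment


def format_timestamp(seconds: float) -> str:
    """Convert seconds to HH:MM:SS.mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def render_transcript(segments: list[Segment]) -> str:
    """Render one line per segment: [start - end] speaker: text."""
    lines = []
    for seg in segments:
        span = f"{format_timestamp(seg.start)} - {format_timestamp(seg.end)}"
        lines.append(f"[{span}] {seg.speaker}: {seg.text.strip()}")
    return "\n".join(lines)


if __name__ == "__main__":
    demo = [
        Segment(0.0, 2.25, "Speaker 1", " Good morning, everyone. "),
        Segment(2.25, 5.0, "Speaker 2", "Morning. Shall we start?"),
    ]
    print(render_transcript(demo))
```

Keeping the rendering step separate from the engine call means the same output format could be reused whichever engine produced the segments.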
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files.
Installation for Agentic Skill
skilz install astoreyai/claude-skills/stt-transcription
skilz install astoreyai/claude-skills/stt-transcription --agent opencode
skilz install astoreyai/claude-skills/stt-transcription --agent codex
skilz install astoreyai/claude-skills/stt-transcription --agent gemini
First time? Install Skilz: pip install skilz
Works with 14 AI coding assistants, including Cursor, Aider, Copilot, Windsurf, Qwen, and Kimi.
Extract and copy to ~/.claude/skills/, then restart Claude Desktop:
git clone https://github.com/astoreyai/claude-skills
cp -r claude-skills/skills/utility/stt-transcription ~/.claude/skills/
Related Agentic Skills
whisper-transcribe
by SpillwaveSolutions: Transcribes audio and video files to text using OpenAI's Whisper CLI with contextual grounding. This skill should be used when users need to convert...
Video Processor
by Microck: Process video files with audio extraction, format conversion (mp4, webm), and Whisper transcription. Use when user mentions video conversion, audio ex...
architect-agent
by SpillwaveSolutions: "Use this skill ONLY when user explicitly requests: (1) 'write instructions for code agent' or 'create instructions', (2) 'this is a new architect age...
confluence
by SpillwaveSolutions: This skill should be used when working with Confluence documentation - downloading pages to Markdown, converting between Wiki Markup and Markdown, cre...
Agentic Skill Details
- Repository: claude-skills
- Type: Non-Technical
- Meta-Domain: general
- Primary Domain: general
- Sub-Domain: whisper transcription
- Market Score: 25.1