whisper-transcribe
Transcribes audio and video files to text using OpenAI's Whisper CLI with contextual grounding. Converts audio/video to text, transcribes recordings, and creates transcripts from media files. Use when asked to "whisper transcribe", "transcribe audio", "convert recording to text", or "speech to text". Uses markdown files in the same directory as context to improve transcription accuracy for technical terms, proper nouns, and domain-specific vocabulary.
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent opencode skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent codex skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent gemini
First time? Install Skilz: pip install skilz
Works with 22+ AI coding agents
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/SpillwaveSolutions/whisper-transcribe cp -r whisper-transcribe ~/.claude/skills/ Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
stt-transcription
by astoreyaiSpeech-to-text transcription using multiple engines (Whisper, Google Speech, Azure, AssemblyAI). Record audio, transcribe files, real-time transcripti...
Video Processor
by MicrockProcess video files with audio extraction, format conversion (mp4, webm), and Whisper transcription. Use when user mentions video conversion, audio ex...
opencode_cli
by SpillwaveSolutionsThis skill should be used when configuring or using the OpenCode CLI for headless LLM automation. Use when the user asks to "configure opencode", "use...
sdd
by SpillwaveSolutionsThis skill should be used when users want guidance on Spec-Driven Development methodology using GitHub's Spec-Kit. Guide users through executable spec...
Agentic Skill Details
- Owner
- SpillwaveSolutions (GitHub)
- Repository
- whisper-transcribe
- Type
- Non-Technical
- Meta-Domain
- general
- Primary Domain
- general
- Sub-Domain
- whisper transcription
- Market Score
- 89.0
Agent Skill Grade
B
Score: 89/100
Click to see breakdown
Score Breakdown
Areas to Improve
- SKILL.md is 254 lines but lacks a Table of Contents for navigation.
- Model selection examples appear in both files; SKILL.md should just reference the detailed comparison.
- Installation takes 26 lines; could be condensed or moved to reference.
Recommendations
- Add trigger phrases to description for discoverability
- Add table of contents for files over 100 lines
Graded: 1/18/2026
Developer Feedback
I took a look at your whisper-transcribe skill and wanted to share some thoughts.
Links:
The TL;DR
You're at 89/100, solid B-grade territory. This is based on Anthropic's skill best practices rubric. Your strongest area is Spec Compliance (14/15) – you nailed the YAML frontmatter and naming conventions. The weaker spots are Progressive Disclosure (25/30) and Utility (17/20), mostly around how you're organizing information and guiding users through context file creation.
What's Working Well
- Excellent trigger phrases – Your metadata includes file extensions (.mp3, .wav) and descriptive triggers like "speech-to-text" that'll activate the skill appropriately in real workflows.
- Smart reference architecture – You've got
whisper-options.mdhandling the deep CLI details andcontext-template.mdas a practical asset. This layering is solid. - Real workflow clarity – The 4-step process (find files → transcribe → ground → save) is concrete and actually shows what users will do, not just theory.
- Context grounding is genuinely useful – This isn't another wrapper around Whisper; the markdown context feature solves a real problem (accuracy on technical terms and names).
The Big One: Missing Table of Contents
Your SKILL.md hits 254 lines but has no TOC. For a document that long, users browsing in Claude Code are bouncing around blind. Add this right after the description:
## Contents
- [Purpose](#purpose)
- [When to Use](#when-to-use)
- [Installation](#installation)
- [Transcription Workflow](#transcription-workflow)
- [Context Files](#context-files)
- [Model Selection Guide](#model-selection-guide)
- [Troubleshooting](#troubleshooting)
This alone gets you +1 point toward PDA and makes the skill way more usable in Claude Code's interface.
Other Things Worth Fixing
Trim the installation section – Currently 26 lines; condense to 8-10 lines with platform-specific commands side-by-side. The verbose explanations belong in a reference file, not inline.
Guide context file creation – Step 1 says "optionally create one using
assets/context-template.md" but doesn't explain what minimum content is needed. Change to: "Create one with at minimum: speaker names and key technical terms. Seeassets/context-template.mdfor the full template."Deduplicate the model comparison – You've got the same table in both SKILL.md and
references/whisper-options.md. Keep the detailed one in the reference; SKILL.md should just say "Seereferences/whisper-options.mdfor the full model comparison."
Quick Wins
- Add TOC to SKILL.md (+1 point, biggest bang for buck)
- Consolidate installation section (+1 point)
- Remove duplicate model table (+1 point)
- Clarify context file requirements (+1 point)
These four changes realistically get you to 93/100, A-grade territory.
Checkout your skill here: SkillzWave.ai | SpillWave We have an agentic skill installer that installs skills in 14+ coding agent platforms. Check out this guide on how to improve your agentic skills.
AI-Detected Topics
Extracted using NLP analysis
Browse Category
More general Agentic SkillsReport Security Issue
Found a security vulnerability in this agent skill?