whisper-transcribe
Transcribes audio and video files to text using OpenAI's Whisper CLI with contextual grounding. Converts audio/video to text, transcribes recordings, and creates transcripts from media files. Use when asked to "whisper transcribe", "transcribe audio", "convert recording to text", or "speech to text". Uses markdown files in the same directory as context to improve transcription accuracy for technical terms, proper nouns, and domain-specific vocabulary.
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files.
Installation for Agentic Skill
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent opencode
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent codex
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent gemini
First time? Install Skilz: pip install skilz
Works with 22+ AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/SpillwaveSolutions/whisper-transcribe
cp -r whisper-transcribe ~/.claude/skills/
Need detailed installation help? Check our platform-specific guides.
Related Agentic Skills
stt-transcription
by astoreyai
Speech-to-text transcription using multiple engines (Whisper, Google Speech, Azure, AssemblyAI). Record audio, transcribe files, real-time transcri...
Video Processor
by Microck
Process video files with audio extraction, format conversion (mp4, webm), and Whisper transcription. Use when user mentions video conversion, audio...
opencode_cli
by SpillwaveSolutions
This skill should be used when configuring or using the OpenCode CLI for headless LLM automation. Use when the user asks to "configure opencode", "...
sdd
by SpillwaveSolutions
This skill should be used when users want guidance on Spec-Driven Development methodology using GitHub's Spec-Kit. Guide users through executable s...
Agentic Skill Details
- Owner: SpillwaveSolutions (GitHub)
- Repository: whisper-transcribe
- Type: Non-Technical
- Meta-Domain: general
- Primary Domain: general
- Sub-Domain: video whisper transcription
- Market Score: 89
Agent Skill Grade
B (Score: 89/100)
Score Breakdown
Areas to Improve
- Missing TOC in SKILL.md
- Duplicated model comparison
- Installation section too verbose
Recommendations
- Add trigger phrases to description for discoverability
- Add table of contents for files over 100 lines
Graded: 2026-01-19
Developer Feedback
I took a look at your whisper-transcribe skill and wanted to share some thoughts.
The TL;DR
You're at 89/100, solid B-grade territory. This is based on Anthropic's skill best practices rubric. Your strongest area is Spec Compliance (14/15) – you nailed the YAML frontmatter and naming conventions. The weaker spots are Progressive Disclosure (25/30) and Utility (17/20), mostly around how you're organizing information and guiding users through context file creation.
What's Working Well
- Excellent trigger phrases – Your metadata includes file extensions (.mp3, .wav) and descriptive triggers like "speech-to-text" that'll activate the skill appropriately in real workflows.
- Smart reference architecture – You've got whisper-options.md handling the deep CLI details and context-template.md as a practical asset. This layering is solid.
- Real workflow clarity – The 4-step process (find files → transcribe → ground → save) is concrete and actually shows what users will do, not just theory.
- Context grounding is genuinely useful – This isn't another wrapper around Whisper; the markdown context feature solves a real problem (accuracy on technical terms and names).
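To make the context-grounding idea concrete, here is a minimal Python sketch of one way the markdown context could reach Whisper. It assumes grounding is done by passing the context through the Whisper CLI's `--initial_prompt` option; the listing does not say how the skill actually grounds its output, so treat this as one plausible approach, not the skill's implementation. The helper names (`find_media`, `collect_context`, `build_whisper_command`), the extension set, and the 800-character cap are all hypothetical. The command is only built, not executed.

```python
from pathlib import Path

# Assumed extension set; the listing itself only names .mp3 and .wav.
MEDIA_EXTENSIONS = {".mp3", ".wav", ".m4a", ".mp4"}


def find_media(directory: Path) -> list[Path]:
    """Step 1 (find files): list media files in the directory, sorted by name."""
    return sorted(p for p in directory.iterdir() if p.suffix.lower() in MEDIA_EXTENSIONS)


def collect_context(directory: Path, max_chars: int = 800) -> str:
    """Join the text of every markdown file in the directory into one short prompt."""
    parts = [p.read_text(encoding="utf-8") for p in sorted(directory.glob("*.md"))]
    # Collapse whitespace and cap the length (the cap is an arbitrary assumption).
    return " ".join(" ".join(parts).split())[:max_chars]


def build_whisper_command(media: Path, model: str = "base") -> list[str]:
    """Steps 2-4: build (but do not run) a whisper CLI call that writes a .txt transcript."""
    command = [
        "whisper", str(media),
        "--model", model,
        "--output_format", "txt",
        "--output_dir", str(media.parent),
    ]
    prompt = collect_context(media.parent)
    if prompt:
        # Assumption: grounding is approximated with Whisper's initial prompt.
        command += ["--initial_prompt", prompt]
    return command
```

The resulting list can be handed to `subprocess.run(command, check=True)` once the `whisper` CLI is installed. Building the argument list separately keeps the context-gathering logic easy to inspect before any audio is processed.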
The Big One: Missing Table of Contents
Your SKILL.md hits 254 lines but has no TOC. For a document that long, users browsing in Claude Code are bouncing around blind. Add this right after the description:
## Contents
- [Purpose](#purpose)
- [When to Use](#when-to-use)
- [Installation](#installation)
- [Transcription Workflow](#transcription-workflow)
- [Context Files](#context-files)
- [Model Selection Guide](#model-selection-guide)
- [Troubleshooting](#troubleshooting)
This alone gets you +1 point toward PDA and...
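If you would rather generate the contents block than maintain it by hand, a small Python sketch along these lines could do it. It assumes GitHub-style heading anchors (lowercase, punctuation dropped, spaces turned into hyphens), which is an approximation and not a full implementation of any renderer's slug rules; `slugify` and `build_toc` are hypothetical helper names.

```python
import re

FENCE = "`" * 3  # marker that opens or closes a fenced code block


def slugify(title: str) -> str:
    """Approximate a GitHub-style anchor: lowercase, drop punctuation, hyphenate spaces."""
    cleaned = re.sub(r"[^\w\s-]", "", title.strip().lower())
    return re.sub(r"\s", "-", cleaned)


def build_toc(markdown: str, level: int = 2) -> str:
    """Return a '## Contents' block linking to every heading at the given level."""
    marker = "#" * level + " "
    lines = ["## Contents"]
    in_fence = False
    for line in markdown.splitlines():
        if line.startswith(FENCE):
            in_fence = not in_fence  # skip '#' comment lines inside code blocks
            continue
        if not in_fence and line.startswith(marker):
            title = line[len(marker):].strip()
            if title != "Contents":  # do not list the TOC itself
                lines.append(f"- [{title}](#{slugify(title)})")
    return "\n".join(lines)
```

Calling `build_toc(Path("SKILL.md").read_text(encoding="utf-8"))` (with `pathlib.Path` imported) would print-ready the list for pasting under the description; headings with unusual punctuation may need their anchors checked by hand.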