Skillzwave Logo
Skillzwave

whisper-transcribe

89.0
B

Transcribes audio and video files to text using OpenAI's Whisper CLI with contextual grounding. Converts audio/video to text, transcribes recordings, and creates transcripts from media files. Use when asked to "whisper transcribe", "transcribe audio", "convert recording to text", or "speech to text". Uses markdown files in the same directory as context to improve transcription accuracy for technical terms, proper nouns, and domain-specific vocabulary.

#context#text#files#transcribe#agentic-skill#whisper transcribe#context files#markdown files

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security

Installation for Agentic Skill

View all platforms →
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent opencode
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent codex
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent gemini

First time? Install Skilz: pip install skilz

Works with 22+ AI coding agents

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

View All Agents
Download Agent Skill ZIP

Extract and copy to ~/.claude/skills/ then restart Claude Desktop

1. Clone the repository:
git clone https://github.com/SpillwaveSolutions/whisper-transcribe
2. Copy the agent skill directory:
cp -r whisper-transcribe ~/.claude/skills/

Need detailed installation help? Check our platform-specific guides:

Related Agentic Skills

stt-transcription

by astoreyai

Speech-to-text transcription using multiple engines (Whisper, Google Speech, Azure, AssemblyAI). Record audio, transcribe files, real-time transcripti...

25
generalwhisper transcription

Video Processor

by Microck

Process video files with audio extraction, format conversion (mp4, webm), and Whisper transcription. Use when user mentions video conversion, audio ex...

19
generalwhisper transcription

opencode_cli

by SpillwaveSolutions

This skill should be used when configuring or using the OpenCode CLI for headless LLM automation. Use when the user asks to "configure opencode", "use...

100
generalpatterns skill

sdd

by SpillwaveSolutions

This skill should be used when users want guidance on Spec-Driven Development methodology using GitHub's Spec-Kit. Guide users through executable spec...

100
generalskill use

Agentic Skill Details

Type
Non-Technical
Meta-Domain
general
Primary Domain
general
Sub-Domain
whisper transcription
Market Score
89.0

Agent Skill Grade

B
Score: 89/100 Click to see breakdown

Score Breakdown

Spec Compliance
14/15
PDA Architecture
25/30
Ease of Use
21/25
Writing Style
8/10
Utility
17/20
Modifiers: +4

Areas to Improve

  • SKILL.md is 254 lines but lacks a Table of Contents for navigation.
  • Model selection examples appear in both files; SKILL.md should just reference the detailed comparison.
  • Installation takes 26 lines; could be condensed or moved to reference.

Recommendations

  • Add trigger phrases to description for discoverability
  • Add table of contents for files over 100 lines

Graded: 1/18/2026

Developer Feedback

I took a look at your whisper-transcribe skill and wanted to share some thoughts.

Links:

The TL;DR

You're at 89/100, solid B-grade territory. This is based on Anthropic's skill best practices rubric. Your strongest area is Spec Compliance (14/15) – you nailed the YAML frontmatter and naming conventions. The weaker spots are Progressive Disclosure (25/30) and Utility (17/20), mostly around how you're organizing information and guiding users through context file creation.

What's Working Well

  • Excellent trigger phrases – Your metadata includes file extensions (.mp3, .wav) and descriptive triggers like "speech-to-text" that'll activate the skill appropriately in real workflows.
  • Smart reference architecture – You've got whisper-options.md handling the deep CLI details and context-template.md as a practical asset. This layering is solid.
  • Real workflow clarity – The 4-step process (find files → transcribe → ground → save) is concrete and actually shows what users will do, not just theory.
  • Context grounding is genuinely useful – This isn't another wrapper around Whisper; the markdown context feature solves a real problem (accuracy on technical terms and names).

The Big One: Missing Table of Contents

Your SKILL.md hits 254 lines but has no TOC. For a document that long, users browsing in Claude Code are bouncing around blind. Add this right after the description:

## Contents
- [Purpose](#purpose)
- [When to Use](#when-to-use)
- [Installation](#installation)
- [Transcription Workflow](#transcription-workflow)
- [Context Files](#context-files)
- [Model Selection Guide](#model-selection-guide)
- [Troubleshooting](#troubleshooting)

This alone gets you +1 point toward PDA and makes the skill way more usable in Claude Code's interface.

Other Things Worth Fixing

  1. Trim the installation section – Currently 26 lines; condense to 8-10 lines with platform-specific commands side-by-side. The verbose explanations belong in a reference file, not inline.

  2. Guide context file creation – Step 1 says "optionally create one using assets/context-template.md" but doesn't explain what minimum content is needed. Change to: "Create one with at minimum: speaker names and key technical terms. See assets/context-template.md for the full template."

  3. Deduplicate the model comparison – You've got the same table in both SKILL.md and references/whisper-options.md. Keep the detailed one in the reference; SKILL.md should just say "See references/whisper-options.md for the full model comparison."

Quick Wins

  • Add TOC to SKILL.md (+1 point, biggest bang for buck)
  • Consolidate installation section (+1 point)
  • Remove duplicate model table (+1 point)
  • Clarify context file requirements (+1 point)

These four changes realistically get you to 93/100, A-grade territory.


Checkout your skill here: SkillzWave.ai | SpillWave We have an agentic skill installer that installs skills in 14+ coding agent platforms. Check out this guide on how to improve your agentic skills.

AI-Detected Topics

Extracted using NLP analysis

context text files transcribe agentic-skill whisper transcribe context files markdown files claude-code-skill audio transcription Whisper

Report Security Issue

Found a security vulnerability in this agent skill?