whisper-transcribe

Transcribes audio and video files to text using OpenAI's Whisper CLI with contextual grounding. Converts audio/video to text, transcribes recordings, and creates transcripts from media files. Use when asked to "whisper transcribe", "transcribe audio", "convert recording to text", or "speech to text". Uses markdown files in the same directory as context to improve transcription accuracy for technical terms, proper nouns, and domain-specific vocabulary.

#context#text#files#transcribe#agentic-skill#whisper transcribe#context files#markdown files

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files.

Installation for Agentic Skill

skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent opencode
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent codex
skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent gemini

First time? Install Skilz: pip install skilz

Works with 22+ AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

Download Agent Skill ZIP

Extract and copy to ~/.claude/skills/ then restart Claude Desktop

1. Clone the repository:
git clone https://github.com/SpillwaveSolutions/whisper-transcribe
2. Copy the agent skill directory:
cp -r whisper-transcribe ~/.claude/skills/

Agentic Skill Details

Type: Non-Technical
Meta-Domain: general
Primary Domain: general
Sub-Domain: video whisper transcription
Market Score: 89

Agent Skill Grade

B
Score: 89/100

Score Breakdown

Spec Compliance: 14/15
PDA Architecture: 25/30
Ease of Use: 21/25
Writing Style: 8/10
Utility: 17/20
Modifiers: +4
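
The breakdown above appears to add up as follows. This is a small arithmetic check, assuming the final score is simply the sum of the category scores plus the modifiers; the listing does not state the formula, but the numbers are consistent with it.

```python
# Category scores as (earned, maximum), copied from the breakdown above.
scores = {
    "Spec Compliance": (14, 15),
    "PDA Architecture": (25, 30),
    "Ease of Use": (21, 25),
    "Writing Style": (8, 10),
    "Utility": (17, 20),
}
modifiers = 4  # the "+4" listed under Modifiers

earned = sum(e for e, _ in scores.values())
maximum = sum(m for _, m in scores.values())

# Assumption: final score = category sum + modifiers (not stated in the listing).
total = earned + modifiers

print(f"{earned}/{maximum} + {modifiers} = {total}")  # 85/100 + 4 = 89
```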

Areas to Improve

  • Missing TOC in SKILL.md
  • Duplicated model comparison
  • Installation section too verbose

Recommendations

  • Add trigger phrases to description for discoverability
  • Add table of contents for files over 100 lines

Graded: 2026-01-19

Developer Feedback

I took a look at your whisper-transcribe skill and wanted to share some thoughts.

The TL;DR

You're at 89/100, solid B-grade territory. This is based on Anthropic's skill best practices rubric. Your strongest area is Spec Compliance (14/15) – you nailed the YAML frontmatter and naming conventions. The weaker spots are Progressive Disclosure (25/30) and Utility (17/20), mostly around how you're organizing information and guiding users through context file creation.

What's Working Well

  • Excellent trigger phrases – Your metadata includes file extensions (.mp3, .wav) and descriptive triggers like "speech-to-text" that'll activate the skill appropriately in real workflows.
  • Smart reference architecture – You've got whisper-options.md handling the deep CLI details and context-template.md as a practical asset. This layering is solid.
  • Real workflow clarity – The 4-step process (find files → transcribe → ground → save) is concrete and actually shows what users will do, not just theory.
  • Context grounding is genuinely useful – This isn't another wrapper around Whisper; the markdown context feature solves a real problem (accuracy on technical terms and names).
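
To make the find → transcribe → ground → save flow concrete, here is a rough sketch of what it could look like in Python. It is not the skill's actual implementation, which this page does not show. In particular, it swaps in Whisper's `--initial_prompt` option as one plausible way to feed context vocabulary to the model before transcription, whereas the skill lists grounding as a step after transcription. The helper names (`find_media`, `load_context`, `build_command`), the `max_chars` cap, and the default `base` model are all hypothetical.

```python
import shlex
from pathlib import Path

# Only the two extensions mentioned above; the skill likely handles more.
MEDIA_EXTENSIONS = {".mp3", ".wav"}


def find_media(directory: Path) -> list[Path]:
    """Step 1: find audio files in the directory."""
    return sorted(
        p for p in directory.iterdir()
        if p.is_file() and p.suffix.lower() in MEDIA_EXTENSIONS
    )


def load_context(directory: Path, max_chars: int = 800) -> str:
    """Collect text from sibling markdown files and flatten it to one line.

    max_chars is a hypothetical cap: Whisper only uses a limited amount of
    prompt text, so very long context gains nothing.
    """
    parts = [p.read_text(encoding="utf-8") for p in sorted(directory.glob("*.md"))]
    flattened = " ".join(" ".join(parts).split())
    return flattened[:max_chars]


def build_command(media: Path, context: str, model: str = "base") -> list[str]:
    """Steps 2-4: build a Whisper CLI call that writes a .txt next to the media."""
    cmd = [
        "whisper", str(media),
        "--model", model,
        "--output_format", "txt",
        "--output_dir", str(media.parent),
    ]
    if context:
        # Assumption: grounding via the initial prompt, not the skill's own method.
        cmd += ["--initial_prompt", context]
    return cmd


if __name__ == "__main__":
    here = Path.cwd()
    context = load_context(here)
    for media in find_media(here):
        # Printed rather than executed, so the sketch runs without Whisper installed.
        print(shlex.join(build_command(media, context)))
```

To actually run a transcription, each command list could be passed to `subprocess.run(cmd, check=True)` once the `whisper` CLI is installed.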

The Big One: Missing Table of Contents

Your SKILL.md hits 254 lines but has no TOC. For a document that long, users browsing in Claude Code are bouncing around blind. Add this right after the description:

## Contents
- [Purpose](#purpose)
- [When to Use](#when-to-use)
- [Installation](#installation)
- [Transcription Workflow](#transcription-workflow)
- [Context Files](#context-files)
- [Model Selection Guide](#model-selection-guide)
- [Troubleshooting](#troubleshooting)

This alone gets you +1 point toward PDA and...
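
If you would rather not maintain the Contents list by hand, a short script can rebuild it from the `##` headings. The sketch below assumes GitHub-style anchors (lowercase, punctuation dropped, spaces turned into hyphens); other renderers may slug headings differently. The function names `slugify` and `build_toc` are illustrative only.

```python
import re

FENCE = "`" * 3  # marker that opens or closes a fenced code block


def slugify(heading: str) -> str:
    """Approximate a GitHub-style anchor for a heading."""
    slug = heading.strip().lower()
    slug = re.sub(r"[^\w\s-]", "", slug)  # drop punctuation
    return re.sub(r"\s", "-", slug)       # each whitespace char becomes a hyphen


def build_toc(markdown: str) -> str:
    """Return a '## Contents' list covering every level-2 heading."""
    lines = ["## Contents"]
    in_fence = False
    for line in markdown.splitlines():
        if line.startswith(FENCE):
            in_fence = not in_fence  # skip '##' lines inside code blocks
            continue
        if not in_fence and line.startswith("## "):
            title = line[3:].strip()
            if title != "Contents":  # do not list the TOC itself
                lines.append(f"- [{title}](#{slugify(title)})")
    return "\n".join(lines)


if __name__ == "__main__":
    sample = "# Skill\n\n## Purpose\n\n## When to Use\n\n## Model Selection Guide\n"
    print(build_toc(sample))
```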

AI-Detected Topics

Extracted using NLP analysis

context text files transcribe agentic-skill whisper transcribe context files markdown files claude-code-skill audio
