whisper-transcribe

Name: whisper-transcribe
Rating: 4.5 (1 reviews)
Author: SpillwaveSolutions

89.0

Transcribes audio and video files to text using OpenAI's Whisper CLI with contextual grounding. Converts audio/video to text, transcribes recordings, and creates transcripts from media files. Use when asked to "whisper transcribe", "transcribe audio", "convert recording to text", or "speech to text". Uses markdown files in the same directory as context to improve transcription accuracy for technical terms, proper nouns, and domain-specific vocabulary.

#context#text#files#transcribe#agentic-skill#whisper transcribe#context files#markdown files

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security

Installation for Agentic Skill

View all platforms →

Claude Code (CLI) Fast

skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe

OpenCode (CLI) Fast

skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent opencode

OpenAI Codex (CLI) Native

skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent codex

Gemini CLI (Project) Project

skilz install SpillwaveSolutions/whisper-transcribe/whisper-transcribe --agent gemini

First time? Install Skilz: pip install skilz

Works with 22+ AI coding agents

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

View All Agents

For Claude Desktop Easy

Download Agent Skill ZIP

Extract and copy to ~/.claude/skills/ then restart Claude Desktop

Manual Installation

1. Clone the repository:

git clone https://github.com/SpillwaveSolutions/whisper-transcribe

2. Copy the agent skill directory:

cp -r whisper-transcribe ~/.claude/skills/

View on GitHub

Need detailed installation help? Check our platform-specific guides:

Claude Desktop Guide Claude Code Guide Troubleshooting

Related Agentic Skills

stt-transcription

by astoreyai

Speech-to-text transcription using multiple engines (Whisper, Google Speech, Azure, AssemblyAI). Record audio, transcribe files, real-time transcripti...

general› whisper transcription

Video Processor

by Microck

Process video files with audio extraction, format conversion (mp4, webm), and Whisper transcription. Use when user mentions video conversion, audio ex...

general› whisper transcription

opencode_cli

by SpillwaveSolutions

This skill should be used when configuring or using the OpenCode CLI for headless LLM automation. Use when the user asks to "configure opencode", "use...

100

general› patterns skill

sdd

by SpillwaveSolutions

This skill should be used when users want guidance on Spec-Driven Development methodology using GitHub's Spec-Kit. Guide users through executable spec...

100

general› skill use

Agentic Skill Details

Owner: SpillwaveSolutions (GitHub)
Repository: whisper-transcribe
Type: Non-Technical
Meta-Domain: general
Primary Domain: general
Sub-Domain: whisper transcription
Market Score: 89.0

Agentic Skill Grades →

Agent Skill Grade

Score: 89/100 Click to see breakdown

Score Breakdown

Spec Compliance

14/15

PDA Architecture

25/30

Ease of Use

21/25

Writing Style

8/10

Utility

17/20

Modifiers: +4

Areas to Improve

SKILL.md is 254 lines but lacks a Table of Contents for navigation.
Model selection examples appear in both files; SKILL.md should just reference the detailed comparison.
Installation takes 26 lines; could be condensed or moved to reference.

Recommendations

Add trigger phrases to description for discoverability
Add table of contents for files over 100 lines

Graded: 1/18/2026

Developer Feedback

I took a look at your whisper-transcribe skill and wanted to share some thoughts.

Links:

The TL;DR

You're at 89/100, solid B-grade territory. This is based on Anthropic's skill best practices rubric. Your strongest area is Spec Compliance (14/15) – you nailed the YAML frontmatter and naming conventions. The weaker spots are Progressive Disclosure (25/30) and Utility (17/20), mostly around how you're organizing information and guiding users through context file creation.

What's Working Well

Excellent trigger phrases – Your metadata includes file extensions (.mp3, .wav) and descriptive triggers like "speech-to-text" that'll activate the skill appropriately in real workflows.
Smart reference architecture – You've got whisper-options.md handling the deep CLI details and context-template.md as a practical asset. This layering is solid.
Real workflow clarity – The 4-step process (find files → transcribe → ground → save) is concrete and actually shows what users will do, not just theory.
Context grounding is genuinely useful – This isn't another wrapper around Whisper; the markdown context feature solves a real problem (accuracy on technical terms and names).

The Big One: Missing Table of Contents

Your SKILL.md hits 254 lines but has no TOC. For a document that long, users browsing in Claude Code are bouncing around blind. Add this right after the description:

## Contents
- [Purpose](#purpose)
- [When to Use](#when-to-use)
- [Installation](#installation)
- [Transcription Workflow](#transcription-workflow)
- [Context Files](#context-files)
- [Model Selection Guide](#model-selection-guide)
- [Troubleshooting](#troubleshooting)

This alone gets you +1 point toward PDA and makes the skill way more usable in Claude Code's interface.

Other Things Worth Fixing

Trim the installation section – Currently 26 lines; condense to 8-10 lines with platform-specific commands side-by-side. The verbose explanations belong in a reference file, not inline.
Guide context file creation – Step 1 says "optionally create one using assets/context-template.md" but doesn't explain what minimum content is needed. Change to: "Create one with at minimum: speaker names and key technical terms. See assets/context-template.md for the full template."
Deduplicate the model comparison – You've got the same table in both SKILL.md and references/whisper-options.md. Keep the detailed one in the reference; SKILL.md should just say "See references/whisper-options.md for the full model comparison."

Quick Wins

Add TOC to SKILL.md (+1 point, biggest bang for buck)
Consolidate installation section (+1 point)
Remove duplicate model table (+1 point)
Clarify context file requirements (+1 point)

These four changes realistically get you to 93/100, A-grade territory.

Checkout your skill here: SkillzWave.ai | SpillWave We have an agentic skill installer that installs skills in 14+ coding agent platforms. Check out this guide on how to improve your agentic skills.

AI-Detected Topics

Extracted using NLP analysis

context text files transcribe agentic-skill whisper transcribe context files markdown files claude-code-skill audio transcription Whisper

Browse Category

More general Agentic Skills

Report Security Issue

Found a security vulnerability in this agent skill?