Skillzwave

ai-multimodal

993 stars 188 forks Updated Nov 15, 2025
60.8

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understand images (captioning, object detection, OCR, visual Q&A, segmentation), process videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extract from documents (PDF tables, forms, charts, diagrams, multi-page), generate images (text-to-image, editing, com

Commands
#ai multimodal description#analysis youtube processing#process audio images#gemini api audio#ai features
Also in: video api data analysis

Third-Party Skill: Review the code before installing. Skills execute in your AI assistant's environment and can access your files. Learn more about security

skilz install mrgoonie_claudekit-skills/ai-multimodal
skilz install mrgoonie_claudekit-skills/ai-multimodal --agent opencode
skilz install mrgoonie_claudekit-skills/ai-multimodal --agent codex
skilz install mrgoonie_claudekit-skills/ai-multimodal --agent gemini

First time? Install Skilz: pip install skilz

Works with 14 AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

View All Agents
Download Skill ZIP

Extract and copy to ~/.claude/skills/ then restart Claude Desktop

1. Clone the repository:
git clone https://github.com/mrgoonie/claudekit-skills
2. Copy the skill directory:
cp -r claudekit-skills/.claude/skills/ai-multimodal ~/.claude/skills/

Need detailed installation help? Check our platform-specific guides:

Related Skills

Details

Stars
993
Forks
188
Type
Technical
Meta-Domain
media
Primary Domain
image
Sub-Domain
audio hours
Skill Size
188.7 KB
Files
15
Quality Score
60.8

AI-Detected Topics

Extracted using NLP analysis

ai multimodal description analysis youtube processing process audio images gemini api audio ai features

Browse Category

More media skills

Report Security Issue

Found a security vulnerability in this skill?