ai-multimodal
Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understand images (captioning, object detection, OCR, visual Q&A, segmentation), process videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extract from documents (PDF tables, forms, charts, diagrams, multi-page), generate images (text-to-image, editing, ...
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install mrgoonie/claudekit-skills/ai-multimodalskilz install mrgoonie/claudekit-skills/ai-multimodal --agent opencodeskilz install mrgoonie/claudekit-skills/ai-multimodal --agent codexskilz install mrgoonie/claudekit-skills/ai-multimodal --agent geminiFirst time? Install Skilz: pip install skilz
Works with 22+ AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/mrgoonie/claudekit-skillscp -r claudekit-skills/.claude/skills/ai-multimodal ~/.claude/skills/Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
api-error-handling
by aj-geddes
Implement comprehensive API error handling with standardized error responses, logging, monitoring, and user-friendly messages. Use when building re...
markdownlint-integration
by TheBushidoCollective
Integrate markdownlint into development workflows including CLI usage, programmatic API, CI/CD pipelines, and editor integration.
route-tester
by diet103
Test authenticated routes in the your project using cookie-based authentication. Use this skill when testing API endpoints, validating route functi...
hook-development
by anthropics
This skill should be used when the user asks to "create a hook", "add a PreToolUse/PostToolUse/Stop hook", "validate tool use", "implement prompt-b...
Agentic Skill Details
- Repository
- claudekit-skills
- Stars
- 713
- Forks
- 113
- Type
- Technical
- Meta-Domain
- web api
- Primary Domain
- api
- Market Score
- 47
Browse Category
More web api Agentic SkillsReport Security Issue
Found a security vulnerability in this agent skill?
Report Security Issue
Thank you for helping keep SkillzWave secure. We'll review your report and take appropriate action.
Note: For critical security issues that require immediate attention, please also email security@skillzwave.ai directly.