ai-multimodal
Process and generate multimedia content using the Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis; up to 9.5 hours), understanding images (captioning, object detection, OCR, visual Q&A, segmentation), processing videos (scene detection, Q&A, temporal analysis, YouTube URLs; up to 6 hours), extracting from documents (PDF tables, forms, charts, diagrams, multi-page), generating images (text-to-image, editing, com...
Third-Party Skill: Review the code before installing. Skills execute in your AI assistant's environment and can access your files.
Installation
skilz install Linhv14_claude-skill/ai-multimodal
skilz install Linhv14_claude-skill/ai-multimodal --agent opencode
skilz install Linhv14_claude-skill/ai-multimodal --agent codex
skilz install Linhv14_claude-skill/ai-multimodal --agent gemini
First time? Install Skilz: pip install skilz
Works with 14 AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/Linhv14/claude-skill
cp -r claude-skill/.claude/skills/ai-multimodal ~/.claude/skills/
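The manual copy step can also be wrapped in a small function that checks the clone and creates the skills directory first, since a plain cp -r may fail when ~/.claude/skills/ does not exist yet. This is a minimal sketch, not part of the skill: install_skill is a hypothetical helper name, and the optional second argument is an assumption added for flexibility.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with the skill): copy ai-multimodal
# from a local clone into a skills directory.
#   $1 = path to the cloned claude-skill repository
#   $2 = target skills directory (assumed default: ~/.claude/skills)
install_skill() {
    repo_dir="$1"
    skills_dir="${2:-$HOME/.claude/skills}"
    src="$repo_dir/.claude/skills/ai-multimodal"

    # Stop early if the clone does not contain the skill at the expected path.
    if [ ! -d "$src" ]; then
        echo "skill not found at $src" >&2
        return 1
    fi

    # Create the skills directory in case this is the first skill installed.
    mkdir -p "$skills_dir" || return 1
    cp -r "$src" "$skills_dir/"
}
```

After running the git clone command above, calling install_skill claude-skill performs the copy; restart Claude Desktop afterwards.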
Related Skills
opencode_cli
This skill should be used when configuring or using the OpenCode CLI for headless LLM automation. Use when the user asks to "configure opencode", "use...
treatment-plans
Generate concise (3-4 page), focused medical treatment plans in LaTeX/PDF format for all clinical specialties. Supports general medical treatment, re...
citation-management
Comprehensive citation management for academic research. Search Google Scholar and PubMed for papers, extract accurate metadata, validate citations, a...
markitdown
Convert files and office documents to Markdown. Supports PDF, DOCX, PPTX, XLSX, images (with OCR), audio (with transcription), HTML, CSV, JSON, XML, ...
Details
- Owner: Linhv14
- Repository: claude-skill
- Stars: 0
- Forks: 0
- Type: Technical
- Meta-Domain: web api
- Primary Domain: api
- Sub-Domain: patterns skill
- Skill Size: 188.7 KB
- Files: 15
- Quality Score: 60.8