llava

62 stars 2 forks

"Large Language and Vision Assistant. Enables visual instruction tuning and image-based conversations. Combines CLIP vision encoder with Vicuna/LLaMA language models. Supports multi-turn image chat, visual question answering, and instruction following. Use for vision-language chatbots or image understanding tasks. Best for conversational image analysis."

Also in: docker data analysis

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security

Installation for Agentic Skill

View all platforms →

Claude Code (CLI) Fast

skilz install zechenzhangAGI/AI-research-SKILLs/llava

OpenCode (CLI) Fast

skilz install zechenzhangAGI/AI-research-SKILLs/llava --agent opencode

OpenAI Codex (CLI) Native

skilz install zechenzhangAGI/AI-research-SKILLs/llava --agent codex

Gemini CLI (Project) Project

skilz install zechenzhangAGI/AI-research-SKILLs/llava --agent gemini

First time? Install Skilz: pip install skilz

Works with 22+ AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

View All Agents

For Claude Desktop Easy

Download Agent Skill ZIP

Extract and copy to ~/.claude/skills/ then restart Claude Desktop

Manual Installation

1. Clone the repository:

git clone https://github.com/zechenzhangAGI/AI-research-SKILLs

2. Copy the agent skill directory:

cp -r AI-research-SKILLs/18-multimodal/llava ~/.claude/skills/

View on GitHub

Need detailed installation help? Check our platform-specific guides:

Claude Desktop Guide Claude Code Guide Troubleshooting

Related Agentic Skills

image-gen

by SpillwaveSolutions

Generate compelling cover images and in-article illustrations for technical articles using the imagen CLI tool. Use when asked to "generate images"...

TECHimage

Marketplace

image-gen

by SpillwaveSolutions

Generate compelling cover images and in-article illustrations for technical articles using the imagen CLI tool. Use when asked to "generate images"...

TECHimage

Marketplace

processing-computer-vision-tasks

by jeremylongshore

Process images using object detection, classification, and segmentation. Use when requesting "analyze image", "object detection", "image classifica...

image

Marketplace

processing-computer-vision-tasks

by jeremylongshore

Process images using object detection, classification, and segmentation. Use when requesting "analyze image", "object detection", "image classifica...

image

Marketplace

Agentic Skill Details

Owner: zechenzhangAGI (GitHub)
Repository: AI-research-SKILLs
Stars: 62
Forks: 2
Type: Technical
Meta-Domain: media
Primary Domain: image
Market Score: 26

Agentic Skill Grades →

Browse Category

More media Agentic Skills

Report Security Issue

Found a security vulnerability in this agent skill?