blip-2-vision-language
Vision-language pre-training framework bridging frozen image encoders and LLMs. Use when you need image captioning, visual question answering, image-text retrieval, or multimodal chat with state-of-the-art zero-shot performance.
Third-Party Skill: Review the code before installing. Skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation
View all platforms →skilz install zechenzhangAGI_AI-research-SKILLs/blip-2-vision-language skilz install zechenzhangAGI_AI-research-SKILLs/blip-2-vision-language --agent opencode skilz install zechenzhangAGI_AI-research-SKILLs/blip-2-vision-language --agent codex skilz install zechenzhangAGI_AI-research-SKILLs/blip-2-vision-language --agent gemini
First time? Install Skilz: pip install skilz
Works with 14 AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/zechenzhangAGI/AI-research-SKILLs cp -r AI-research-SKILLs/18-multimodal/blip-2 ~/.claude/skills/ Need detailed installation help? Check our platform-specific guides:
Related Skills
image-gen
Generate compelling cover images and in-article illustrations for technical articles using the imagen CLI tool. Use this skill when creating visual as...
scientific-slides
"Build slide decks and presentations for research talks. Use this for making PowerPoint slides, conference presentations, seminar talks, research pres...
latex-posters
"Create professional research posters in LaTeX using beamerposter, tikzposter, or baposter. Support for conference presentations, academic posters, an...
firstspirit-templating
This skill provides comprehensive knowledge for templating in the FirstSpirit CMS, specifically focused on SiteArchitect development. This skill shoul...
Details
- Owner
- zechenzhangAGI
- Repository
- AI-research-SKILLs
- Stars
- 422
- Forks
- 30
- Type
- Technical
- Meta-Domain
- media
- Primary Domain
- image
- Sub-Domain
- images text
- Skill Size
- 48.3 KB
- Files
- 3
- Quality Score
- 48.0
AI-Detected Topics
Extracted using NLP analysis
Browse Category
More media skillsReport Security Issue
Found a security vulnerability in this skill?