blip-2-vision-language

422 stars 30 forks Updated Dec 17, 2025

48.0

Vision-language pre-training framework bridging frozen image encoders and LLMs. Use when you need image captioning, visual question answering, image-text retrieval, or multimodal chat with state-of-the-art zero-shot performance.

Marketplace

#errors Error#solutions#image#image captioning#Model

Also in: machine learning docker ci cd

Third-Party Skill: Review the code before installing. Skills execute in your AI assistant's environment and can access your files. Learn more about security

Installation

View all platforms →

Claude Code (CLI) Fast

skilz install zechenzhangAGI_AI-research-SKILLs/blip-2-vision-language

OpenCode (CLI) Fast

skilz install zechenzhangAGI_AI-research-SKILLs/blip-2-vision-language --agent opencode

OpenAI Codex (CLI) Native

skilz install zechenzhangAGI_AI-research-SKILLs/blip-2-vision-language --agent codex

Gemini CLI (Project) Project

skilz install zechenzhangAGI_AI-research-SKILLs/blip-2-vision-language --agent gemini

First time? Install Skilz: pip install skilz

Works with 14 AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

View All Agents

For Claude Desktop Easy

Download Skill ZIP

Extract and copy to ~/.claude/skills/ then restart Claude Desktop

Manual Installation

1. Clone the repository:

git clone https://github.com/zechenzhangAGI/AI-research-SKILLs

2. Copy the skill directory:

cp -r AI-research-SKILLs/18-multimodal/blip-2 ~/.claude/skills/

View on GitHub

Need detailed installation help? Check our platform-specific guides:

Claude Desktop Guide Claude Code Guide Troubleshooting

Related Skills

image-gen

Generate compelling cover images and in-article illustrations for technical articles using the imagen CLI tool. Use this skill when creating visual as...

100.0

TECH image › images text

#cover images#images#ALT text

+networking +docker

scientific-slides

"Build slide decks and presentations for research talks. Use this for making PowerPoint slides, conference presentations, seminar talks, research pres...

78.0

TECH image › images text

Commands Marketplace

#slides#slide#generate slide

+pdf

latex-posters

"Create professional research posters in LaTeX using beamerposter, tikzposter, or baposter. Support for conference presentations, academic posters, an...

71.2

TECH image › images text

Commands Marketplace

#poster#content#echo

+data analysis +pdf

firstspirit-templating

This skill provides comprehensive knowledge for templating in the FirstSpirit CMS, specifically focused on SiteArchitect development. This skill shoul...

66.0

TECH image › images text

Marketplace

#template#templates#page templates

+database +javascript

Details

Owner: zechenzhangAGI
Repository: AI-research-SKILLs
Stars: 422
Forks: 30
Type: Technical
Meta-Domain: media
Primary Domain: image
Sub-Domain: images text
Skill Size: 48.3 KB
Files: 3
Quality Score: 48.0

AI-Detected Topics

Extracted using NLP analysis

errors Error solutions image image captioning Model

Browse Category

More media skills

Report Security Issue

Found a security vulnerability in this skill?