blip-2-vision-language
Vision-language pre-training framework bridging frozen image encoders and LLMs. Use when you need image captioning, visual question answering, image-text retrieval, or multimodal chat with state-of-the-art zero-shot performance.
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files.
Installation for Agentic Skill
skilz install zechenzhangAGI/AI-research-SKILLs/blip-2-vision-language
skilz install zechenzhangAGI/AI-research-SKILLs/blip-2-vision-language --agent opencode
skilz install zechenzhangAGI/AI-research-SKILLs/blip-2-vision-language --agent codex
skilz install zechenzhangAGI/AI-research-SKILLs/blip-2-vision-language --agent gemini
First time? Install Skilz: pip install skilz
Works with 14 AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/, then restart Claude Desktop:
git clone https://github.com/zechenzhangAGI/AI-research-SKILLs
cp -r AI-research-SKILLs/18-multimodal/blip-2 ~/.claude/skills/
Related Agentic Skills
image-gen
by SpillwaveSolutions. Generate compelling cover images and in-article illustrations for technical articles using the imagen CLI tool. Use this skill when creating visual as...
scientific-slides
by davila7. Build slide decks and presentations for research talks. Use this for making PowerPoint slides, conference presentations, seminar talks, research pres...
latex-posters
by davila7. Create professional research posters in LaTeX using beamerposter, tikzposter, or baposter. Support for conference presentations, academic posters, an...
Agentic Skill Details
- Owner: zechenzhangAGI (GitHub)
- Repository: AI-research-SKILLs
- Type: Non-Technical
- Meta-Domain: general
- Primary Domain: general
- Sub-Domain: images text
- Market Score: 28.5