markitdown
Convert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use when converting documents to markdown, extracting text from PDFs/Office files, transcribing audio, performing OCR on images, extracting YouTube transcripts, or processing batches of files. Supports 20+ formats including DOCX, XLSX, PPTX, PDF, HTML, EPUB, CSV, JSON, images with OCR, and audio with transcription.
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install jimmc414/Kosmos/markitdownskilz install jimmc414/Kosmos/markitdown --agent opencodeskilz install jimmc414/Kosmos/markitdown --agent codexskilz install jimmc414/Kosmos/markitdown --agent geminiFirst time? Install Skilz: pip install skilz
Works with 22+ AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/jimmc414/Kosmoscp -r Kosmos/kosmos-claude-scientific-skills/scientific-skills/markitdown ~/.claude/skills/Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
markitdown
by jackspace
Convert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use whe...
markitdown
by jackspace
Convert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use whe...
agentdb-vector-search
by ruvnet
"Implement semantic vector search with AgentDB for intelligent document retrieval, similarity matching, and context-aware querying. Use when buildi...
rag-implementation
by wshobson
Build Retrieval-Augmented Generation (RAG) systems for LLM applications with vector databases and semantic search. Use when implementing knowledge-...
Agentic Skill Details
- Repository
- Kosmos
- Stars
- 304
- Forks
- 61
- Type
- Technical
- Meta-Domain
- productivity
- Primary Domain
- Market Score
- 80
Agent Skill Grade
B Score: 80/100 Click to see breakdown
Score Breakdown
Areas to Improve
- Second-person voice in references
- Verbose code comments
- Missing TOC in long references
Recommendations
- Add trigger phrases to description for discoverability
- Add table of contents for files over 100 lines
Graded: 2026-01-05
Developer Feedback
I took a look at your markitdown skill and wanted to share some thoughts.
Links:
The TL;DR
You're at 80/100, solidly in B territory. This evaluation is based on Anthropic's Claude Skills best practices across five pillars. Your strongest area is Progressive Disclosure Architecture (26/30) — you've nailed the layered structure with a clean SKILL.md overview and five focused reference files. The weakest area is Spec Compliance (12/15) and Writing Style (7/10), where some smaller refinements would push you higher.
What's Working Well
- Progressive disclosure is chef's kiss — Your five reference files (structured_data, web_content, document_conversion, media_processing, advanced_integrations) sit exactly one level deep from SKILL.md. That's the sweet spot for token economy and discoverability.
- Practical utility — You're solving a real problem: converting 20+ file formats to Markdown for LLM processing. The input/output examples and batch processing templates show you understand actual workflows.
- Modular design — Trigger phrases cover the common cases (convert, extract, transcribe, OCR, batch). The "When to Use" section helps developers understand scope without reading everything.
- Rich examples — Both CLI and Python code examples; good error handling patterns scattered through the references.
The Big One
Your writing voice is inconsistent, and it's costing you points. The spec wants imperative/instructional voice throughout, but your references slip into second-person statements like "Use high-resolution images for better accuracy" in media_processing.md. This violates the voice requirements and pulls down y...
Browse Category
More productivity Agentic SkillsReport Security Issue
Found a security vulnerability in this agent skill?
Report Security Issue
Thank you for helping keep SkillzWave secure. We'll review your report and take appropriate action.
Note: For critical security issues that require immediate attention, please also email security@skillzwave.ai directly.