markitdown
Convert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use when converting documents to markdown, extracting text from PDFs/Office files, transcribing audio, performing OCR on images, extracting YouTube transcripts, or processing batches of files. Supports 20+ formats including DOCX, XLSX, PPTX, PDF, HTML, EPUB, CSV, JSON, images with OCR, and audio with transcription.
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install jimmc414/Kosmos/markitdown skilz install jimmc414/Kosmos/markitdown --agent opencode skilz install jimmc414/Kosmos/markitdown --agent codex skilz install jimmc414/Kosmos/markitdown --agent gemini
First time? Install Skilz: pip install skilz
Works with 22+ AI coding agents
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/jimmc414/Kosmos cp -r Kosmos/kosmos-claude-scientific-skills/scientific-skills/markitdown ~/.claude/skills/ Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
markitdown
by jackspaceConvert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use when c...
markitdown
by jackspaceConvert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use when c...
agentdb-vector-search
by ruvnet"Implement semantic vector search with AgentDB for intelligent document retrieval, similarity matching, and context-aware querying. Use when building ...
rag-implementation
by wshobsonBuild Retrieval-Augmented Generation (RAG) systems for LLM applications with vector databases and semantic search. Use when implementing knowledge-gro...
Agentic Skill Details
- Repository
- Kosmos
- Type
- Technical
- Meta-Domain
- productivity
- Primary Domain
- Market Score
- 80.0
Agent Skill Grade
B
Score: 80/100
Click to see breakdown
Score Breakdown
Areas to Improve
- Uses second-person 'you' which violates skill spec voice requirements
- Code examples include comments like '# Handles UTF-8, special characters, quotes, etc.' that state the obvious
- References over 200 lines lack table of contents for quick navigation
Recommendations
- Add trigger phrases to description for discoverability
- Add table of contents for files over 100 lines
Graded: 1/5/2026
Developer Feedback
I took a look at your markitdown skill and wanted to share some thoughts.
Links:
The TL;DR
You're at 80/100, solidly in B territory. This evaluation is based on Anthropic's Claude Skills best practices across five pillars. Your strongest area is Progressive Disclosure Architecture (26/30) — you've nailed the layered structure with a clean SKILL.md overview and five focused reference files. The weakest area is Spec Compliance (12/15) and Writing Style (7/10), where some smaller refinements would push you higher.
What's Working Well
- Progressive disclosure is chef's kiss — Your five reference files (structured_data, web_content, document_conversion, media_processing, advanced_integrations) sit exactly one level deep from SKILL.md. That's the sweet spot for token economy and discoverability.
- Practical utility — You're solving a real problem: converting 20+ file formats to Markdown for LLM processing. The input/output examples and batch processing templates show you understand actual workflows.
- Modular design — Trigger phrases cover the common cases (convert, extract, transcribe, OCR, batch). The "When to Use" section helps developers understand scope without reading everything.
- Rich examples — Both CLI and Python code examples; good error handling patterns scattered through the references.
The Big One
Your writing voice is inconsistent, and it's costing you points. The spec wants imperative/instructional voice throughout, but your references slip into second-person statements like "Use high-resolution images for better accuracy" in media_processing.md. This violates the voice requirements and pulls down your Spec Compliance and Writing Style scores.
Fix: Rewrite passive instructions as imperative declarations. Instead of "Use high-resolution images...", say "High-resolution images improve OCR accuracy." This is one pass through all references (especially media_processing.md:72-77, advanced_integrations.md, and document_conversion.md). Impact: +2 points.
Other Things Worth Fixing
Strip verbose code comments — Lines like
# Handles UTF-8, special characters, quotes, etc.state the obvious. Let the code be self-documenting. This cuts unnecessary tokens and cleans up your token economy. Impact: +1 point.Add trigger phrases to your description — You've got solid ones (convert, extract, transcribe), but your SKILL.md description only lists 1-2. Expand that list so developers find you faster through search. Impact: +1 point.
Add TOCs to long references — Files over 200 lines (document_conversion.md, advanced_integrations.md) need a table of contents for quick navigation. Add a
## Contentssection at the top with anchor links. Impact: +1 point.Consolidate repetitive imports — Every code block re-imports
MarkItDownandOpenAI. Show imports once per section, then use abbreviated examples. Saves tokens. Impact: +1 point.
Quick Wins
- Fix voice consistency (second-person → imperative) across references — biggest bang for buck
- Strip obvious code comments
- Add 3-4 trigger phrases to SKILL.md description
- Add TOCs to references over 200 lines
- Deduplicate imports in code examples
These changes push you from 80 → 86-88 range with minimal effort.
Checkout your skill here: SkillzWave.ai | SpillWave We have an agentic skill installer that install skills in 14+ coding agent platforms. Check out this guide on how to improve your agentic skills.
Browse Category
More productivity Agentic SkillsReport Security Issue
Found a security vulnerability in this agent skill?