markitdown
Convert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use when converting documents to markdown, extracting text from PDFs/Office files, transcribing audio, performing OCR on images, extracting YouTube transcripts, or processing batches of files. Supports 20+ formats including DOCX, XLSX, PPTX, PDF, HTML, EPUB, CSV, JSON, images with OCR, and audio with transcription.
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install jimmc414/Kosmos/markitdownskilz install jimmc414/Kosmos/markitdown --agent opencodeskilz install jimmc414/Kosmos/markitdown --agent codexskilz install jimmc414/Kosmos/markitdown --agent geminiFirst time? Install Skilz: pip install skilz
Works with 14 AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/jimmc414/Kosmoscp -r Kosmos/kosmos-claude-scientific-skills/scientific-skills/markitdown ~/.claude/skills/Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
markitdown
by jackspace
Convert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use whe...
frontend-design
by anthropics
Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages...
canvas-design
by anthropics
Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user asks to create a poster, pi...
by anthropics
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Cla...
Agentic Skill Details
- Repository
- Kosmos
- Stars
- 304
- Forks
- 61
- Type
- Technical
- Meta-Domain
- productivity
- Primary Domain
- Market Score
- 80
Agent Skill Grade
B Score: 80/100 Click to see breakdown
Score Breakdown
Areas to Improve
- Second-person voice in references
- Verbose code comments
- Missing TOC in long references
Recommendations
- Add trigger phrases to description for discoverability
- Add table of contents for files over 100 lines
Graded: 2026-01-05
Developer Feedback
I took a look at your markitdown skill and wanted to share some thoughts.
Links:
The TL;DR
You're at 80/100, solidly in B territory. This evaluation is based on Anthropic's Claude Skills best practices across five pillars. Your strongest area is Progressive Disclosure Architecture (26/30) — you've nailed the layered structure with a clean SKILL.md overview and five focused reference files. The weakest area is Spec Compliance (12/15) and Writing Style (7/10), where some smaller refinements would push you higher.
What's Working Well
- Progressive disclosure is chef's kiss — Your five reference files (structured_data, web_content, document_conversion, media_processing, advanced_integrations) sit exactly one level deep from SKILL.md. That's the sweet spot for token economy and discoverability.
- Practical utility — You're solving a real problem: converting 20+ file formats to Markdown for LLM processing. The input/output examples and batch processing templates show you understand actual workflows.
- Modular design — Trigger phrases cover the common cases (convert, extract, transcribe, OCR, batch). The "When to Use" section helps developers understand scope without reading everything.
- Rich examples — Both CLI and Python code examples; good error handling patterns scattered through the references.
The Big One
Your writing voice is inconsistent, and it's costing you points. The spec wants imperative/instructional voice throughout, but your references slip into second-person statements like "Use high-resolution images for better accuracy" in media_processing.md. This violates the voice requirements and pulls down y...
Browse Category
More productivity Agentic SkillsReport Security Issue
Found a security vulnerability in this agent skill?
Report Security Issue
Thank you for helping keep SkillzWave secure. We'll review your report and take appropriate action.
Note: For critical security issues that require immediate attention, please also email security@skillzwave.ai directly.