senior-data-engineer
World-class data engineering skill for building scalable data pipelines, ETL/ELT systems, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, and modern data stack. Includes data modeling, pipeline orchestration, data quality, and DataOps. Use when designing data architectures, building data pipelines, optimizing data workflows, or implementing data governance.
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install alirezarezvani/claude-skills/senior-data-engineer skilz install alirezarezvani/claude-skills/senior-data-engineer --agent opencode skilz install alirezarezvani/claude-skills/senior-data-engineer --agent codex skilz install alirezarezvani/claude-skills/senior-data-engineer --agent gemini
First time? Install Skilz: pip install skilz
Works with 22+ AI coding agents
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/alirezarezvani/claude-skills cp -r claude-skills/engineering-team/senior-data-engineer ~/.claude/skills/ Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
automating-mail
by SpillwaveSolutionsAutomates Apple Mail via JXA with AppleScript dictionary discovery. Use when asked to "automate email", "send mail via script", "JXA Mail automation",...
automating-reminders
by SpillwaveSolutionsAutomates Apple Reminders using JavaScript for Automation (JXA). Use when asked to "create reminders programmatically", "automate reminder lists", "JX...
mastering-postgresql
by SpillwaveSolutionsPostgreSQL development for Python with full-text search (tsvector, tsquery, BM25 via pg_search), vector similarity (pgvector with HNSW/IVFFlat), JSONB...
automating-contacts
by SpillwaveSolutionsAutomates macOS Contacts via JXA with AppleScript dictionary discovery. Use when asked to "automate contacts", "JXA contacts automation", "macOS addre...
Agentic Skill Details
- Owner
- alirezarezvani (GitHub)
- Repository
- claude-skills
- Type
- Other
- Meta-Domain
- N/A
- Primary Domain
- N/A
- Market Score
- 0.0
Agent Skill Grade
F
Score: 43/100
Click to see breakdown
Score Breakdown
Areas to Improve
- All three reference files contain nearly identical boilerplate content with no actual specialized information. They're placeholder templates, not real documentation.
- Excessive use of 'world-class' (6+ times), 'senior', 'advanced', 'comprehensive' without substance. Reads like a job posting, not technical documentation.
- Zero numbered step-by-step workflows for common tasks. Quick Start shows commands but not when/why to use them or what they do. No checklists for complex operations.
Recommendations
- Focus on improving Pda (currently 8/30)
- Focus on improving Ease Of Use (currently 11/25)
- Focus on improving Writing Style (currently 3/10)
Graded: 1/23/2026
Developer Feedback
Looking at your senior data engineer skill—I'm curious how you're thinking about the progression from spec to actual implementation, since there's a gap there that's affecting the score.
Links:
The TL;DR
You're at 43/100, which puts you in F territory. This is based on Anthropic's best practices for agentic skills. Your strongest area is Spec Compliance (12/15)—the YAML frontmatter and metadata are solid. But Progressive Disclosure Architecture (8/30) is where you're losing the most points. The reference files are basically empty templates, and the main file is bloated with marketing language instead of concrete guidance.
What's Working Well
- Metadata is clean. Your frontmatter validates, name follows conventions, and the description has solid trigger terms (designing data architectures, building pipelines, etc.)
- Spec compliance is tight. You nailed the required fields and file structure—no friction there.
- Breadth of scope. Covering ML, LLM, ETL, and streaming shows you're thinking about the full data engineering landscape.
The Big One: Empty Reference Files
This is your biggest issue. You've got three reference files (data_modeling_patterns.md, data_pipeline_architecture.md, dataops_best_practices.md) that are basically identical boilerplate placeholders. They say things like "World-class data pipeline architecture" and "Core Principles" but provide zero actual content—no schema patterns, no architecture diagrams, no CI/CD strategies, nothing.
Why it matters: References are supposed to be your PDA leverage. They let you keep SKILL.md lean while providing depth on demand. Right now they're just adding noise.
Concrete fix: Replace the boilerplate with actual content:
data_pipeline_architecture.md: Lambda vs Kappa architectures, stream vs batch tradeoffs, backpressure handling, exactly-once semantics with code snippetsdata_modeling_patterns.md: Star/snowflake schemas, SCD types, slowly changing dimensions with SQL examplesdataops_best_practices.md: Data testing frameworks, data contracts, pipeline observability, sample health check queries
Estimate: +12 points if you nail this.
Other Things Worth Fixing
Kill the marketing speak. "World-class" appears 6+ times. "Senior-level" gets repeated. Remove all of it—just say what the skill does. "Build scalable data pipelines" beats "World-class capabilities for building scalable data pipelines" every time. Impact: +5 points
Add workflows, not just commands. Your Quick Start shows command examples but no step-by-step workflows. Add numbered checklists for common tasks (e.g., "Building a Batch ETL Pipeline: 1) Design data model, 2) Validate schema, 3) Create pipeline, 4) Test"). Impact: +4 points
Narrow your scope or add decision frameworks. You're covering ML, LLMs, ETL, and streaming—that's huge. Either focus on one domain, or add a decision tree ("Use streaming for real-time events, batch for historical aggregations"). Right now it feels scattered. Impact: +3 points
Add examples and troubleshooting. Include before/after scenarios, sample configs, and "If X fails, try Y" sections. This bridges the gap between concept and execution. Impact: +3 points
Quick Wins
- Strip marketing language from every section
- Replace reference file boilerplate with one complete, concrete example per file
- Add a numbered workflow for at least one common data engineering task
- Include a troubleshooting section in SKILL.md
- Consider a table of contents at the top—227 lines is long without navigation
This is solid foundational work. The scaffolding is there. You just need to fill in the substance and cut the fluff.
Checkout your skill here: SkillzWave.ai | SpillWave We have an agentic skill installer that install skills in 14+ coding agent platforms. Check out this guide on how to improve your agentic skills.
AI-Detected Topics
Extracted using NLP analysis
Report Security Issue
Found a security vulnerability in this agent skill?