evaluation
Build evaluation frameworks for agent systems. Use when testing agent performance, validating context engineering choices, or measuring improvements over time.
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install muratcankoylan/Agent-Skills-for-Context-Engineering/evaluation skilz install muratcankoylan/Agent-Skills-for-Context-Engineering/evaluation --agent opencode skilz install muratcankoylan/Agent-Skills-for-Context-Engineering/evaluation --agent codex skilz install muratcankoylan/Agent-Skills-for-Context-Engineering/evaluation --agent gemini
First time? Install Skilz: pip install skilz
Works with 14 AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/muratcankoylan/Agent-Skills-for-Context-Engineering cp -r Agent-Skills-for-Context-Engineering/skills/evaluation ~/.claude/skills/ Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
Command Development
by davila7This skill should be used when the user asks to "create a slash command", "add a command", "write a custom command", "define command arguments", "use ...
command-development
by fcakyonThis skill should be used when the user asks to "create a slash command", "add a command", "write a custom command", "define command arguments", "use ...
plan-down
by VCnoCMethod clarity-driven planning workflow using zen-mcp tools (chat, planner, consensus). Phase 0 uses chat to judge if user provides clear implementati...
context-engineering-collection
by muratcankoylanA comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimi...
Agentic Skill Details
- Owner
- muratcankoylan (GitHub)
- Repository
- Agent-Skills-for-Context-Engineering
- Type
- Non-Technical
- Meta-Domain
- general
- Primary Domain
- general
- Sub-Domain
- cd build pipeline
- Market Score
- 23.1
Browse Category
More general Agentic SkillsReport Security Issue
Found a security vulnerability in this agent skill?