advanced-evaluation

27

Master LLM-as-a-Judge evaluation techniques including direct scoring, pairwise comparison, rubric generation, and bias mitigation. Use when building evaluation systems, comparing model outputs, or establishing quality standards for AI-generated content.

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security

Installation for Agentic Skill

View all platforms →
skilz install rohunvora/my-claude-skills/advanced-evaluation
skilz install rohunvora/my-claude-skills/advanced-evaluation --agent opencode
skilz install rohunvora/my-claude-skills/advanced-evaluation --agent codex
skilz install rohunvora/my-claude-skills/advanced-evaluation --agent gemini

First time? Install Skilz: pip install skilz

Works with 22+ AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

View All Agents
Download Agent Skill ZIP

Extract and copy to ~/.claude/skills/ then restart Claude Desktop

1. Clone the repository:
git clone https://github.com/rohunvora/my-claude-skills
2. Copy the agent skill directory:
cp -r my-claude-skills/.claude/skills/advanced-evaluation ~/.claude/skills/

Need detailed installation help? Check our platform-specific guides:

Related Agentic Skills

Agentic Skill Details

Type
Non-Technical
Meta-Domain
general
Primary Domain
general
Sub-Domain
api patterns skill
Market Score
27

Report Security Issue

Found a security vulnerability in this agent skill?