prompt-benchmark


Systematic prompt evaluation framework with MATH, GSM8K, and Game of 24 benchmarks. Use when evaluating prompt effectiveness on standard benchmarks, comparing meta-prompting strategies quantitatively, measuring prompt quality improvements, or validating categorical prompt optimizations against ground truth datasets.

Also in: github, data analysis

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files.

Installing the Agent Skill

skilz install manutej/categorical-meta-prompting/prompt-benchmark
skilz install manutej/categorical-meta-prompting/prompt-benchmark --agent opencode
skilz install manutej/categorical-meta-prompting/prompt-benchmark --agent codex
skilz install manutej/categorical-meta-prompting/prompt-benchmark --agent gemini

First time? Install Skilz: pip install skilz

Works with 14 AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

Download Agent Skill ZIP

Extract the archive, copy it to ~/.claude/skills/, then restart Claude Desktop.
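
A minimal sketch of that step, assuming the download is saved as prompt-benchmark.zip and unpacks to a prompt-benchmark/ directory (both names are assumptions, not confirmed by this page):

# Assumes the archive is prompt-benchmark.zip and contains a prompt-benchmark/ folder
mkdir -p ~/.claude/skills
unzip prompt-benchmark.zip -d ~/.claude/skills/
# Restart Claude Desktop so the new skill is picked up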

1. Clone the repository:
git clone https://github.com/manutej/categorical-meta-prompting
2. Copy the agent skill directory:
cp -r categorical-meta-prompting/.claude/skills/prompt-benchmark ~/.claude/skills/
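
To confirm the copy landed where Claude Desktop looks for skills, a quick check with plain POSIX commands (the directory name is taken from the cp command above):

# The skill's files should appear here before you restart Claude Desktop
ls ~/.claude/skills/prompt-benchmark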

Need detailed installation help? Check the platform-specific guides.


Agentic Skill Details

Type: Non-Technical
Meta-Domain: development
Primary Domain: javascript
Market Score: 12
