evaluating-machine-learning-models

Name: evaluating-machine-learning-models
Rating: 0.9 (1 review)
Author: jeremylongshore

This skill lets Claude evaluate machine learning models against a suite of metrics, including accuracy, precision, recall, and F1-score. Use it when the user asks for model performance analysis, validation, or testing. It triggers when the user mentions "evaluate model", "model performance", "testing metrics", or "validation results", or asks for a comprehensive "model evaluation".
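
For readers unfamiliar with the metrics named above, the following is a minimal Python sketch of how accuracy, precision, recall, and F1-score can be computed for a binary classifier. It is an illustration only, not the skill's own code: the names `BinaryMetrics` and `evaluate_binary` are hypothetical, and the sketch assumes labels are plain integers with `1` as the positive class.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class BinaryMetrics:
    """Container for the four metrics (hypothetical helper type)."""
    accuracy: float
    precision: float
    recall: float
    f1: float


def evaluate_binary(
    y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1
) -> BinaryMetrics:
    """Compute accuracy, precision, recall, and F1 for binary labels.

    Hypothetical helper for illustration; not part of the skill itself.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("at least one sample is required")

    pairs = list(zip(y_true, y_pred))
    # Count the confusion-matrix cells.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn

    accuracy = (tp + tn) / len(pairs)
    # Fall back to 0.0 when a denominator is zero (no predicted or no
    # actual positives); this is a convention chosen for the sketch.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return BinaryMetrics(accuracy, precision, recall, f1)


if __name__ == "__main__":
    # Made-up labels, used only to exercise the function.
    truth = [1, 0, 1, 1, 0, 0, 1, 0]
    preds = [1, 0, 0, 1, 0, 1, 1, 0]
    print(evaluate_binary(truth, preds))
```

In practice a library such as scikit-learn would usually be used for this; the hand-rolled version is shown only to make the definitions explicit.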

Also in: testing, monitoring, data analysis

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files.

Installation for Agentic Skill

Claude Code (CLI) Fast

skilz install jeremylongshore/claude-code-plugins-nixtla/evaluating-machine-learning-models

OpenCode (CLI) Fast

skilz install jeremylongshore/claude-code-plugins-nixtla/evaluating-machine-learning-models --agent opencode

OpenAI Codex (CLI) Native

skilz install jeremylongshore/claude-code-plugins-nixtla/evaluating-machine-learning-models --agent codex

Gemini CLI (Project) Project

skilz install jeremylongshore/claude-code-plugins-nixtla/evaluating-machine-learning-models --agent gemini

First time? Install Skilz: pip install skilz

Works with 14 AI coding assistants, including Cursor, Aider, Copilot, Windsurf, Qwen, and Kimi.

For Claude Desktop Easy

Download the agent skill ZIP, extract it, and copy the extracted folder to ~/.claude/skills/, then restart Claude Desktop.

Manual Installation

1. Clone the repository:

git clone https://github.com/jeremylongshore/claude-code-plugins-nixtla

2. Copy the agent skill directory:

cp -r claude-code-plugins-nixtla/archive/backups-20251108/skill-structure-cleanup-20251108-073936/plugins/ai-ml/model-evaluation-suite/skills/model-evaluation-suite ~/.claude/skills/

Agentic Skill Details

Owner: jeremylongshore (GitHub)
Repository: claude-code-plugins-nixtla
Type: Technical
Meta-Domain: data ai
Primary Domain: machine learning
Market Score: 17.1
