elo-ratings-math

1 stars
14
B

Explains the mathematical principles behind Elo rating systems, including expected score calculation, rating updates, and the K-factor. Use when implementing or understanding competitive rating systems.

Also in: ci cd networking

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security

Installation for Agentic Skill

View all platforms →
skilz install mcclowes/elo-elo/elo-ratings-math
skilz install mcclowes/elo-elo/elo-ratings-math --agent opencode
skilz install mcclowes/elo-elo/elo-ratings-math --agent codex
skilz install mcclowes/elo-elo/elo-ratings-math --agent gemini

First time? Install Skilz: pip install skilz

Works with 22+ AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

View All Agents
Download Agent Skill ZIP

Extract and copy to ~/.claude/skills/ then restart Claude Desktop

1. Clone the repository:
git clone https://github.com/mcclowes/elo-elo
2. Copy the agent skill directory:
cp -r elo-elo/.claude/skills/elo-ratings-math ~/.claude/skills/

Need detailed installation help? Check our platform-specific guides:

Related Agentic Skills

Agentic Skill Details

Repository
elo-elo
Stars
1
Type
Non-Technical
Meta-Domain
development
Primary Domain
github
Market Score
14

Agent Skill Grade

B
Score: 80/100 Click to see breakdown

Score Breakdown

Spec Compliance
12/15
PDA Architecture
18/30
Ease of Use
21/25
Writing Style
8/10
Utility
18/20
Modifiers: +3

Areas to Improve

  • Missing TOC
  • No Reference Files
  • Missing Implementation Templates

Recommendations

  • Address 1 high-severity issues first
  • Add trigger phrases to description for discoverability
  • Add table of contents for files over 100 lines

Graded: 2026-01-24

Developer Feedback

I've been digging into Elo rating implementations lately, and your skill's approach to the math fundamentals is solid—especially how you're breaking down the rating delta calculations. One thing I'm curious about: how do you handle the K-factor tuning for different skill levels, since that's where most implementations get brittle?

Links:

The TL;DR

You're at 80/100, solid B-grade territory. This evaluation's based on Anthropic's skill architecture best practices. Your strongest area is Utility (18/20)—the math is comprehensive and addresses a real capability gap. Weakest area is Progressive Disclosure Architecture (18/30)—you've got 229 lines crammed into a single file, which kills your token efficiency and navigation.

What's Working Well

  • Comprehensive mathematical coverage: You go from basic rating calculations all the way through multi-player extensions and variants (Glicko, TrueSkill). That's serious depth.
  • Excellent worked example: Lines 143-162 show step-by-step calculations with input/output pairs. That's the kind of concrete thing that actually helps people understand.
  • Clear metadata and triggers: Your description ("implementing or understanding competitive rating systems") is specific enough that the skill surfaces at the right times. The YAML frontmatter is clean and valid.
  • Bonus points for grep-friendliness: Your structure and consistent terminology (E_A, R_A, K-factor) makes it easy to search and reference.

The Big One: Missing Progressive Disclosure Structure

This is what's eating 12 points. You've written a comprehensive guide, but it's all sitting in one 229-line file. For files over 100 lines, Anthropic's best practice is to split content into reference files so developers c...

Report Security Issue

Found a security vulnerability in this agent skill?