Skillzwave Logo
Skillzwave

hypothesis-library

28.6
F

Curated repository of experiment hypotheses, assumptions, and historical learnings.

Marketplace

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security

Installation for Agentic Skill

View all platforms →
skilz install gtmagents/gtm-agents/hypothesis-library
skilz install gtmagents/gtm-agents/hypothesis-library --agent opencode
skilz install gtmagents/gtm-agents/hypothesis-library --agent codex
skilz install gtmagents/gtm-agents/hypothesis-library --agent gemini

First time? Install Skilz: pip install skilz

Works with 22+ AI coding agents

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

View All Agents
Download Agent Skill ZIP

Extract and copy to ~/.claude/skills/ then restart Claude Desktop

1. Clone the repository:
git clone https://github.com/gtmagents/gtm-agents
2. Copy the agent skill directory:
cp -r gtm-agents/plugins/growth-experiments/skills/hypothesis-library ~/.claude/skills/

Need detailed installation help? Check our platform-specific guides:

Related Agentic Skills

opencode_cli

by SpillwaveSolutions

This skill should be used when configuring or using the OpenCode CLI for headless LLM automation. Use when the user asks to "configure opencode", "use...

100
generalpatterns skill

sdd

by SpillwaveSolutions

This skill should be used when users want guidance on Spec-Driven Development methodology using GitHub's Spec-Kit. Guide users through executable spec...

100
generalskill use

sdd

by SpillwaveSolutions

This skill should be used when users want guidance on Spec-Driven Development methodology using GitHub's Spec-Kit. Guide users through executable spec...

100
generalskill use

malformed-yaml-skill

by SpillwaveSolutions

This skill has intentionally malformed YAML frontmatter for testing error handling. The YAML parser should detect the syntax error and handle it grace...

100
generalyaml session

Agentic Skill Details

Repository
gtm-agents
Type
Non-Technical
Meta-Domain
general
Primary Domain
general
Market Score
28.6

Agent Skill Grade

F
Score: 50/100 Click to see breakdown

Score Breakdown

Spec Compliance
11/15
PDA Architecture
12/30
Ease of Use
11/25
Writing Style
7/10
Utility
8/20
Modifiers: +1

Areas to Improve

  • No trigger phrases
  • Templates are listed but not provided; violates layered structure principle
  • Description lacks specific action verbs that would trigger skill activation

Recommendations

  • Focus on improving Pda (currently 12/30)
  • Focus on improving Ease Of Use (currently 11/25)
  • Focus on improving Utility (currently 8/20)

Graded: 1/24/2026

Developer Feedback

I've been diving into property-based testing frameworks lately, and your hypothesis-library skill caught my attention—though the 50/100 score suggests there might be some gaps between the concept and execution that are worth digging into.

Links:

TL;DR

You're at 50/100, solidly in F grade territory. This is based on Anthropic's skill evaluation best practices across five pillars. Your Spec Compliance is actually solid (11/15)—the frontmatter is valid and naming conventions are right—but Progressive Disclosure Architecture (12/30) and Utility (8/20) are where you're losing the most points. The core issue: you've got a great framework concept, but it's missing the teeth to be actually useful.

What's Working Well

  • Consistent terminology: You use 'hypothesis', 'learnings', and 'experiment' consistently throughout—no confusing terminology shifts that would make users stumble.
  • Solid metadata schema thinking: The idea of using ID, theme, persona, funnel stage, and metrics is the right foundation for structured experimentation tracking.
  • Logical section flow: "When to Use" → "Framework" → "Templates" → "Tips" follows a reasonable progression that's easy to scan.

The Big One: Missing Reference Files Kills Progressive Disclosure

Your skill lists three templates—intake form, learning card, portfolio dashboard—but provides zero actual content. This is a critical gap. You're telling users "here are templates" without showing them what they look like, which means they either guess or bounce.

The fix: Create three reference files:

  • references/intake-form.md – actual template with example fields
  • references/learning-card.md – structured format showing context/result/recommendation
  • references/portfolio-dashboard.md – example dashboard structure

This alone could add 7 points and make the skill actually usable instead of just aspirational. Right now it reads like a design doc, not executable guidance.

Other Things Worth Fixing

  1. Description needs trigger phrases (Medium severity) – You're missing specific action verbs. Instead of "Curated repository of experiment hypotheses," try: "Capture experiment hypotheses with structured metadata, track status from idea to archived, and retrieve past learnings. Use when 'log hypothesis', 'capture experiment idea', or 'search past experiments'." This helps users know when to invoke you.

  2. Framework lacks implementation steps (Medium severity) – "Metadata Schema – hypothesis ID, theme, persona..." reads like a spec. Make it actionable: "Assign unique ID (format: YYYY-MM-HYP-###), select theme from [list], identify target persona..." Users need to know how, not just what.

  3. Gerund forms in triggers (Medium severity) – Your "When to Use" section says "Capturing new experiment ideas" and "Referencing past wins"—use imperative instead: "Capture new experiment ideas" and "Reference past wins." Small change, huge readability win.

  4. No validation loop (Medium severity) – You mention "review cadence" but never define how to validate hypothesis quality. Add a checklist: measurable metric? supporting data? owner assigned? success/failure criteria?

Quick Wins

  • Add description trigger phrases (+2 points)
  • Create the three template reference files (+7 points)
  • Switch to imperative voice in "When to Use" (+2 points)
  • Add validation checklist to Framework (+2 points)

That's roughly 13 points of improvement with focused changes. Focus on the templates first—that's your biggest bang for buck.


Checkout your skill here: SkillzWave.ai | SpillWave We have an agentic skill installer that install skills in 14+ coding agent platforms. Check out this guide on how to improve your agentic skills.

Report Security Issue

Found a security vulnerability in this agent skill?