offer-testing
Use when designing copy experiments to optimize hooks, offers, and CTAs.
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install gtmagents/gtm-agents/offer-testing skilz install gtmagents/gtm-agents/offer-testing --agent opencode skilz install gtmagents/gtm-agents/offer-testing --agent codex skilz install gtmagents/gtm-agents/offer-testing --agent gemini
First time? Install Skilz: pip install skilz
Works with 22+ AI coding agents
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/gtmagents/gtm-agents cp -r gtm-agents/plugins/copywriting/skills/offer-testing ~/.claude/skills/ Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
tailwind-shadcn-setup
by vanman2024Setup Tailwind CSS and shadcn/ui component library for Next.js projects. Use when configuring Tailwind CSS, installing shadcn/ui, setting up design to...
spanish-language-tutor
by sandraschiComprehensive Spanish language expert covering grammar, conversation, regional dialects, and language learning strategies
frontend-dev-guidelines
by diet103Frontend development guidelines for React/TypeScript applications. Modern patterns including Suspense, lazy loading, useSuspenseQuery, file organizati...
torch-geometric
by davila7Graph Neural Networks (PyG). Node/graph classification, link prediction, GCN, GAT, GraphSAGE, heterogeneous graphs, molecular property prediction, for...
Agentic Skill Details
- Repository
- gtm-agents
- Type
- Non-Technical
- Meta-Domain
- development
- Primary Domain
- javascript
- Market Score
- 28.6
Agent Skill Grade
D
Score: 64/100
Click to see breakdown
Score Breakdown
Areas to Improve
- Templates are listed but not provided; violates layered structure principle
- No worked examples showing before/after or input/output pairs
- Generic description lacks specific technical terms users would search for
Recommendations
- Focus on improving Utility (currently 9/20)
- Address 2 high-severity issues first
- Add trigger phrases to description for discoverability
Graded: 1/24/2026
Developer Feedback
I spent some time with your skill and noticed the grading came in at 64/100 — there's solid foundation here, but the structure feels like it's fighting against itself a bit. The core idea of offer testing is useful, but I'm curious about the choices you made around when and how to surface those options to users.
Links:
The TL;DR
You're at 64/100, solid D territory. This is based on Anthropic's progressive disclosure architecture principles and skill utility standards. Your strongest area is Spec Compliance (12/15) — the YAML frontmatter is clean and technically sound. But Utility is dragging you down hard at 9/20, which makes sense because the skill reads like an outline instead of a guide users can actually execute.
What's Working Well
- Valid frontmatter structure — Your metadata follows conventions correctly with proper hyphen-case naming
- Clear 5-step framework — The progression from hypothesis through analysis shows you've thought about the testing flow
- Objective tone — No marketing fluff, just instructional content that gets to the point
- Grep-friendly formatting — Clean enough that reference tools could parse it without breaking a sweat
The Big One: Missing Actual Templates and Examples
Here's what's killing your utility score: you mention templates and examples but don't provide them. You say "Experiment brief (variable, control, variant, KPI, sample size, duration)" but there's no actual markdown template users can copy-paste. Same with results reporting.
Why this matters: A marketer reading your skill has the framework, sure, but when they sit down to actually run a test, they're googling for a template anyway. You've created demand but not fulfilled it.
Concrete fix: Create references/templates.md with 2-3 actual templates (experiment brief, results report, prioritization matrix) users can steal directly. Add references/examples.md with a worked example — like an A/B test on email subject lines showing control/variant setup, sample size calculation, and results interpretation. That's probably +6-7 points right there.
Other Things Worth Fixing
Weak search triggers — Your description says "designing copy experiments" but nobody searches that. Add specific terms: "A/B testing", "split test", "statistical significance", "multivariate testing", "email subject lines", "CTA optimization". That's +3 points in discoverability.
Statistical guidance is hand-wavy — You mention "chi-square, z-test" but provide no thresholds, sample size calculators, or confidence intervals. Add a
references/statistical-analysis.mdwith formulas and tool recommendations. +4 points.No validation checklist — Where's the "before you launch" checklist? Before deploying a test, users should verify: tracking pixels work, control/variant render correctly, audience split logic is sound. Add a run-check-fix pattern. +3 points.
Inconsistent voice — Mix of noun phrases ("Hypothesis – statement...") and imperatives ("Limit to one variable"). Pick one and stick with it. Minor but adds polish.
Quick Wins
- Add
references/templates.mdwith copy-paste templates (+6-7 pts) - Expand description with specific search terms: A/B test, split test, statistical significance, multivariate (+3 pts)
- Create
references/examples.mdwith a worked scenario (+5 pts) - Add validation checklist before launch (+3 pts)
Hitting those four moves gets you to 80+, easy. You've got the foundation — just need the actual scaffolding users can climb.
Checkout your skill here: SkillzWave.ai | SpillWave We have an agentic skill installer that install skills in 14+ coding agent platforms. Check out this guide on how to improve your agentic skills.
Browse Category
More development Agentic SkillsReport Security Issue
Found a security vulnerability in this agent skill?