grpo-rl-training

Name: grpo-rl-training
Rating: 1.3 (1 reviews)
Author: zechenzhangAGI

26.2

"Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training"

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security

Installation for Agentic Skill

View all platforms →

Claude Code (CLI) Fast

skilz install zechenzhangAGI/AI-research-SKILLs/grpo-rl-training

OpenCode (CLI) Fast

skilz install zechenzhangAGI/AI-research-SKILLs/grpo-rl-training --agent opencode

OpenAI Codex (CLI) Native

skilz install zechenzhangAGI/AI-research-SKILLs/grpo-rl-training --agent codex

Gemini CLI (Project) Project

skilz install zechenzhangAGI/AI-research-SKILLs/grpo-rl-training --agent gemini

First time? Install Skilz: pip install skilz

Works with 14 AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

View All Agents

For Claude Desktop Easy

Download Agent Skill ZIP

Extract and copy to ~/.claude/skills/ then restart Claude Desktop

Manual Installation

1. Clone the repository:

git clone https://github.com/zechenzhangAGI/AI-research-SKILLs

2. Copy the agent skill directory:

cp -r AI-research-SKILLs/06-post-training/grpo-rl-training ~/.claude/skills/

View on GitHub

Need detailed installation help? Check our platform-specific guides:

Claude Desktop Guide Claude Code Guide Troubleshooting

Related Agentic Skills

hooks-automation

by ruvnet

Automated coordination, formatting, and learning from Claude Code operations using intelligent hooks with MCP integration. Includes pre/post task hook...

TECHmachine learning

Marketplace

+git

ml-pipeline-workflow

by wshobson

Build end-to-end MLOps pipelines from data preparation through model training, validation, and production deployment. Use when creating ML pipelines, ...

TECHmachine learning

Marketplace

book-sft-pipeline

by muratcankoylan

End-to-end system for creating supervised fine-tuning datasets from books and training style-transfer models. Covers text extraction, intelligent segm...

TECHmachine learning

Marketplace

dspy-ruby

by EveryInc

This skill should be used when working with DSPy.rb, a Ruby framework for building type-safe, composable LLM applications. Use this when implementing ...

TECHmachine learning

Marketplace

+testing

Agentic Skill Details

Owner: zechenzhangAGI (GitHub)
Repository: AI-research-SKILLs
Type: Technical
Meta-Domain: data ai
Primary Domain: machine learning
Market Score: 26.2

Agentic Skill Grades →

Browse Category

More data ai Agentic Skills

Report Security Issue

Found a security vulnerability in this agent skill?