model-trainer

672 stars 73 forks

This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes guidance on the TRL Jobs package, UV scripts with PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence. Should be invoked f...


Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files.

Installation for Agentic Skill

skilz install huggingface/skills/model-trainer
skilz install huggingface/skills/model-trainer --agent opencode
skilz install huggingface/skills/model-trainer --agent codex
skilz install huggingface/skills/model-trainer --agent gemini

First time? Install Skilz: pip install skilz

Works with 22+ AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

Download Agent Skill ZIP

Extract the ZIP, copy the skill folder to ~/.claude/skills/, then restart Claude Desktop.

1. Clone the repository:
git clone https://github.com/huggingface/skills
2. Copy the agent skill directory:
cp -r skills/hf-llm-trainer/skills/model-trainer ~/.claude/skills/


Agentic Skill Details

Repository: skills
Stars: 672
Forks: 73
Type: Technical
Meta-Domain: data ai
Primary Domain: machine learning
Market Score: 47
