model-trainer

Name: model-trainer
Rating: 2.4 (1 review)
Author: huggingface

Use this skill when users want to train or fine-tune language models with TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. It covers the SFT, DPO, GRPO, and reward modeling training methods, plus GGUF conversion for local deployment. It includes guidance on the TRL Jobs package, UV scripts in PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence. Invoke it for tasks involving cloud GPU training or GGUF conversion, or when users mention training on Hugging Face Jobs without a local GPU setup.

Marketplace

Also in: security, monitoring, kubernetes

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files.

Installation for Agentic Skill

Claude Code (CLI) Fast

skilz install huggingface/skills/model-trainer

OpenCode (CLI) Fast

skilz install huggingface/skills/model-trainer --agent opencode

OpenAI Codex (CLI) Native

skilz install huggingface/skills/model-trainer --agent codex

Gemini CLI (Project) Project

skilz install huggingface/skills/model-trainer --agent gemini

First time? Install Skilz: pip install skilz

Works with 22+ AI coding agents

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

For Claude Desktop Easy

Download the agent skill ZIP, extract it, copy the extracted folder to ~/.claude/skills/, then restart Claude Desktop.

Manual Installation

1. Clone the repository:

git clone https://github.com/huggingface/skills

2. Copy the agent skill directory:

cp -r skills/hf-llm-trainer/skills/model-trainer ~/.claude/skills/
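
The copy in step 2 fails with an unhelpful error if the clone is missing or the target directory does not exist yet. The sketch below wraps that step in a small function that checks the source path and creates the target directory first. The name `install_skill` and its two arguments are hypothetical and are not part of Skilz or of the huggingface/skills repository. It assumes a POSIX shell with `cp` and `mkdir` available.

```shell
#!/bin/sh
# Sketch only: install_skill is a hypothetical helper, not a Skilz command.
# It copies one skill directory from a local checkout into a skills directory.
install_skill() {
    src="$1"    # skill directory inside the cloned repository
    dest="$2"   # target skills directory, e.g. "$HOME/.claude/skills"

    # Stop early with a clear message if the clone or path is wrong.
    if [ ! -d "$src" ]; then
        echo "skill directory not found: $src" >&2
        return 1
    fi

    # Create the target directory if needed, then copy the skill into it.
    mkdir -p "$dest" && cp -r "$src" "$dest/"
}
```

After step 1, this could be called as `install_skill skills/hf-llm-trainer/skills/model-trainer "$HOME/.claude/skills"`, which performs the same copy as the command in step 2.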

Related Agentic Skills

flow-nexus-neural

by ruvnet

Train and deploy neural networks in distributed E2B sandboxes with Flow Nexus

TECH machine learning

Marketplace

+ci cd

hooks-automation

by ruvnet

Automated coordination, formatting, and learning from Claude Code operations using intelligent hooks with MCP integration. Includes pre/post task hook...

TECH machine learning

Marketplace

+git

ml-pipeline-workflow

by wshobson

Build end-to-end MLOps pipelines from data preparation through model training, validation, and production deployment. Use when creating ML pipelines, ...

TECH machine learning

Marketplace

book-sft-pipeline

by muratcankoylan

End-to-end system for creating supervised fine-tuning datasets from books and training style-transfer models. Covers text extraction, intelligent segm...

TECH machine learning

Marketplace

Agentic Skill Details

Owner: huggingface (GitHub)
Repository: skills
Type: Technical
Meta-Domain: data ai
Primary Domain: machine learning
Market Score: 47.9