Skillzwave

model-trainer

Use this skill when users want to train or fine-tune language models with TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. It covers the SFT, DPO, GRPO, and reward modeling training methods, plus GGUF conversion for local deployment. It includes guidance on the TRL Jobs package, UV scripts in PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence. Should be invoked for

Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files.

Installation for Agentic Skill

skilz install kntism/skills/model-trainer
skilz install kntism/skills/model-trainer --agent opencode
skilz install kntism/skills/model-trainer --agent codex
skilz install kntism/skills/model-trainer --agent gemini

First time? Install Skilz: pip install skilz

Works with 14 AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

Manual installation: download the agent skill ZIP, extract it, and copy it to ~/.claude/skills/, then restart Claude Desktop. Alternatively, install from the repository:

1. Clone the repository:
git clone https://github.com/kntism/skills
2. Copy the agent skill directory:
cp -r skills/model-trainer ~/.claude/skills/

Agentic Skill Details

Repository: skills
Type: Non-Technical
Meta-Domain: general
Primary Domain: general
Sub-Domain: learning models model
Market Score: 50.2
