model-trainer
This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes guidance on the TRL Jobs package, UV scripts with PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence. Should be invoked for
Third-Party Skill: Review the code before installing. Skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation
View all platforms →skilz install kntism_skills/model-trainer skilz install kntism_skills/model-trainer --agent opencode skilz install kntism_skills/model-trainer --agent codex skilz install kntism_skills/model-trainer --agent gemini
First time? Install Skilz: pip install skilz
Works with 14 AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/kntism/skills cp -r skills/model-trainer ~/.claude/skills/ Need detailed installation help? Check our platform-specific guides:
Related Skills
pytorch-lightning
"Deep learning framework (PyTorch Lightning). Organize PyTorch code into LightningModules, configure Trainers for multi-GPU/TPU, implement data pipeli...
qiskit
Comprehensive quantum computing toolkit for building, optimizing, and executing quantum circuits. Use when working with quantum algorithms, simulation...
scikit-learn
Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clusterin...
scvi-tools
This skill should be used when working with single-cell omics data analysis using scvi-tools, including scRNA-seq, scATAC-seq, CITE-seq, spatial trans...
Details
AI-Detected Topics
Extracted using NLP analysis
Browse Category
More data ai skillsReport Security Issue
Found a security vulnerability in this skill?