model-trainer
This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes guidance on the TRL Jobs package, UV scripts with PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence. Should be invoked for
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install kntism/skills/model-trainerskilz install kntism/skills/model-trainer --agent opencodeskilz install kntism/skills/model-trainer --agent codexskilz install kntism/skills/model-trainer --agent geminiFirst time? Install Skilz: pip install skilz
Works with 22+ AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/kntism/skillscp -r skills/model-trainer ~/.claude/skills/Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
hugging-face-evaluation-manager
by kntism
Add and manage evaluation results in Hugging Face model cards. Supports extracting eval tables from README content, importing scores from Artificia...
PRFAQ Writer
by thinkbigleaders
**Description**: Write compelling PRFAQ (Press Release + FAQ) documents using Amazon's Working Backwards methodology to develop customer-centric in...
scientific-writing
by davila7
"Core skill for the deep research and writing tool. Write scientific manuscripts in full paragraphs (never bullet points). Use two-stage process: (...
aws-sdk-java-v2-bedrock
by giuseppe-trisciuoglio
Amazon Bedrock patterns using AWS SDK for Java 2.x. Use when working with foundation models (listing, invoking), text generation, image generation,...
Agentic Skill Details
- Repository
- skills
- Type
- Non-Technical
- Meta-Domain
- general
- Primary Domain
- general
- Sub-Domain
- machine learning models model
- Market Score
- 50
Browse Category
More general Agentic SkillsReport Security Issue
Found a security vulnerability in this agent skill?
Report Security Issue
Thank you for helping keep SkillzWave secure. We'll review your report and take appropriate action.
Note: For critical security issues that require immediate attention, please also email security@skillzwave.ai directly.