hugging-face-evaluation-manager
Add and manage evaluation results in Hugging Face model cards. Supports extracting eval tables from README content, importing scores from Artificial Analysis API, and running custom model evaluations with vLLM/lighteval. Works with the model-index metadata format.
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install kntism/skills/hugging-face-evaluation-manager skilz install kntism/skills/hugging-face-evaluation-manager --agent opencode skilz install kntism/skills/hugging-face-evaluation-manager --agent codex skilz install kntism/skills/hugging-face-evaluation-manager --agent gemini
First time? Install Skilz: pip install skilz
Works with 14 AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/kntism/skills cp -r skills/hugging-face-evaluation-manager ~/.claude/skills/ Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
deepspeed
by zechenzhangAGIExpert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention
model-trainer
by kntismThis skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs in...
PRFAQ Writer
by thinkbigleaders**Description**: Write compelling PRFAQ (Press Release + FAQ) documents using Amazon's Working Backwards methodology to develop customer-centric innov...
scientific-writing
by davila7"Core skill for the deep research and writing tool. Write scientific manuscripts in full paragraphs (never bullet points). Use two-stage process: (1) ...
Agentic Skill Details
- Repository
- skills
- Type
- Non-Technical
- Meta-Domain
- general
- Primary Domain
- general
- Sub-Domain
- learning models model
- Market Score
- 45.7
Browse Category
More general Agentic SkillsReport Security Issue
Found a security vulnerability in this agent skill?