nemo-curator
"GPU-accelerated data curation for LLM training. Supports text/image/video/audio. Features: fuzzy deduplication (16× faster), quality filtering (30+ heuristics), semantic deduplication, PII redaction, NSFW detection. Scales across GPUs with RAPIDS. Use for preparing high-quality training datasets, cleaning web data, or deduplicating large corpora."
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files. Learn more about security
Installation for Agentic Skill
View all platforms →skilz install zechenzhangAGI/AI-research-SKILLs/nemo-curator skilz install zechenzhangAGI/AI-research-SKILLs/nemo-curator --agent opencode skilz install zechenzhangAGI/AI-research-SKILLs/nemo-curator --agent codex skilz install zechenzhangAGI/AI-research-SKILLs/nemo-curator --agent gemini
First time? Install Skilz: pip install skilz
Works with 14 AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/zechenzhangAGI/AI-research-SKILLs cp -r AI-research-SKILLs/05-data-processing/nemo-curator ~/.claude/skills/ Need detailed installation help? Check our platform-specific guides:
Related Agentic Skills
processing-computer-vision-tasks
by jeremylongshoreProcess images using object detection, classification, and segmentation. Use when requesting "analyze image", "object detection", "image classificatio...
processing-computer-vision-tasks
by jeremylongshoreProcess images using object detection, classification, and segmentation. Use when requesting "analyze image", "object detection", "image classificatio...
task-breakdown
by jasonkneenConvert technical designs into actionable, sequenced implementation tasks. Create clear coding tasks that enable incremental progress, respect depende...
gemini-logo-remover
by bear2uRemove Gemini logos, watermarks, or AI-generated image markers using OpenCV inpainting. Use this skill when the user asks to remove Gemini logo, AI wa...
Agentic Skill Details
- Owner
- zechenzhangAGI (GitHub)
- Repository
- AI-research-SKILLs
- Type
- Technical
- Meta-Domain
- media
- Primary Domain
- image
- Market Score
- 26.2
Browse Category
More media Agentic SkillsReport Security Issue
Found a security vulnerability in this agent skill?