nemo-curator
"GPU-accelerated data curation for LLM training. Supports text/image/video/audio. Features: fuzzy deduplication (16× faster), quality filtering (30+ heuristics), semantic deduplication, PII redaction, NSFW detection. Scales across GPUs with RAPIDS. Use for preparing high-quality training datasets, cleaning web data, or deduplicating large corpora."
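To make the "fuzzy deduplication" feature concrete, here is a minimal pure-Python sketch of the underlying idea: documents whose word-shingle sets overlap heavily (high Jaccard similarity) are treated as near-duplicates, and only the first one is kept. This is not NeMo Curator's API. The function names `shingles`, `jaccard`, and `fuzzy_dedup`, the 3-word shingle size, and the 0.5 threshold are all assumptions chosen for illustration. Large-scale tools typically approximate this comparison with MinHash and locality-sensitive hashing rather than comparing every pair exactly.

```python
def shingles(text, n=3):
    """Return the set of n-word shingles in text (hypothetical helper)."""
    words = text.lower().split()
    if len(words) < n:
        # Short texts become a single shingle so they can still be compared.
        return {" ".join(words)}
    return {" ".join(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard(a, b):
    """Exact Jaccard similarity between two sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def fuzzy_dedup(docs, threshold=0.5, n=3):
    """Keep a document only if it is below `threshold` similarity to every
    document already kept. Quadratic in the number of kept documents, so
    this is for illustration only, not for large corpora."""
    kept = []  # list of (index, shingle set) for documents retained so far
    for i, doc in enumerate(docs):
        s = shingles(doc, n)
        if all(jaccard(s, ks) < threshold for _, ks in kept):
            kept.append((i, s))
    return [docs[i] for i, _ in kept]


docs = [
    "the quick brown fox jumps over the lazy dog",
    "the quick brown fox jumps over the lazy cat",  # near-duplicate of the first
    "gpu accelerated pipelines scale to large corpora",
]
unique = fuzzy_dedup(docs)
print(unique)
```

The first two sample documents share six of eight distinct shingles, so the second is dropped at the assumed threshold, while the unrelated third document is kept.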
Third-Party Agent Skill: Review the code before installing. Agent skills execute in your AI assistant's environment and can access your files.
Installation for Agentic Skill
skilz install zechenzhangAGI/AI-research-SKILLs/nemo-curator
skilz install zechenzhangAGI/AI-research-SKILLs/nemo-curator --agent opencode
skilz install zechenzhangAGI/AI-research-SKILLs/nemo-curator --agent codex
skilz install zechenzhangAGI/AI-research-SKILLs/nemo-curator --agent gemini
First time? Install Skilz: pip install skilz
Works with 22+ AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Extract and copy to ~/.claude/skills/ then restart Claude Desktop
git clone https://github.com/zechenzhangAGI/AI-research-SKILLs
cp -r AI-research-SKILLs/05-data-processing/nemo-curator ~/.claude/skills/
Related Agentic Skills
image-gen
by SpillwaveSolutions
Generate compelling cover images and in-article illustrations for technical articles using the imagen CLI tool. Use when asked to "generate images"...
processing-computer-vision-tasks
by jeremylongshore
Process images using object detection, classification, and segmentation. Use when requesting "analyze image", "object detection", "image classifica...
Agentic Skill Details
- Owner: zechenzhangAGI (GitHub)
- Repository: AI-research-SKILLs
- Stars: 62
- Forks: 2
- Type: Technical
- Meta-Domain: media
- Primary Domain: image
- Market Score: 26