tensorrt-llm
Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on NVIDIA GPUs (A100/H100), when you need 10-100x faster inference than PyTorch, or for serving models with quantization (FP8/INT4), in-flight batching, and multi-GPU scaling.
Third-Party Skill: Review the code before installing. Skills execute in your AI assistant's environment and can access your files.
Installation
skilz install zechenzhangAGI_AI-research-SKILLs/tensorrt-llm
skilz install zechenzhangAGI_AI-research-SKILLs/tensorrt-llm --agent opencode
skilz install zechenzhangAGI_AI-research-SKILLs/tensorrt-llm --agent codex
skilz install zechenzhangAGI_AI-research-SKILLs/tensorrt-llm --agent gemini
First time? Install Skilz: pip install skilz
Works with 14 AI coding assistants
Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...
Manual install for Claude Desktop: clone the repository, copy the skill folder to ~/.claude/skills/, then restart Claude Desktop.
git clone https://github.com/zechenzhangAGI/AI-research-SKILLs
cp -r AI-research-SKILLs/12-inference-serving/tensorrt-llm ~/.claude/skills/
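The manual copy step can also be scripted for platforms without `cp`. The following is a minimal sketch, not part of the skill itself: the function name `install_skill` and the constant `SKILL_SUBDIR` are hypothetical, and the sketch assumes the repository has already been cloned locally with the folder layout used in the commands above.

```python
import shutil
from pathlib import Path

# Location of the skill inside the cloned repository, taken from the
# cp command in the manual install instructions.
SKILL_SUBDIR = Path("12-inference-serving") / "tensorrt-llm"


def install_skill(repo_dir: Path, skills_dir: Path) -> Path:
    """Copy the tensorrt-llm skill folder from a local clone into skills_dir.

    Hypothetical helper: mirrors `cp -r <repo>/<subdir> <skills_dir>/`.
    """
    source = repo_dir / SKILL_SUBDIR
    if not source.is_dir():
        raise FileNotFoundError(f"skill folder not found: {source}")
    skills_dir.mkdir(parents=True, exist_ok=True)
    target = skills_dir / source.name
    # dirs_exist_ok=True lets a re-run refresh an earlier copy (Python 3.8+).
    shutil.copytree(source, target, dirs_exist_ok=True)
    return target
```

A call such as `install_skill(Path("AI-research-SKILLs"), Path.home() / ".claude" / "skills")` would reproduce the copy step; Claude Desktop still needs a restart afterwards.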
Related Skills
flux-gitops-scaffold
This skill should be used when the user asks to "create a GitOps project", "scaffold Flux project", "set up GitOps repository", "add Flux applicatio...
deploying-monitoring-stacks
Use when deploying monitoring stacks including Prometheus, Grafana, and Datadog. Trigger with phrases like "deploy monitoring stack", "setup prometh...
managing-deployment-rollbacks
Use when you need to work with deployment and CI/CD. This skill provides deployment automation and orchestration with comprehensive guidance and aut...
orchestrating-deployment-pipelines
Use when you need to work with deployment and CI/CD. This skill provides deployment automation and orchestration with comprehensive guidance and aut...
Details
- Owner: zechenzhangAGI
- Repository: AI-research-SKILLs
- Stars: 422
- Forks: 30
- Type: Technical
- Meta-Domain: cloud infrastructure
- Primary Domain: kubernetes
- Sub-Domain: deployment path
- Skill Size: 26.5 KB
- Files: 4
- Quality Score: 34.3