Skillzwave

tensorrt-llm

422 stars · 30 forks · Updated Dec 17, 2025 · Quality score: 34.3

Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on NVIDIA GPUs (A100/H100), when you need 10-100x faster inference than PyTorch, or for serving models with quantization (FP8/INT4), in-flight batching, and multi-GPU scaling.

Tags: NVIDIA GPUs, GPU, GPUs, throughput, latency
Also in: github, api, machine learning

Third-Party Skill: Review the code before installing. Skills execute in your AI assistant's environment and can access your files.

skilz install zechenzhangAGI_AI-research-SKILLs/tensorrt-llm
skilz install zechenzhangAGI_AI-research-SKILLs/tensorrt-llm --agent opencode
skilz install zechenzhangAGI_AI-research-SKILLs/tensorrt-llm --agent codex
skilz install zechenzhangAGI_AI-research-SKILLs/tensorrt-llm --agent gemini

First time? Install Skilz: pip install skilz

Works with 14 AI coding assistants

Cursor, Aider, Copilot, Windsurf, Qwen, Kimi, and more...

Download Skill ZIP

Extract the archive, copy it to ~/.claude/skills/, then restart Claude Desktop.

1. Clone the repository:
git clone https://github.com/zechenzhangAGI/AI-research-SKILLs
2. Copy the skill directory:
cp -r AI-research-SKILLs/12-inference-serving/tensorrt-llm ~/.claude/skills/
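The two manual steps above amount to placing the skill directory under ~/.claude/skills/. A minimal sketch of that layout, using a scratch directory in place of your real home directory so it can run anywhere (the SKILL.md file and its contents here are placeholders, not the skill's actual files):

```shell
# Scratch area standing in for $HOME (use your real home directory
# for an actual install).
SCRATCH=$(mktemp -d)

# Stand-in for the cloned repo's skill directory
# (real path: AI-research-SKILLs/12-inference-serving/tensorrt-llm).
mkdir -p "$SCRATCH/AI-research-SKILLs/12-inference-serving/tensorrt-llm"
echo "name: tensorrt-llm" > "$SCRATCH/AI-research-SKILLs/12-inference-serving/tensorrt-llm/SKILL.md"

# Step 2: copy the skill into the directory Claude Desktop reads.
mkdir -p "$SCRATCH/.claude/skills"
cp -r "$SCRATCH/AI-research-SKILLs/12-inference-serving/tensorrt-llm" "$SCRATCH/.claude/skills/"

# The skill now lives at $SCRATCH/.claude/skills/tensorrt-llm
ls "$SCRATCH/.claude/skills/tensorrt-llm"
```

After the copy, restarting Claude Desktop picks up anything under ~/.claude/skills/.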

Need detailed installation help? See the platform-specific guides.


Details

Stars: 422
Forks: 30
Type: Technical
Meta-Domain: cloud infrastructure
Primary Domain: kubernetes
Sub-Domain: deployment path
Skill Size: 26.5 KB
Files: 4
Quality Score: 34.3

AI-Detected Topics

Extracted using NLP analysis

NVIDIA GPUs, GPU, GPUs, throughput, latency

Report Security Issue

Found a security vulnerability in this skill?