Add NVIDIA TensorRT Model Optimizer in vLLM documentation (#17561)

This commit is contained in:
Zhiyu
2025-05-02 11:36:46 -07:00
committed by GitHub
parent 3e887d2e0c
commit 182f40ea8b
3 changed files with 90 additions and 1 deletions

View File

@@ -129,7 +129,17 @@ The table below shows the compatibility of various quantization implementations
*
*
*
- * modelopt
* ✅︎
* ✅︎
* ✅︎
* ✅︎
* ✅︎︎
*
*
*
*
*
:::
- Volta refers to SM 7.0, Turing to SM 7.5, Ampere to SM 8.0/8.6, Ada to SM 8.9, and Hopper to SM 9.0.