Add NVIDIA TensorRT Model Optimizer in vLLM documentation (#17561)

2025-05-02 11:36:46 -07:00
parent 3e887d2e0c
commit 182f40ea8b
3 changed files with 90 additions and 1 deletions
--- a/docs/source/features/quantization/supported_hardware.md
+++ b/docs/source/features/quantization/supported_hardware.md
@@ -129,7 +129,17 @@ The table below shows the compatibility of various quantization implementations
  * ❌
  * ❌
  * ❌
-
+- * modelopt
+  * ✅︎
+  * ✅︎
+  * ✅︎
+  * ✅︎
+  * ✅︎︎
+  * ❌
+  * ❌
+  * ❌
+  * ❌
+  * ❌
 :::

 - Volta refers to SM 7.0, Turing to SM 7.5, Ampere to SM 8.0/8.6, Ada to SM 8.9, and Hopper to SM 9.0.