Add NVIDIA TensorRT Model Optimizer in vLLM documentation (#17561)
This commit is contained in:
@@ -17,6 +17,7 @@ gptqmodel
|
||||
int4
|
||||
int8
|
||||
fp8
|
||||
modelopt
|
||||
quark
|
||||
quantized_kvcache
|
||||
torchao
|
||||
|
||||
Reference in New Issue
Block a user