[Frontend] [Core] perf: Automatically detect vLLM-tensorized model, update tensorizer to version 2.9.0 (#4208)
@@ -167,8 +167,8 @@ class EngineArgs:
             '* "dummy" will initialize the weights with random values, '
             'which is mainly for profiling.\n'
             '* "tensorizer" will load the weights using tensorizer from '
-            'CoreWeave which assumes tensorizer_uri is set to the location of '
-            'the serialized weights.')
+            'CoreWeave. See the Tensorize vLLM Model script in the Examples'
+            'section for more information.\n')
         parser.add_argument(
             '--dtype',
             type=str,
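The help text in this hunk is assembled from adjacent string literals, which Python joins with no separator, so each fragment has to carry its own trailing space. The sketch below shows that pattern on a standalone parser. It is illustrative only: the `--load-format` flag name, the two `choices`, and `build_parser` are assumptions made for the example and are not taken from this diff. The sketch also adds a trailing space after `Examples`; the fragment as it appears in the hunk has none, so there the two literals would join as `Examplessection`.

```python
import argparse

# Adjacent string literals are concatenated with no separator,
# so every fragment that continues a sentence ends with a space.
HELP_TEXT = (
    '* "dummy" will initialize the weights with random values, '
    'which is mainly for profiling.\n'
    '* "tensorizer" will load the weights using tensorizer from '
    'CoreWeave. See the Tensorize vLLM Model script in the Examples '
    'section for more information.\n')


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical stand-in for the real CLI; flag name and choices
    # are assumptions for this example.
    parser = argparse.ArgumentParser(
        formatter_class=argparse.RawTextHelpFormatter)
    parser.add_argument(
        '--load-format',
        type=str,
        default='dummy',
        choices=['dummy', 'tensorizer'],
        help=HELP_TEXT)
    return parser


if __name__ == '__main__':
    args = build_parser().parse_args(['--load-format', 'tensorizer'])
    print(args.load_format)
```

`RawTextHelpFormatter` is used here so that the explicit `\n` characters in the help string survive into the `--help` output; the default formatter would re-wrap them.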