Signed-off-by: Trevor Royer <troyer@redhat.com>
This commit is contained in:
@@ -7,6 +7,8 @@ Engine arguments control the behavior of the vLLM engine.
|
||||
- For [offline inference](#offline-inference), they are part of the arguments to `LLM` class.
|
||||
- For [online serving](#openai-compatible-server), they are part of the arguments to `vllm serve`.
|
||||
|
||||
For references to all arguments available from `vllm serve` see the [serve args](#serve-args) documentation.
|
||||
|
||||
Below, you can find an explanation of every engine argument:
|
||||
|
||||
<!--- pyml disable-num-lines 7 no-space-in-emphasis -->
|
||||
|
||||
Reference in New Issue
Block a user