Files

Michael Goin c39ee9ee2b [Docs] Add sections on process architecture and minimum CPU resources (#33940 )

It seems users can be confused about vLLM's performance when running
with very small amounts of CPU cores available. We are missing a clear
overview of what vLLM's process architecture is, so I added this along with
some diagrams in arch_overview.md, and included a section on CPU resource
recommendations in optimization.md

Signed-off-by: mgoin <mgoin64@gmail.com>

2026-02-06 15:26:43 +00:00

conserving_memory.md

[Doc] Update more docs with respect to V1 (#29188 )

2025-11-23 10:58:48 +08:00

engine_args.md

[Docs] Fix some snippets (#31378 )

2025-12-26 12:47:41 +00:00

env_vars.md

[Docs] Take env var definition out of folded admonition (#29005 )

2025-11-19 03:32:04 -08:00

model_resolution.md

[Misc] unify variable for LLM instance (#20996 )