vllm/docs/assets/design/arch_overview/v1_process_architecture_tp4.png at 5719a4e4e601fb91274294d25370b7aad656d629

Files

Michael Goin c39ee9ee2b [Docs] Add sections on process architecture and minimum CPU resources (#33940 )

It seems users can be confused about vLLM's performance when running
with very small amounts of CPU cores available. We are missing a clear
overview of what vLLM's process architecture is, so I added this along with
some diagrams in arch_overview.md, and included a section on CPU resource
recommendations in optimization.md

Signed-off-by: mgoin <mgoin64@gmail.com>

2026-02-06 15:26:43 +00:00

3.8 MiB

2816x1536px

Raw History

/biondizzle/vllm/raw/commit/5719a4e4e601fb91274294d25370b7aad656d629/docs/assets/design/arch_overview/v1_process_architecture_tp4.png

3.8 MiB 2816x1536px Raw History

3.8 MiB

2816x1536px

Raw History