vllm/docs/assets/design at f1ff50c86cfac67b68ddef67336e96a1b6e424b6 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Woosuk Kwon 4f85bae9d6 [Docs][Model Runner V2] Add Design Docs (#35819 )

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

2026-03-02 19:58:14 -08:00

..

[Docs] Add sections on process architecture and minimum CPU resources (#33940 )

2026-02-06 15:26:43 +00:00

[Docs] add docs for cuda graph v1 (#24374 )

2025-10-07 05:25:05 -07:00

debug_vllm_compile

[Docs] Add guide to debugging vLLM-torch.compile integration (#28094 )

2025-11-05 21:31:46 +00:00

fused_moe_modular_kernel

[Doc] Add FusedMoE Modular Kernel Documentation (#21623 )

2025-07-29 10:32:30 -07:00

hybrid_kv_cache_manager

[doc] Hybrid KV Cache Manager design doc (#22688 )

2025-08-26 20:19:05 +00:00

[DOC] Fix path of v1 related figures (#21868 )

2025-07-29 19:45:18 -07:00

model_runner_v2

[Docs][Model Runner V2] Add Design Docs (#35819 )

2026-03-02 19:58:14 -08:00

paged_attention

[Doc] Remove vLLM prefix and add citation for PagedAttention (#21910 )

2025-07-30 06:36:34 -07:00

[DOC] Fix path of v1 related figures (#21868 )

2025-07-29 19:45:18 -07:00

[DOC] Fix path of v1 related figures (#21868 )

2025-07-29 19:45:18 -07:00

hierarchy.png

Migrate docs from Sphinx to MkDocs (#18145 )

2025-05-23 02:09:53 -07:00