vllm/docs/serving at c63ca2b2e696e8dd1ae0f5ace08fd57a6a95a65f - vllm

wang.yuqi f9e2a38386 [Docs] Reorganize pooling docs. (#35592)

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2026-03-19 11:25:47 +00:00

integrations

[Frontend] Exclude anthropic billing header to avoid prefix cache miss (#36829)

2026-03-12 01:20:34 +00:00

context_parallel_deployment.md

[Doc]: fixing multiple typos in diverse files (#33256)

2026-01-29 16:52:03 +08:00

data_parallel_deployment.md

[Docs] Clarify Expert Parallel behavior for attention and MoE layers (#30615)

2025-12-13 08:37:59 -09:00

distributed_troubleshooting.md

[Docs] Replace all explicit anchors with real links (#27087)