vllm/docs/serving at 04147dcfa70fb7228ce9e2f88fa7dd41631d17f0 - vllm

Files

Jakub Zakrzewski 23daef548d [Frontend] Support using chat template as custom score template for reranking models (#30550 )

Signed-off-by: Jakub Zakrzewski <jzakrzewski@nvidia.com>
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: wang.yuqi <yuqi.wang@daocloud.io>

2025-12-23 11:19:16 +00:00

integrations

[CI Failure] Disable mosaicml/mpt-7b and databricks/dbrx-instruct tests (#31182 )

2025-12-22 15:40:35 -08:00

context_parallel_deployment.md

[doc] add Context Parallel Deployment doc (#26877 )

2025-10-15 16:33:52 +08:00

data_parallel_deployment.md

[Docs] Clarify Expert Parallel behavior for attention and MoE layers (#30615 )

2025-12-13 08:37:59 -09:00

distributed_troubleshooting.md

[Docs] Replace all explicit anchors with real links (#27087 )

2025-10-17 02:22:06 -07:00

expert_parallel_deployment.md

[Docs] Clarify Expert Parallel behavior for attention and MoE layers (#30615 )

2025-12-13 08:37:59 -09:00

offline_inference.md

[Docs] Replace all explicit anchors with real links (#27087 )

2025-10-17 02:22:06 -07:00

openai_compatible_server.md

[Frontend] Support using chat template as custom score template for reranking models (#30550 )

2025-12-23 11:19:16 +00:00

parallelism_scaling.md

[Doc]: fixing typos in various files (#30540 )

2025-12-14 02:14:37 -08:00