2025-05-15 16:57:49 +01:00
|
|
|
# Disaggregated Serving
|
|
|
|
|
|
|
|
|
|
This example contains scripts that demonstrate the disaggregated serving features of vLLM.
|
|
|
|
|
|
|
|
|
|
## Files
|
|
|
|
|
|
|
|
|
|
- `disagg_proxy_demo.py` - Demonstrates XpYd (X prefill instances, Y decode instances).
|
|
|
|
|
- `kv_events.sh` - Demonstrates KV cache event publishing.
|
2026-02-04 00:08:25 +08:00
|
|
|
- `mooncake_connector` - A proxy demo for MooncakeConnector.
|