vllm/vllm/engine at 756848e79e1bd557d36b20e70cbd48ddba48ea51 - vllm

Files

Flex Wang 18445edd0f [Misc] Change buckets of histogram_iteration_tokens to [1, 8, 16, 32, 64, 128, 256, 512, 1024, 2048, 4096, 8096] to represent number of tokens (#17033 )

Signed-off-by: sfc-gh-zhwang <flex.wang@snowflake.com>

2025-04-27 12:30:53 +00:00

2025-04-24 04:43:56 -07:00

2025-04-26 11:51:17 -04:00

__init__.py

2023-06-17 03:07:40 -07:00

arg_utils.py

2025-04-25 19:55:05 -07:00

async_llm_engine.py

2025-04-24 20:16:52 +00:00

async_timeout.py

2025-02-02 11:58:18 -08:00

llm_engine.py

2025-04-24 04:43:56 -07:00

metrics_types.py

2025-02-22 00:20:00 -08:00

metrics.py

2025-04-27 12:30:53 +00:00

protocol.py

2025-04-15 11:50:38 +00:00