vllm/vllm/engine at bc0c0192d13ca6ea4aeea4725f752a89483895bc

biondizzle/vllm

SangBin Cho 18de883489 [Chunked Prefill][4/n] Chunked prefill scheduler. (#3853)

2024-04-05 10:17:58 -07:00

__init__.py

Change the name to vLLM (#150)

2023-06-17 03:07:40 -07:00

arg_utils.py

Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290)

2024-04-03 14:15:55 -07:00

async_llm_engine.py

[Speculative decoding] Adding configuration object for speculative decoding (#3706)

2024-04-03 00:40:57 +00:00

llm_engine.py

[Chunked Prefill][4/n] Chunked prefill scheduler. (#3853)

2024-04-05 10:17:58 -07:00

metrics.py

[CI] Try introducing isort. (#3495)

2024-03-25 07:59:47 -07:00

ray_utils.py

[CI] Try introducing isort. (#3495)

2024-03-25 07:59:47 -07:00