vllm/vllm/engine at 671af2b1c0b3ed6d856d37c21a561cc429a10701 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Woosuk Kwon 30fb0956df [Minor] Add more detailed explanation on quantization argument (#2145 )

2023-12-17 01:56:16 -08:00

..

__init__.py

Change the name to vLLM (#150 )

2023-06-17 03:07:40 -07:00

arg_utils.py

[Minor] Add more detailed explanation on quantization argument (#2145 )

2023-12-17 01:56:16 -08:00

async_llm_engine.py

Fix typing in AsyncLLMEngine & add toml to requirements-dev (#2100 )

2023-12-14 00:19:41 -08:00

llm_engine.py

Remove dependency on CuPy (#2152 )

2023-12-17 01:49:07 -08:00

metrics.py

Add Production Metrics in Prometheus format (#1890 )

2023-12-02 16:37:44 -08:00

ray_utils.py

Optimize model execution with CUDA graph (#1926 )

2023-12-16 21:12:08 -08:00