vllm/vllm/engine at 90fbf12540da089fcc7dc825ce2ceb7ea3a3df33 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Sherry 54d3544784 Fix: Output text is always truncated in some models (#3016 )

2024-03-01 07:52:22 +00:00

..

__init__.py

Change the name to vLLM (#150 )

2023-06-17 03:07:40 -07:00

arg_utils.py

[Neuron] Support inference with transformers-neuronx (#2569 )

2024-02-28 09:34:34 -08:00

async_llm_engine.py

Add guided decoding for OpenAI API server (#2819 )

2024-02-29 22:13:08 +00:00

llm_engine.py

Fix: Output text is always truncated in some models (#3016 )

2024-03-01 07:52:22 +00:00

metrics.py

add cache_config's info to prometheus metrics. (#3100 )

2024-02-29 06:15:18 +00:00

ray_utils.py

[Ray] Integration compiled DAG off by default (#2471 )

2024-02-08 09:57:25 -08:00