This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
90fbf12540da089fcc7dc825ce2ceb7ea3a3df33
vllm
/
vllm
/
engine
History
Sherry
54d3544784
Fix: Output text is always truncated in some models (
#3016
)
2024-03-01 07:52:22 +00:00
..
__init__.py
Change the name to vLLM (
#150
)
2023-06-17 03:07:40 -07:00
arg_utils.py
[Neuron] Support inference with transformers-neuronx (
#2569
)
2024-02-28 09:34:34 -08:00
async_llm_engine.py
Add guided decoding for OpenAI API server (
#2819
)
2024-02-29 22:13:08 +00:00
llm_engine.py
Fix: Output text is always truncated in some models (
#3016
)
2024-03-01 07:52:22 +00:00
metrics.py
add cache_config's info to prometheus metrics. (
#3100
)
2024-02-29 06:15:18 +00:00
ray_utils.py
[Ray] Integration compiled DAG off by default (
#2471
)
2024-02-08 09:57:25 -08:00