biondizzle/vllm
4db72e57f6e8da5e78285e9868e9327167bea973
vllm/vllm/entrypoints
Joe Runde 4db72e57f6 [Bugfix][Refactor] Unify model management in frontend (#11660)
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
2025-01-01 02:21:51 +00:00
openai         [Bugfix][Refactor] Unify model management in frontend (#11660)                        2025-01-01 02:21:51 +00:00
__init__.py    Change the name to vLLM (#150)                                                        2023-06-17 03:07:40 -07:00
api_server.py  [2/N] API Server: Avoid ulimit footgun (#11530)                                       2024-12-26 23:43:05 +00:00
chat_utils.py  [Misc] Abstract the logic for reading and writing media content (#11527)              2024-12-27 19:21:23 +08:00
launcher.py    [Core][Bugfix][Perf] Introduce `MQLLMEngine` to avoid `asyncio` OH (#8157)            2024-09-18 13:56:58 +00:00
llm.py         [Misc]Suppress irrelevant exception stack trace information when CUDA… (#11438)       2024-12-24 08:43:39 +00:00
logger.py      [Frontend] API support for beam search (#9087)                                        2024-10-05 23:39:03 -07:00
utils.py       [Bugfix] Fix request cancellation without polling (#11190)                            2024-12-17 12:26:32 -08:00