This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
84cf78acee1e75bfa163863b3674aeb3ba266844
vllm
/
vllm
/
utils
History
Wentao Ye
f7dcce7a4a
[Feature] Add
VLLM_USE_DEEP_GEMM_E8M0
Env to Control E8M0 Scale (
#21968
)
...
Signed-off-by: yewentao256 <
zhyanwentao@126.com
>
2025-08-11 09:39:08 -07:00
..
__init__.py
[Docs] Add comprehensive CLI reference for all large
vllm
subcommands (
#22601
)
2025-08-11 00:13:33 -07:00
deep_gemm.py
[Feature] Add
VLLM_USE_DEEP_GEMM_E8M0
Env to Control E8M0 Scale (
#21968
)
2025-08-11 09:39:08 -07:00
flashinfer.py
Support Tensorrt-LLM MoE fp4 for low-latency (
#21331
)
2025-08-07 19:18:22 -07:00
jsontree.py
[Misc] Move jsontree to utils (
#22622
)
2025-08-11 03:49:32 -07:00
tensor_schema.py
Migrate LlavaNextImageInputs to TensorSchema (
#21774
)
2025-08-10 09:05:21 -07:00