biondizzle/vllm
vllm/vllm/model_executor/layers/fused_moe/experts @ 0fb142a454757ec2055000ca8a2607e797af3e71
EdalatiAli e5b807607c [Quant][Feature] Support online MXFP8 quantization for MoE and dense models (#35448)
Signed-off-by: EdalatiAli <aliedalati@cohere.com>
2026-03-16 18:07:39 -04:00
__init__.py          [MoE Refactor] Create MK for TRTLLM Kernels (#32564)  2026-03-03 10:39:50 -08:00
trtllm_fp8_moe.py    [Quant][Feature] Support online MXFP8 quantization for MoE and dense models (#35448)  2026-03-16 18:07:39 -04:00
trtllm_nvfp4_moe.py  Fix eplb nvfp4 experts hook (#37217)  2026-03-16 22:03:54 +00:00