vllm/vllm/model_executor at 828f862acb5f46ffaa1633aa80d85af73c31c97a

biondizzle/vllm

Michael Goin 9482b0b085 [Bugfix] Remove assertion for NVFP4 scale dynamic range (#37465)

Signed-off-by: Michael Goin <mgoin64@gmail.com>

2026-03-18 15:37:49 -07:00

[Torch 2.11] Guard torch._C._cpu attribute checks for forward compatibility (#35673)

2026-03-17 18:47:59 +00:00

[Bugfix] Remove assertion for NVFP4 scale dynamic range (#37465)

2026-03-18 15:37:49 -07:00

[Bugfix] Fix EP weight filter breaking EPLB and NVFP4 accuracy (#37322)

2026-03-18 18:30:29 +08:00

[LoRA][BugFix] Fix skipped LoRA adapters for Mistral3 (#36928)

2026-03-18 22:34:19 +00:00

Bugfix for offloading+prefetch for GLM-4.7-FP8 (#37178)

2026-03-17 21:22:09 +08:00

[Feature]: Remove Chunking From FusedMoE (#34086)

2026-03-12 14:24:38 -04:00

__init__.py

[Platform] Deprecate seed_everything (#31659)

2026-01-04 18:34:04 -08:00

custom_op.py

Add ability to replace oot ops when using lora (#37181)

2026-03-16 18:04:15 -07:00

parameter.py

[QeRL] Layerwise Reloading (#32133)

2026-01-30 08:50:05 -07:00

utils.py

[BugFix] Fix EPLB fail for MoeFP4 model with Marlin backend (#33262)

2026-01-29 16:52:11 +08:00