biondizzle/vllm
vllm/model_executor at ef248ff740200c91791ba952b3458a5d5a016d26
Latest commit: e10604480b by Kunshang Ji
[XPU][1/N] Deprecate ipex and switch to vllm-xpu-kernels for xpu platform (#33379)
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
2026-02-02 22:46:10 -08:00
Name | Last commit | Date
layers | [XPU][1/N] Deprecate ipex and switch to vllm-xpu-kernels for xpu platform (#33379) | 2026-02-02 22:46:10 -08:00
model_loader | fix memory for online fp8 quantization with streaming weight load (#31914) | 2026-02-02 14:17:42 -05:00
models | Fix quantized Falcon-H1 model loading issues (#32728) | 2026-02-02 22:31:27 -08:00
warmup | [MoE Refactor] Integrate Naive Prepare Finalize into MK (#32567) | 2026-01-27 01:28:02 +00:00
__init__.py | [Platform] Deprecate seed_everything (#31659) | 2026-01-04 18:34:04 -08:00
custom_op.py | [torch.compile] Compile CustomOp.forward_native for SiluAndMul and QuantFP8 to avoid raw torch ops inside opaque custom ops (#32806) | 2026-01-22 19:52:26 -08:00
parameter.py | [QeRL] Layerwise Reloading (#32133) | 2026-01-30 08:50:05 -07:00
utils.py | [BugFix] Fix EPLB fail for MoeFP4 model with Marlin backend (#33262) | 2026-01-29 16:52:11 +08:00