vllm/vllm/model_executor at e054f152faa48ab27389f490d6e86c959d86d122 - vllm

biondizzle/vllm

zhang-prog 0f5b526040 [Fix] Remove unused packing_position_embedding from PaddleOCRVL for better checkpoint compatibility (#38232)

Signed-off-by: zhangyue66 <zhangyue66@baidu.com>

2026-03-26 15:34:49 +00:00

[Bugfix][CI] Fix Marlin FP8 Linear Kernel for Compressed Tensors Format (#38092)

2026-03-25 21:11:43 -07:00

Revert "[MoE Kernel] Flashinfer nvfp4 cutedsl moe kernel integration" (#38050) (#38169)

2026-03-26 07:59:09 -07:00

[ROCm][CI] Fix flaky GPTQ compile correctness test (#38161)

2026-03-26 19:57:00 +08:00

[Fix] Remove unused packing_position_embedding from PaddleOCRVL for better checkpoint compatibility (#38232)

2026-03-26 15:34:49 +00:00

Bugfix for offloading+prefetch for GLM-4.7-FP8 (#37178)

2026-03-17 21:22:09 +08:00

[Bugfix] Fix AttributeError when serving MXFP8 models with DeepGEMM installed (#37358)

2026-03-19 17:58:33 +00:00

__init__.py

[Platform] Deprecate seed_everything (#31659)

2026-01-04 18:34:04 -08:00

custom_op.py

Add ability to replace oot ops when using lora (#37181)

2026-03-16 18:04:15 -07:00

parameter.py

[Mypy] Fix mypy for vllm/model_executor (except vllm/model_executor/layers) (#37904)

2026-03-24 17:14:01 +00:00

utils.py

[BugFix] Fix EPLB fail for MoeFP4 model with Marlin backend (#33262)

2026-01-29 16:52:11 +08:00