vllm/vllm/model_executor at 894843eb25ddbdedec93b68140f2eb14fceea7ce - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Yan Ma 894843eb25 replace with torch.cuda.device with with torch.accelerator.device_index (#36144 )

Signed-off-by: Yan Ma <yan.ma@intel.com>

2026-03-11 23:12:57 -07:00

..

[Misc] Use envs module to get VLLM_DISABLED_KERNELS (#35776 )

2026-03-11 13:37:46 +00:00

replace with torch.cuda.device with with torch.accelerator.device_index (#36144 )

2026-03-11 23:12:57 -07:00

fix: Use iterator as not to store all the file loads in memory at once (#36149 )

2026-03-08 20:25:21 -07:00

Make Gemma and Gemma 2 accept inputs_embeds like Gemma 3 (#36787 )

2026-03-11 18:12:43 +00:00

[UX] Remove NoOpOffloader log (#35678 )

2026-03-04 12:13:40 -08:00

[MoE Refactor] Create MK for TRTLLM Kernels (#32564 )

2026-03-03 10:39:50 -08:00

__init__.py

[Platform] Deprecate seed_everything (#31659 )

2026-01-04 18:34:04 -08:00

custom_op.py

[Model Bash][DSR1] Add selective dynamic shape marking for CustomOp (#34900 )

2026-02-21 19:28:01 -05:00

parameter.py

[QeRL] Layerwise Reloading (#32133 )

2026-01-30 08:50:05 -07:00

utils.py

[BugFix] Fix EPLB fail for MoeFP4 model with Marlin backend (#33262 )

2026-01-29 16:52:11 +08:00