vllm/vllm/model_executor at a2268617cfe91c4eebed1944327d8869ad628b8b - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Thomas Parnell f296a1966d [Bugfix] Fix FlashInfer GDN warmup ValueError on SM90 GPUs (#36876 )

2026-03-13 07:09:39 +01:00

..

[Misc] Use envs module to get VLLM_DISABLED_KERNELS (#35776 )

2026-03-11 13:37:46 +00:00

[Bugfix] ep_scatter kernel store-load race condition (#34991 )

2026-03-13 01:07:59 +00:00

[Hardware] Replace torch.cuda.device_count/current_device/set_device API (#36145 )

2026-03-12 07:57:47 -07:00

[Bugfix] Fix FlashInfer GDN warmup ValueError on SM90 GPUs (#36876 )

2026-03-13 07:09:39 +01:00

[UX] Remove NoOpOffloader log (#35678 )

2026-03-04 12:13:40 -08:00

[Feature]: Remove Chunking From FusedMoE (#34086 )

2026-03-12 14:24:38 -04:00

__init__.py

[Platform] Deprecate seed_everything (#31659 )

2026-01-04 18:34:04 -08:00

custom_op.py

[MM][OOT] Support CPU seq_lens for OOT MMEncoderAttention kernels (#36605 )

2026-03-12 03:28:23 -07:00

parameter.py

[QeRL] Layerwise Reloading (#32133 )

2026-01-30 08:50:05 -07:00

utils.py

[BugFix] Fix EPLB fail for MoeFP4 model with Marlin backend (#33262 )

2026-01-29 16:52:11 +08:00