vllm/vllm/model_executor at 59a85c366ef3666d22b57f952979a3f74ee50f61

biondizzle/vllm

Cyrus Leung 59a85c366e [Model] Use merge_by_field_config for MM models (H-L) (#26230)

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

2025-10-05 11:54:17 +08:00

Revert "Add batch invariant kernel override for FlashInfer backend [2/n]" (#26220)

2025-10-04 02:45:08 -07:00

[Quantization/NVFP4] Speed up TRTLLM NVFP4 MOE weight loading and fix K/V scale loading for MLA Attn (#25968)

2025-10-03 19:35:06 +00:00

[Model] Use merge_by_field_config for MM models (H-L) (#26230)

2025-10-05 11:54:17 +08:00

[V0 deprecation] Remove _VLLM_V1 suffixes from attention backend names (#25489)

2025-09-25 17:37:50 +00:00

__init__.py

[V0 Deprecation] Remove V0 sampling metadata (#25345)

2025-09-21 10:37:11 -07:00

custom_op.py

[V0 deprecation] Deprecate V0 Neuron backend (#21159)

2025-09-06 16:15:18 -07:00

parameter.py

Revert "[Bug] Dynamo Unsupported due to BasevLLMParameter.torch_function calling disabled super()" (#25681)

2025-09-25 09:45:06 -07:00

utils.py

[OOT] Support sync_model_loading for OOT (#25126)

2025-09-19 05:41:53 +00:00