vllm/tests/model_executor/model_loader at 6557f4937fd2937ae4824beb492ff67625895d89 - vllm

biondizzle/vllm

Kyle Sayers d28d86e8a3 [QeRL] Fix online quantized reloading (#38442)

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

2026-03-29 14:56:41 -06:00

fastsafetensors_loader

[BugFix] [FEAT] Enable fastsafetensors for ROCm platform (#28225)

2025-11-20 16:34:11 +00:00

instanttensor_loader

[Feature] Add InstantTensor weight loader (#36139)

2026-03-14 18:05:23 +01:00

runai_streamer_loader

[Feature] Add Azure Blob Storage support for RunAI Model Streamer (#34614)

2026-03-15 19:38:21 +08:00

tensorizer_loader

[Hardware] Replace torch.cuda.device_count/current_device/set_device API (#36145)

2026-03-12 07:57:47 -07:00

__init__.py

…

test_ep_weight_filter.py

[Performance][Model Loader] Skip non-local expert weights during EP model loading (#37136)

2026-03-16 01:33:36 -07:00

test_registry.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_reload.py

[QeRL] Fix online quantized reloading (#38442)

2026-03-29 14:56:41 -06:00

test_sharded_state_loader.py

[ROCm][CI] Force max_num_seqs=1 on ROCm In test_sharded_state_loader to reduce flakiness (#33277)

2026-01-31 12:28:29 +08:00