biondizzle/vllm
6557f4937fd2937ae4824beb492ff67625895d89
vllm/tests/model_executor/model_loader
Latest commit: d28d86e8a3 by Kyle Sayers, 2026-03-29 14:56:41 -06:00
[QeRL] Fix online quantized reloading (#38442)
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
Name | Last commit | Date
fastsafetensors_loader | [BugFix] [FEAT] Enable fastsafetensors for ROCm platform (#28225) | 2025-11-20 16:34:11 +00:00
instanttensor_loader | [Feature] Add InstantTensor weight loader (#36139) | 2026-03-14 18:05:23 +01:00
runai_streamer_loader | [Feature] Add Azure Blob Storage support for RunAI Model Streamer (#34614) | 2026-03-15 19:38:21 +08:00
tensorizer_loader | [Hardware] Replace torch.cuda.device_count/current_device/set_device API (#36145) | 2026-03-12 07:57:47 -07:00
__init__.py | … | …
test_ep_weight_filter.py | [Performance][Model Loader] Skip non-local expert weights during EP model loading (#37136) | 2026-03-16 01:33:36 -07:00
test_registry.py | Convert formatting to use ruff instead of yapf + isort (#26247) | 2025-10-05 07:06:22 -07:00
test_reload.py | [QeRL] Fix online quantized reloading (#38442) | 2026-03-29 14:56:41 -06:00
test_sharded_state_loader.py | [ROCm][CI] Force max_num_seqs=1 on ROCm In test_sharded_state_loader to reduce flakiness (#33277) | 2026-01-31 12:28:29 +08:00