biondizzle/vllm
bdb903bb5f4b943ad2a2d1c08f1f70d866e26496
vllm/tests/model_executor
Roy Wang 821eb80c0d [Performance][Model Loader] Skip non-local expert weights during EP model loading (#37136)
Signed-off-by: esmeetu <jasonailu87@gmail.com>
2026-03-16 01:33:36 -07:00
Name | Last commit | Date
model_loader | [Performance][Model Loader] Skip non-local expert weights during EP model loading (#37136) | 2026-03-16 01:33:36 -07:00
__init__.py | …
test_cpu_unquantized_gemm_dispatch.py | In-Tree AMD Zen CPU Backend via zentorch [1/N] (#35970) | 2026-03-15 23:35:35 +00:00
test_eagle_quantization.py | [Hardware] Replace torch.cuda.device_count/current_device/set_device API (#36145) | 2026-03-12 07:57:47 -07:00
test_enabled_custom_ops.py | …
test_model_load_with_params.py | …
test_oink_integration.py | …
test_qwen3_omni.py | …
test_qwen3_vl_mrope.py | [Bugfix] Fix EVS implementation for Qwen3 VL (#33607) | 2026-03-04 02:18:11 +00:00
test_routed_experts_capture.py | …
test_weight_utils.py | [Bugfix][Model] Fix FP8 k_scale/v_scale not loaded for Qwen3-MoE (#35656) | 2026-03-04 13:15:38 +00:00