vllm/tests/model_executor at 458c1a4b2d21965ecd41b76ec0506ffe5ed8c8a1 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

arlo 8c29042bb9 [Feature] Add InstantTensor weight loader (#36139 )

2026-03-14 18:05:23 +01:00

..

[Feature] Add InstantTensor weight loader (#36139 )

2026-03-14 18:05:23 +01:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

test_eagle_quantization.py

[Hardware] Replace torch.cuda.device_count/current_device/set_device API (#36145 )

2026-03-12 07:57:47 -07:00

test_enabled_custom_ops.py

[Kernel] Add topk_sigmoid kernel (#31246 )

2026-01-21 22:49:51 +00:00

test_model_load_with_params.py

[Bugfix] Replace PoolingParams.normalize with use_activation (#32243 )

2026-01-13 10:45:42 +00:00

test_oink_integration.py

[Perf] Add opt-in SM100 Oink RMSNorm custom-op path (#31828 )

2026-02-24 23:01:53 -08:00

test_qwen3_omni.py

[Refactor] Define MM data parser in processing info instead of processor itself (#33260 )

2026-01-29 13:55:17 +08:00

test_qwen3_vl_mrope.py

[Bugfix] Fix EVS implementation for Qwen3 VL (#33607 )

2026-03-04 02:18:11 +00:00

test_routed_experts_capture.py

[BugFix][Router Replay] Capture Logical Experts with EPLB (#33013 )

2026-01-31 10:12:17 -05:00

test_weight_utils.py

[Bugfix][Model] Fix FP8 k_scale/v_scale not loaded for Qwen3-MoE (#35656 )

2026-03-04 13:15:38 +00:00