vllm/tests/compile/fusions_e2e at fa6a6be51978bd4b49ba0da17039e60f96dc5b13 - vllm

Files

Jason Li 9d37941017 [torch.compile] Sequence Parallelism threshold compile ranges (#28672 )

Signed-off-by: jasonlizhengjian <jasonlizhengjian@gmail.com>
Signed-off-by: Jason Li <jasonlizhengjian@gmail.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>

2026-02-26 05:00:12 +00:00

__init__.py

[CI][torch.compile] Reduce e2e fusion test time (#33293 )

2026-02-04 19:09:03 -05:00

common.py

[torch.compile] Reorganize vllm/compilation and tests/compile (0/N for vLLM IR) (#33731 )

2026-02-06 04:19:49 -08:00

conftest.py

[torch.compile] Sequence Parallelism threshold compile ranges (#28672 )

2026-02-26 05:00:12 +00:00

models.py

[Bugfix] Fix QK Norm+RoPE fusion pattern matching on B200+FP8 (#33967 )

2026-02-07 02:27:33 +00:00

test_tp1_quant.py

[Perf] Enable FlashInfer DeepGEMM swapAB on SM90 by default (#34924 )

2026-02-23 17:34:41 -08:00

test_tp2_ar_rms.py

[CI][torch.compile] Reduce e2e fusion test time (#33293 )

2026-02-04 19:09:03 -05:00

test_tp2_async_tp.py

[torch.compile] Sequence Parallelism threshold compile ranges (#28672 )

2026-02-26 05:00:12 +00:00