vllm/tests/compile/fusions_e2e at 0d81a1fe6190f47379c9905be5757e7b6bba5d14 - vllm

Files

elvischenv 296839a1b0 [Perf] Eliminate padding and slicing op for GPT-OSS with Flashinfer MXFP4 MXFP8 MoE (#30647 )

Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>

2026-03-18 15:01:26 +00:00

__init__.py

2026-02-04 19:09:03 -05:00

common.py

2026-03-03 06:24:21 -08:00

conftest.py

2026-03-18 15:01:26 +00:00

models.py

2026-03-18 15:01:26 +00:00

test_tp1_quant.py

2026-03-11 10:56:55 -07:00

test_tp2_ar_rms.py

2026-03-18 15:01:26 +00:00

test_tp2_async_tp.py

2026-03-03 06:24:21 -08:00