vllm/tests/kernels at 3b3b778d4af545a30290275d3154bb0e514d2dcc - vllm

Files

Wentao Ye 42d440c22b [Perf] Use Triton instead of Torch for DeepGEMM Per Token Group Quant (#20841)

Signed-off-by: yewentao256 <zhyanwentao@126.com>

2025-07-12 19:38:45 -07:00

2025-07-11 09:23:23 +00:00

2025-06-11 19:57:10 -07:00

2025-07-09 12:53:55 -07:00

2025-07-12 19:38:45 -07:00

2025-07-12 19:38:45 -07:00

__init__.py                           2024-05-13 23:50:09 +09:00
allclose_default.py                   2025-06-03 11:20:17 -07:00
quant_utils.py                        2025-07-03 14:55:40 -07:00
test_apply_repetition_penalties.py    2025-07-05 19:38:02 -07:00
test_cutlass_mla_decode.py            2025-06-03 21:40:26 -07:00
test_flex_attention.py                2025-07-04 07:40:42 +00:00
test_fused_quant_activation.py        2025-06-03 11:20:17 -07:00
test_triton_flash_attention.py        2025-06-03 11:20:17 -07:00
utils.py                              2025-07-11 07:51:46 -07:00