vllm/csrc/quantization/w8a8 at 1ac2ef2e5335ca0af99aee438998c9305461f563 - vllm

Necofish e7221180e1 [Kernel] Optimize SM120 CUTLASS blockwise FP8 GEMM (#37970)

Signed-off-by: Necofish <liuxiangyang@mail.ustc.edu.cn>
Co-authored-by: Michael Goin <mgoin64@gmail.com>

2026-03-25 08:20:04 -07:00

per_token_group_quant_8bit.h    2025-10-08 10:20:48 -04:00