vllm/csrc/quantization/w8a8 at a5f9fb59604f3a84e8be1317e33b2d368c9fc6f9 - vllm

Files

czhu-cohere f6227c22ab [Kernel]Support W4A8 Grouped GEMM on Hopper (#29691)

Signed-off-by: czhu-cohere <conway.zhu@cohere.com>

2025-12-08 19:29:06 -08:00

2025-12-08 19:29:06 -08:00

2025-12-07 20:31:14 +08:00

2025-11-08 14:31:33 -08:00

per_token_group_quant_8bit.h  2025-10-08 10:20:48 -04:00