vllm/csrc/quantization/fp8 at ba5c5e5404d2d3fdee02e163fc75a44bd960935f - vllm

Files

Wentao Ye 75d29cf4e1 [Perf] Cuda Kernel for Int8 Per Token Group Quant (#21476 )

Signed-off-by: yewentao256 <zhyanwentao@126.com>

2025-07-25 17:07:07 -07:00

2025-06-15 20:05:28 -07:00

2024-08-05 16:00:01 -04:00

common.cu

2025-07-22 07:07:44 -07:00

common.cuh

2025-06-03 13:48:25 -07:00

per_token_group_quant.cu

2025-07-25 17:07:07 -07:00