vllm/csrc/quantization/cutlass_w8a8/scaled_mm_c2x.cu
Varun Sundar Rabindranath af647fb8b3 [Kernel] Tuned int8 kernels for Ada Lovelace (#6848)
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
2024-07-29 20:24:58 -06:00

6.3 KiB