Files
vllm/csrc/quantization/cutlass_w8a8/scaled_mm_c2x.cu
Varun Sundar Rabindranath 35e9c12bfa [Kernel] Tuned int8 Cutlass Kernels for SM75 (T4) (#6996)
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
2024-07-31 14:40:32 -07:00

5.9 KiB