Files
vllm/benchmarks/cutlass_benchmarks/w8a8_benchmarks.py
Varun Sundar Rabindranath 35e9c12bfa [Kernel] Tuned int8 Cutlass Kernels for SM75 (T4) (#6996)
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
2024-07-31 14:40:32 -07:00

12 KiB