Files
vllm/tests/kernels/quantization/test_flashinfer_scaled_mm.py
nvjullin f66673a39d [Kernel] Added flashinfer fp8 per-tensor gemms (#22895)
Signed-off-by: Julien Lin <jullin@nvidia.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
2025-08-26 06:54:04 -07:00

2.0 KiB