vllm/csrc/quantization/fp4 at 7311f74468d2ba4f89658aa0fedf3811f8769b30 - vllm

Files

Michael Goin d47661f0cd [Kernel] Basic tuned configs for NVFP4 CUTLASS dense GEMM (#20646 )

Signed-off-by: mgoin <mgoin64@gmail.com>

2025-07-11 10:05:33 -06:00

nvfp4_blockwise_moe_kernel.cu

2025-07-01 18:05:47 -07:00

nvfp4_experts_quant.cu

2025-06-27 09:01:28 -07:00

nvfp4_quant_entry.cu

2025-05-09 16:24:41 -07:00

nvfp4_quant_kernels.cu

2025-06-27 09:01:28 -07:00

nvfp4_scaled_mm_entry.cu

2025-03-12 05:13:11 +00:00

nvfp4_scaled_mm_kernels.cu

2025-07-11 10:05:33 -06:00