vllm/csrc/quantization/fp4 at ec8ab9d254d3b2e6b919a55277da599a7b9ab146 - vllm

Files

Roberto L. Castro 86c3b5a808 [BugFix] Fix fp4 quant kernel on CUDA 12.8 (#35210)

Signed-off-by: LopezCastroRoberto <rocastro@redhat.com>

2026-02-25 18:32:50 -08:00

activation_nvfp4_quant_fusion_kernels.cu  2026-02-25 18:32:50 -08:00
nvfp4_blockwise_moe_kernel.cu             2026-01-07 13:31:26 -05:00
nvfp4_experts_quant.cu                    2026-01-24 18:45:27 -07:00
nvfp4_quant_entry.cu                      2026-01-24 18:45:27 -07:00
nvfp4_quant_kernels.cu                    2026-02-25 18:32:50 -08:00
nvfp4_scaled_mm_entry.cu                  2025-12-01 17:24:18 -08:00
nvfp4_scaled_mm_kernels.cu                2025-07-11 10:05:33 -06:00
nvfp4_scaled_mm_sm120_kernels.cu          2025-08-03 00:54:22 -07:00
nvfp4_utils.cuh                           2026-01-24 18:45:27 -07:00