vllm/csrc/quantization/fp4 at ab3a85fd688dc2544edab3098fab8e2c8893a707 - vllm

Files

Michael Goin 06d490282f [NVFP4][Perf] Tune NVFP4 input quant kernel for small batch size (#30897 )

Signed-off-by: mgoin <mgoin64@gmail.com>

2025-12-21 09:41:57 -08:00

activation_nvfp4_quant_fusion_kernels.cu

2025-12-21 09:41:57 -08:00

nvfp4_blockwise_moe_kernel.cu

2025-11-25 06:59:07 -08:00

nvfp4_experts_quant.cu

2025-12-21 09:41:57 -08:00

nvfp4_quant_entry.cu

2025-11-25 06:59:07 -08:00

nvfp4_quant_kernels.cu

2025-12-21 09:41:57 -08:00

nvfp4_scaled_mm_entry.cu

2025-12-01 17:24:18 -08:00

nvfp4_scaled_mm_kernels.cu

2025-07-11 10:05:33 -06:00

nvfp4_scaled_mm_sm120_kernels.cu

2025-08-03 00:54:22 -07:00

nvfp4_utils.cuh

2025-12-21 09:41:57 -08:00