vllm/csrc/quantization/fp4 at 4af9ed21cba9e4bb85cd7cc124aa6f23cd0ae9a5 - vllm

Files

Terry Gao 3e6a1e1686 [Custom Ops] Add functional + out variant for scaled_fp4_quant (#34389)

Signed-off-by: tianrengao <terrygao87@gmail.com>

2026-03-16 18:51:46 -04:00

activation_nvfp4_quant_fusion_kernels.cu  2026-02-27 16:28:17 -08:00
nvfp4_blockwise_moe_kernel.cu             2026-01-07 13:31:26 -05:00
nvfp4_experts_quant.cu                    2026-02-27 16:28:17 -08:00
nvfp4_quant_entry.cu                      2026-03-16 18:51:46 -04:00
nvfp4_quant_kernels.cu                    2026-02-27 16:28:17 -08:00
nvfp4_scaled_mm_entry.cu                  2025-12-01 17:24:18 -08:00
nvfp4_scaled_mm_kernels.cu                2025-07-11 10:05:33 -06:00
nvfp4_scaled_mm_sm120_kernels.cu          2025-08-03 00:54:22 -07:00
nvfp4_utils.cuh                           2026-03-16 18:51:46 -04:00