vllm/csrc/quantization/fp4 at 1a4f35e2eaa3ebdecb8ef9ff8302b01e289305c9 - vllm


Tyler Michael Smith 3be8d312a2 [Kernel][Bugfix] Fixup some warnings in nvfp4_blockwise_moe when CUDA < 12.8 (#20324)

Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>

2025-07-01 18:05:47 -07:00

nvfp4_blockwise_moe_kernel.cu   2025-07-01 18:05:47 -07:00
nvfp4_experts_quant.cu          2025-06-27 09:01:28 -07:00
nvfp4_quant_entry.cu            2025-05-09 16:24:41 -07:00
nvfp4_quant_kernels.cu          2025-06-27 09:01:28 -07:00
nvfp4_scaled_mm_entry.cu        2025-03-12 05:13:11 +00:00
nvfp4_scaled_mm_kernels.cu      2025-06-27 09:01:28 -07:00