Files
nvfp4-megamoe-kernel/dsv4/kernels
biondizzle 60715f89bc Fix CUDA kernel compilation: use c10::cuda::getCurrentCUDAStream
- amax_gsa.cu: fix at::cuda::getCurrentCUDAStream → c10::
- amax_gsa.cu: fix torch::TensorOptions().device() → x.options()
- sampler.cu: same fixes for compilation on B200
- Both kernels now compile cleanly with torch.utils.cpp_extension.load
2026-06-01 20:49:55 +00:00
..