This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
DeepGEMM
Watch
1
Star
0
Fork
0
You've already forked DeepGEMM
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
6a348d543de00e0bea16f3ed93dac68bf621c308
DeepGEMM
/
csrc
/
jit_kernels
History
biondizzle
6a348d543d
fix: use raw cudaDeviceSynchronize instead of DG_CUDA_CHECK macro
2026-05-13 12:17:26 +00:00
..
heuristics
NVFP4: fix SF pipeline — 2 K-cols per BLOCK_K for group=16
2026-05-12 08:08:17 +00:00
impls
fix: use raw cudaDeviceSynchronize instead of DG_CUDA_CHECK macro
2026-05-13 12:17:26 +00:00