This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
DeepGEMM
Watch
1
Star
0
Fork
0
You've already forked DeepGEMM
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
ac428e25e0dbbb44a302ccf7c23a724206addbb3
DeepGEMM
/
deep_gemm
/
include
/
deep_gemm
History
shixianc
0c88cd0139
Fix illegal memory address when skipping -1 m indices (
#113
)
...
Co-authored-by: Shixian Cui <
shixian@amazon.com
>
2025-06-16 10:44:31 +08:00
..
fp8_gemm.cuh
Grouped GEMM skip useless computation for unaligned Ms (
#103
)
2025-05-27 13:43:38 +08:00
fp8_wgrad_gemm.cuh
Add a missing
#pragma once
2025-05-15 18:10:05 +08:00
mma_utils.cuh
Weight gradient kernels for dense and MoE models (
#95
)
2025-05-14 14:47:58 +08:00
nvrtc_std.cuh
Refactor JIT compilation (+NVRTC support) (
#94
)
2025-05-07 11:38:14 +08:00
scheduler.cuh
Fix illegal memory address when skipping -1 m indices (
#113
)
2025-06-16 10:44:31 +08:00
tma_utils.cuh
Refactor JIT compilation (+NVRTC support) (
#94
)
2025-05-07 11:38:14 +08:00
utils.cuh
Refactor JIT compilation (+NVRTC support) (
#94
)
2025-05-07 11:38:14 +08:00