This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
nvfp4-megamoe-kernel
Watch
1
Star
0
Fork
0
You've already forked nvfp4-megamoe-kernel
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
22a2fc563eee9cc7ff63fc89e1057516c646fcbc
nvfp4-megamoe-kernel
/
dsv4
/
kernels
/
attention
History
biondizzle
6cc151097e
Revert D2 multi-CTA attempts - keeping per-head launch approach (works correctly)
2026-05-25 01:08:38 +00:00
..
__init__.py
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00
fmha_backup_pre_epilog.py
D1.5: Replace TMEM round-trip normalize with correction epilog (one-way: TMEM→reg→SMEM→GMEM)
2026-05-24 00:24:24 +00:00
fmha.py
Revert D2 multi-CTA attempts - keeping per-head launch approach (works correctly)
2026-05-25 01:08:38 +00:00
fmha.py.backup
auto: pre-test commit
2026-05-23 20:08:31 +00:00