This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
nvfp4-megamoe-kernel
Watch
1
Star
0
Fork
0
You've already forked nvfp4-megamoe-kernel
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
23d77286ff9d9cc0816c4796e363de50afeb7367
nvfp4-megamoe-kernel
/
tests
/
unit
History
biondizzle
23d77286ff
fix: correction_epilog with get_tmem_load_op paired atoms + direct TMA store
2026-05-23 02:19:41 +00:00
..
__init__.py
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00
cudagraph_test.py
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00
layertest.py
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00
test_cutedsl.py
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00
test_fmha_v3_stage_c.py
fix: correction_epilog with get_tmem_load_op paired atoms + direct TMA store
2026-05-23 02:19:41 +00:00
test_fmha_v3.py
FIX: (None,0,None,0) for ALL tma_partition outputs — verified shapes on B200
2026-05-22 23:35:55 +00:00
test_fp4_roundtrip.py
Restructure: cutedsl/ -> dsv4/ with proper layering
2026-05-21 17:30:44 +00:00