17 Commits

Author SHA1 Message Date
fe3431e0b2 Docs: Update STAGE_D.md, README.md with hd=512 compilation blocker, lessons learned 2026-05-24 21:35:25 +00:00
f3d93bc810 Docs: Update STAGE_D.md, README.md status for D1 hd≤256 milestone 2026-05-24 04:32:43 +00:00
0fa1189937 Update STAGE_D.md with D5b results: merge cos 0.961, LSE err=0.0 2026-05-23 21:45:22 +00:00
0fe8bc7355 D5b MILESTONE: SWA+sink merge works! cos 0.969
- Run FMHA twice (compressed KV + SWA KV) with normalized O + LSE
- Merge with sink weights in Python
- LSE err=0.0, merge cos=0.969 PASS
- Update STAGE_D.md: D5b done, D5c/D5d are optimizations
2026-05-23 21:36:26 +00:00
6edb7a91a7 Update STAGE_D.md: D5a done, CG-2/CG-3 status updated, tOrP0 offset rule added 2026-05-23 21:16:52 +00:00
e3b2cbeaed Update STAGE_D.md: manual SMEM addressing blocked on layout mapping 2026-05-23 19:22:28 +00:00
fc3e9bf0ae auto: pre-test commit 2026-05-23 19:20:42 +00:00
3b6aab041a auto: pre-test commit 2026-05-23 19:14:02 +00:00
0c5a42c056 Update STAGE_D.md with current action plan - starting NVFP4-0 verification and D1.3 validation on B200 2026-05-23 19:09:56 +00:00
2d18fc9c1d 📋 Update STAGE_D.md: D1.3 SOLVED, D1.4 IMPLEMENTED, D1.5 🟡 complex refactor, checklist updated 2026-05-23 18:37:53 +00:00
54104eeb8f 🎉 Mark D1.3 as SOLVED! SMEM-P rank mismatch fixed, enables hd>64 support 2026-05-23 18:26:15 +00:00
c6f09d2ab8 Update STAGE_D.md checklist with current progress and lessons learned 2026-05-23 09:27:48 +00:00
324fce3f63 docs: add NVFP4 precision roadmap to STAGE_D.md (3 honest buckets + speculative bucket) 2026-05-23 07:39:09 +00:00
4eccbb05c1 shit carmine left dangling 2026-05-23 06:55:22 +00:00
fe81eba7aa D1.2: TMEM budget verified on B200. Split-PV mandatory at hd=512 (MMA max N=256) 2026-05-23 06:43:01 +00:00
17b40eb3f8 STAGE_D.md: restructure with correctness gaps, TMEM budget, execution order 2026-05-23 06:31:37 +00:00
4f8a1b0eb5 Add STAGE_D.md: step-by-step runbook and todo list for D1-D5 2026-05-23 05:52:03 +00:00
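
The D5b commit (0fe8bc7355) describes running FMHA twice, once over the compressed KV and once over the SWA KV, each pass returning a normalized O and an LSE, and then merging the two results with sink weights in Python. A minimal sketch of such a merge for a single query row follows. The function names, and the assumption that the sink logit adds only to the softmax denominator, are illustrative and are not taken from the repository.

```python
import math


def attend(scores, values):
    """Reference single-pass attention for one query row.

    Returns the normalized output O and the log-sum-exp (LSE) of the scores.
    """
    m = max(scores)
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    out = [sum(w * v[d] for w, v in zip(weights, values)) / total
           for d in range(dim)]
    return out, m + math.log(total)


def merge_with_sink(o_a, lse_a, o_b, lse_b, sink_logit):
    """Merge two normalized partial outputs computed over disjoint key sets.

    Assumption (hypothetical): the sink is a scalar logit that contributes
    to the softmax denominator only and carries no value vector.
    """
    # Subtract the running maximum before exponentiating, for stability.
    m = max(lse_a, lse_b, sink_logit)
    w_a = math.exp(lse_a - m)
    w_b = math.exp(lse_b - m)
    w_sink = math.exp(sink_logit - m)
    denom = w_a + w_b + w_sink
    merged = [(w_a * a + w_b * b) / denom for a, b in zip(o_a, o_b)]
    # LSE over the real keys only (the sink is excluded here).
    merged_lse = m + math.log(w_a + w_b)
    return merged, merged_lse
```

Because each pass reports its own LSE, the merge can recover each pass's unnormalized softmax mass as exp(LSE) and reweight the two outputs without revisiting the keys. Comparing the merged LSE against a single pass over all keys is one way to obtain an LSE error figure like the one the commit reports.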