biondizzle
467ade37b2
Stage B: C-fragment vs A-fragment TMEM layout mismatch diagnosed
Key finding: C-fragment and A-fragment use different physical TMEM address
mappings. St32x32bOp with C-fragment writes to C-layout addresses, but PV MMA
reads from A-layout addresses. Forward FMHA recast validated FP16 only, not BF16.
Working: FP32 ld/st roundtrip, BF16 elemwise, BF16 recast ld S0->st S1 (all cos 0.999999)
Broken: C-frag st + A-frag read (NaN), A-frag store + PV MMA (cos -0.02)
Next: Fix register data flow (128 FP16/thread load vs 64 BF16/thread store mismatch)
2026-05-21 00:12:47 +00:00
..
2026-05-16 19:07:36 +00:00
2026-05-17 16:52:40 +00:00
2026-05-20 20:26:25 +00:00
2026-05-21 00:12:47 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 04:13:52 +00:00
2026-05-16 02:13:18 +00:00
2026-05-16 02:21:17 +00:00
2026-05-20 20:26:25 +00:00
2026-05-21 00:12:47 +00:00
2026-05-19 07:54:01 +00:00
2026-05-18 20:14:03 +00:00
2026-05-21 00:12:47 +00:00
2026-05-16 02:14:37 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-19 15:30:29 +00:00
2026-05-19 01:57:16 +00:00
2026-05-19 07:58:10 +00:00
2026-05-19 16:04:53 +00:00
2026-05-19 01:54:48 +00:00
2026-05-16 03:04:31 +00:00
2026-05-19 15:28:52 +00:00
2026-05-20 05:46:15 +00:00
2026-05-19 16:00:33 +00:00
2026-05-19 15:55:41 +00:00
2026-05-21 00:12:47 +00:00
2026-05-20 02:11:40 +00:00
2026-05-19 07:17:37 +00:00
2026-05-19 18:36:49 +00:00
2026-05-19 09:04:19 +00:00
2026-05-20 03:10:56 +00:00
2026-05-20 03:04:38 +00:00
2026-05-20 03:04:38 +00:00
2026-05-19 03:22:00 +00:00
2026-05-19 08:58:46 +00:00
2026-05-19 07:49:41 +00:00
2026-05-19 18:34:12 +00:00
2026-05-19 18:35:40 +00:00
2026-05-17 22:58:27 +00:00
2026-05-19 08:38:55 +00:00
2026-05-19 08:55:31 +00:00
2026-05-19 03:58:25 +00:00
2026-05-19 06:37:25 +00:00
2026-05-19 06:30:18 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-17 23:04:44 +00:00
2026-05-16 02:14:37 +00:00
2026-05-21 00:12:47 +00:00
2026-05-19 10:31:07 +00:00
2026-05-17 07:33:20 +00:00
2026-05-17 07:43:05 +00:00
2026-05-17 07:37:47 +00:00
2026-05-18 20:10:32 +00:00
2026-05-20 03:10:56 +00:00
2026-05-19 09:02:12 +00:00
2026-05-20 05:46:15 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-20 03:26:20 +00:00
2026-05-20 03:26:20 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-20 20:26:25 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-21 00:12:47 +00:00
2026-05-16 02:14:37 +00:00
2026-05-19 08:51:16 +00:00
2026-05-19 17:26:50 +00:00
2026-05-17 08:24:27 +00:00
2026-05-19 04:10:02 +00:00
2026-05-19 02:37:50 +00:00