biondizzle
97656a5cd1
Stage B: two MMAs + identity softmax — crash fixed, softmax output still wrong
Key fixes:
- PipelineUmmaAsync consumer group: 32*4=128 threads (not 4 warps)
- TMEM offsets computed from find_tmem_tensor_col_offset (not hardcoded)
- P fragment from p_tmem_s.outer + make_fragment_A (matching fmha.py)
- V SMEM aliasing via recast_ptr
Status:
- Stage A: cosine 0.999999 ✅
- Stage B: runs without crash, identity softmax cosine -0.02 ❌
- Diagnostics: TMEM layout inspection, bisection results
2026-05-20 20:26:25 +00:00
..
2026-05-16 19:07:36 +00:00
2026-05-17 16:52:40 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 04:13:52 +00:00
2026-05-16 02:13:18 +00:00
2026-05-16 02:21:17 +00:00
2026-05-20 20:26:25 +00:00
2026-05-19 07:54:01 +00:00
2026-05-18 20:14:03 +00:00
2026-05-16 02:14:37 +00:00
2026-05-19 15:30:29 +00:00
2026-05-19 01:57:16 +00:00
2026-05-19 07:58:10 +00:00
2026-05-19 16:04:53 +00:00
2026-05-19 01:54:48 +00:00
2026-05-16 03:04:31 +00:00
2026-05-19 15:28:52 +00:00
2026-05-20 05:46:15 +00:00
2026-05-19 16:00:33 +00:00
2026-05-19 15:55:41 +00:00
2026-05-20 02:11:40 +00:00
2026-05-19 07:17:37 +00:00
2026-05-19 18:36:49 +00:00
2026-05-19 09:04:19 +00:00
2026-05-20 03:10:56 +00:00
2026-05-20 03:04:38 +00:00
2026-05-20 03:04:38 +00:00
2026-05-19 03:22:00 +00:00
2026-05-19 08:58:46 +00:00
2026-05-19 07:49:41 +00:00
2026-05-19 18:34:12 +00:00
2026-05-19 18:35:40 +00:00
2026-05-17 22:58:27 +00:00
2026-05-19 08:38:55 +00:00
2026-05-19 08:55:31 +00:00
2026-05-19 03:58:25 +00:00
2026-05-19 06:37:25 +00:00
2026-05-19 06:30:18 +00:00
2026-05-17 23:04:44 +00:00
2026-05-16 02:14:37 +00:00
2026-05-19 10:31:07 +00:00
2026-05-17 07:33:20 +00:00
2026-05-17 07:43:05 +00:00
2026-05-17 07:37:47 +00:00
2026-05-18 20:10:32 +00:00
2026-05-20 03:10:56 +00:00
2026-05-19 09:02:12 +00:00
2026-05-20 05:46:15 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 20:26:25 +00:00
2026-05-20 03:26:20 +00:00
2026-05-20 03:26:20 +00:00
2026-05-20 20:26:25 +00:00
2026-05-16 02:14:37 +00:00
2026-05-19 08:51:16 +00:00
2026-05-19 17:26:50 +00:00
2026-05-17 08:24:27 +00:00
2026-05-19 04:10:02 +00:00
2026-05-19 02:37:50 +00:00