D1.5: Add TODO for correction epilog - keeping working TMEM round-trip for now
This commit is contained in:
@@ -410,6 +410,7 @@ class FmhaKernel:
|
||||
final_o_bar.arrive_and_wait()
|
||||
|
||||
# === NO-OP TMEM round-trip: re-map O from MMA layout to epilog layout ===
|
||||
# TODO: Replace with correction epilog (D1.5) for zero-error one-way trip
|
||||
tTMrO_noop = cute.make_rmem_tensor(
|
||||
(tTMEM_LOADcO.shape, 128 // corr_tile_size), self.acc_dtype
|
||||
)
|
||||
|
||||
Reference in New Issue
Block a user