The epilogue needs tmem_ptr for epilogue_tma_store. It must be part of the tmem alloc barrier to synchronize.