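(silu_result > limit).float() doesn't work on a TensorSSA: the PyTorch-style .float() cast isn't available there. To turn the comparison into a float mask, use cute.where(cond, true_val, false_val), the correct TensorSSA API, passing the comparison as cond and the values to select as true_val and false_val.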
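
To make the intended semantics concrete, here is a plain-Python reference model of the element-wise select. It is not CuTe DSL code and does not call into cutlass; where_ref, clamp_mask_ref and the sample values are hypothetical names chosen for illustration, and the sketch assumes cute.where picks true_val where cond holds and false_val elsewhere, element by element.

```python
def where_ref(cond, true_val, false_val):
    """Reference model of an element-wise select (hypothetical helper).

    Returns true_val[i] where cond[i] is true, otherwise false_val[i].
    """
    if not (len(cond) == len(true_val) == len(false_val)):
        raise ValueError("cond, true_val and false_val must have the same length")
    return [t if c else f for c, t, f in zip(cond, true_val, false_val)]


def clamp_mask_ref(silu_result, limit):
    """Build the 0/1 float mask that (silu_result > limit).float() was meant to give."""
    cond = [v > limit for v in silu_result]   # boolean comparison, element-wise
    ones = [1.0] * len(cond)                  # value selected where cond is true
    zeros = [0.0] * len(cond)                 # value selected where cond is false
    return where_ref(cond, ones, zeros)


if __name__ == "__main__":
    # Only the element strictly above the limit is marked with 1.0.
    print(clamp_mask_ref([0.5, 7.0, 7.5, -1.0], limit=7.0))
```

In the kernel the same pattern would use the comparison on silu_result as cond, with true_val and false_val as TensorSSA values of matching shape holding 1.0 and 0.0. How those constant operands are built depends on the CuTe DSL version in use.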