Weight K dimension (hidden) must be the packed dimension, not N. Block scales computed along K dim. FP4 packing along K.