LJC00118
|
6bc75b549e
|
Fix smxx layout assertion (#141)
* Fix assertion error in smxx_layout.hpp for mn % 4 != 0 cases
* Align submodule files
* fix(smxx_layout): support mn%4!=0 and num_groups>1 via torch
* fix: correct logic for entering get_mn_major_tma_aligned_packed_ue8m0_tensor_torch
|
2025-07-30 10:36:54 +08:00 |
|