[LoRA] Update LoRA expand kernel heuristic (#32425)

Signed-off-by: Xin Yang <xyangx@amazon.com>
Xin Yang
2026-01-16 10:38:07 -08:00
committed by GitHub
parent 7a1030431a
commit a884bc62d6

@@ -251,7 +251,7 @@ def get_lora_op_configs(
     else:
         default = {
             "block_m": 64,
-            "block_n": 128,
+            "block_n": max(64, next_power_of_2(128 // num_slices)),
             "block_k": 16,
             "num_warps": 4,
             "num_ctas": 1,
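
Note on the new default: the sketch below evaluates the updated `block_n` expression for a few slice counts. It is a minimal illustration, not the project's code. The local `next_power_of_2` is an assumed stand-in for the helper the diff calls (rounding up to the nearest power of two), and `expand_block_n` is a hypothetical wrapper name used only here.

```python
def next_power_of_2(n: int) -> int:
    # Assumed stand-in for the helper used in the diff:
    # smallest power of two that is >= n (and 1 for n < 1).
    if n < 1:
        return 1
    return 1 << (n - 1).bit_length()


def expand_block_n(num_slices: int) -> int:
    # Hypothetical wrapper around the expression added in this commit.
    return max(64, next_power_of_2(128 // num_slices))


if __name__ == "__main__":
    for num_slices in (1, 2, 3, 4, 8):
        print(num_slices, expand_block_n(num_slices))
```

Under these assumptions the default stays at 128 for a single slice and drops to the floor of 64 for any larger slice count, since `128 // num_slices` is at most 64 once `num_slices >= 2`.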