[LoRA] Update LoRA expand kernel heuristic (#32425)
Signed-off-by: Xin Yang <xyangx@amazon.com>
This commit is contained in:
@@ -251,7 +251,7 @@ def get_lora_op_configs(
|
||||
else:
|
||||
default = {
|
||||
"block_m": 64,
|
||||
"block_n": 128,
|
||||
"block_n": max(64, next_power_of_2(128 // num_slices)),
|
||||
"block_k": 16,
|
||||
"num_warps": 4,
|
||||
"num_ctas": 1,
|
||||
|
||||
Reference in New Issue
Block a user