Lora MoE Align Improvements (#29257)

Signed-off-by: gnovack <gnovack@amazon.com>
This commit is contained in:
gnovack
2025-12-08 18:35:16 -08:00
committed by GitHub
parent db14f61f2d
commit ea657f2078
7 changed files with 360 additions and 249 deletions

View File

@@ -47,7 +47,8 @@ TORCH_LIBRARY_EXPAND(TORCH_EXTENSION_NAME, m) {
" Tensor !experts_ids,"
" Tensor !num_tokens_post_pad,"
" Tensor !adapter_enabled,"
" Tensor !lora_ids) -> () ");
" Tensor !lora_ids,"
" Tensor? maybe_expert_map) -> () ");
m.impl("moe_lora_align_block_size", torch::kCUDA, &moe_lora_align_block_size);
#ifndef USE_ROCM