[4/N][Attention] Move MLA common to model_executor (#32060)
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com> Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
This commit is contained in:
2112
vllm/model_executor/layers/attention/mla_attention.py
Executable file
2112
vllm/model_executor/layers/attention/mla_attention.py
Executable file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user