vllm/vllm/model_executor/layers/fla at a5464dcf92bba8dfd052fc79bfc40e08aee515d9 - vllm

Files

Vadim Gimpelson 785d8b6410 [PERF] Qwen3-next MTP speedup (change bool mask indexing to index_select / index_copy to reduce d2h) (#26437 )

Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>

2025-10-16 12:18:31 +08:00

2025-10-16 12:18:31 +08:00

__init__.py

2025-09-10 00:04:41 +08:00