vllm/csrc/torch_bindings.cpp at 451da4bcbdc2dcabf3e319b4a82b72674c33f4de - vllm

biondizzle/vllm

Tao He 60f7624334 Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)

2025-05-12 19:52:47 -07:00

28 KiB
