vllm/csrc/torch_bindings.cpp at e57e4d6e9e3aa9987c1cffe4724d59d52b97c44e - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

Tao He 60f7624334 Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844 )

2025-05-12 19:52:47 -07:00

28 KiB

Raw Blame History

View Raw