vllm/csrc/torch_bindings.cpp at dd5fa7e04f7544dca276701816453e8cc31fb7de - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

Tao He 60f7624334 Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844 )

2025-05-12 19:52:47 -07:00

28 KiB

Raw Blame History

View Raw