vllm/csrc/ops.h at 6881107948c00a8564bc2fa85308f6fc2f065d64 - vllm

biondizzle/vllm

Tao He 60f7624334 Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)

2025-05-12 19:52:47 -07:00

16 KiB
