vllm/vllm/v1/attention/backends at b6be6f8d1e49d4aa884603e8675dc216be1cbd79 - vllm

Files

iefgnoix b6be6f8d1e [TPU] Support sliding window and logit soft capping in the paged attention kernel for TPU. (#15732)

Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>

2025-04-03 14:23:28 -07:00

2025-03-23 15:07:04 -07:00

__init__.py      2024-10-22 01:24:07 -07:00
flash_attn.py    2025-03-23 15:07:04 -07:00
pallas.py        2025-04-03 14:23:28 -07:00
triton_attn.py   2025-04-02 19:48:00 -07:00