vllm/vllm/attention at 7e24e5d4d65abbe5ffc7e653fdfd670c7e300944

biondizzle/vllm

Cyrus Leung 5a87d8b9b1 [Deprecation] Remove deprecated plugin and compilation fields for v0.13 release (#30396)

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

2025-12-10 19:59:35 -08:00

[Deprecation] Remove deprecated plugin and compilation fields for v0.13 release (#30396)

2025-12-10 19:59:35 -08:00

[Attention] Make seq_lens_cpu optional in CommonAttentionMetadata to enable true async spec-decode (#29624)

2025-12-09 17:18:10 -08:00

[Perf] Remove sync point in vit torch sdpa attn backend (#30232)

2025-12-08 07:12:42 +00:00

[Attention][UX][1/N] Add AttentionConfig and change attention env vars to CLI arguments (#26315)

2025-12-05 09:48:43 -08:00

__init__.py

[Attention] Remove imports from vllm/attention/__init__.py (#29342)

2025-11-26 10:53:15 -07:00

layer.py

[CI/Build] Make test_mha_attn.py run on correct platform only and check for flash_attn_varlen_func in layer.py (#29145)

2025-12-09 20:18:17 +00:00

selector.py

[Deprecation] Remove deprecated plugin and compilation fields for v0.13 release (#30396)

2025-12-10 19:59:35 -08:00