vllm/vllm/attention at b7036c87a13bd94fabf9e46436d3c1e67688f729

biondizzle/vllm

Cyrus Leung b665bbc2d4 [Chore] Migrate V0 attention utils (#31891)

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

2026-01-07 13:44:36 +00:00

..

[Chore] Migrate V0 attention utils (#31891)

2026-01-07 13:44:36 +00:00

[Refactor][TPU] Remove torch_xla path and use tpu-inference (#30808)

2026-01-07 16:07:16 +08:00

[Misc] Improve error messages for unsupported types and parameters (#30593)

2026-01-07 09:00:16 +00:00

[Attention][UX][1/N] Add AttentionConfig and change attention env vars to CLI arguments (#26315)

2025-12-05 09:48:43 -08:00

__init__.py

[Attention] Remove imports from vllm/attention/__init__.py (#29342)

2025-11-26 10:53:15 -07:00

layer.py

[MoE] Fix output_shape calculation in Attention layer to handle 3D query inputs (#31596)

2026-01-02 15:46:23 +00:00

selector.py

[Platform] Refactor Platform attention backend selection to avoid breakpoint for OOT platform (#30212)

2025-12-15 17:36:07 +00:00