vllm/vllm/v1/attention at 545d18d81bf11761e51c2b11a006573c2ae366c1 - vllm

Files

JartX a40ee486f2 [Bugfix] Add Multiple of 16 block_size to triton fallback on rocm Attention to support qwen3_5 (#35923)

Signed-off-by: JartX <sagformas@epdcenter.es>
Co-authored-by: akaratza <akaratza@amd.com>
Co-authored-by: TJian <tunjian.tan@embeddedllm.com>

2026-03-11 07:45:57 +00:00

2026-03-10 09:14:35 -07:00

__init__.py

2024-10-22 01:24:07 -07:00

backend.py

2026-03-10 03:32:20 -07:00

selector.py

2026-03-09 07:17:14 -07:00