biondizzle/vllm
245e4f2c01d19a567742fee4117badf1f6027da0
vllm/tests/v1/attention
Huamin Li c312320764 [CI/Build] tests(v1): feed Triton attention the (num_blocks, 2, …) KV cache layout in backend-correctness tests (#26663)
Signed-off-by: Huamin Li <3ericli@gmail.com>
Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>
2025-10-17 21:11:26 -07:00
test_attention_backends_selection.py
Convert formatting to use ruff instead of yapf + isort (#26247)
2025-10-05 07:06:22 -07:00
test_attention_backends.py
[CI/Build] tests(v1): feed Triton attention the (num_blocks, 2, …) KV cache layout in backend-correctness tests (#26663)
2025-10-17 21:11:26 -07:00
test_attention_splitting.py
[Core] Simplify the Dp padding/should ubatch coordination logic (#25768)
2025-10-07 01:57:49 +00:00
test_chunked_local_attention.py
Convert formatting to use ruff instead of yapf + isort (#26247)
2025-10-05 07:06:22 -07:00
test_mla_backends.py
[Attention][Spec Decode] FlashMLA spec decode support (#26541)
2025-10-14 19:38:20 +00:00
test_sparse_mla_backends.py
[Misc] Clean up cruft from previous FlashMLA sparse implementation (#26125)
2025-10-08 10:09:34 +08:00
utils.py
[Chore] Separate out vllm.utils.importlib (#27022)
2025-10-17 00:48:59 +00:00