biondizzle/vllm / vllm/tests/v1/attention
Commit: 83004020fd2400f56df2686e8c55df3d9fc79b7b
Latest commit: c312320764 by Huamin Li (2025-10-17 21:11:26 -07:00)
[CI/Build] tests(v1): feed Triton attention the (num_blocks, 2, …) KV cache layout in backend-correctness tests (#26663)
Signed-off-by: Huamin Li <3ericli@gmail.com>
Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>
File | Last commit | Date
test_attention_backends_selection.py | Convert formatting to use ruff instead of yapf + isort (#26247) | 2025-10-05 07:06:22 -07:00
test_attention_backends.py | [CI/Build] tests(v1): feed Triton attention the (num_blocks, 2, …) KV cache layout in backend-correctness tests (#26663) | 2025-10-17 21:11:26 -07:00
test_attention_splitting.py | [Core] Simplify the Dp padding/should ubatch coordination logic (#25768) | 2025-10-07 01:57:49 +00:00
test_chunked_local_attention.py | Convert formatting to use ruff instead of yapf + isort (#26247) | 2025-10-05 07:06:22 -07:00
test_mla_backends.py | [Attention][Spec Decode] FlashMLA spec decode support (#26541) | 2025-10-14 19:38:20 +00:00
test_sparse_mla_backends.py | [Misc] Clean up cruft from previous FlashMLA sparse implementation (#26125) | 2025-10-08 10:09:34 +08:00
utils.py | [Chore] Separate out vllm.utils.importlib (#27022) | 2025-10-17 00:48:59 +00:00