This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
61e381dcf01f25cc8a006ecf0ba9c31dde662b42
vllm
/
tests
/
v1
/
e2e
History
Lucas Wilkinson
e1d85e5c24
[Attention] Support distinguishing between short extends and decodes (
#37303
)
...
Signed-off-by: Lucas Wilkinson <
lwilkins@redhat.com
>
2026-03-20 10:49:36 -07:00
..
general
[ROCm][CI] Retrying in case of batch variance effects and reducing flakiness (
#36442
)
2026-03-16 16:08:51 +08:00
spec_decode
[CI] Split V1 e2e + engine (1 GPU) into separate jobs (
#36945
)
2026-03-13 14:16:02 -07:00
__init__.py
[V1] Implement Cascade Attention (
#11635
)
2025-01-01 21:56:46 +09:00
test_hybrid_chunked_prefill.py
[Attention] Support distinguishing between short extends and decodes (
#37303
)
2026-03-20 10:49:36 -07:00