biondizzle/vllm
738d0a281fab2e151a67b370c26b4e4360362f8f
vllm/tests/v1/e2e
Benjamin Chislett  8a680463fa  [Bugfix] Fix NemotronH MTP + Chunked Prefill (#35447)  2026-03-17 07:07:33 +01:00
general                         [ROCm][CI] Retrying in case of batch variance effects and reducing flakiness (#36442)  2026-03-16 16:08:51 +08:00
spec_decode                     [CI] Split V1 e2e + engine (1 GPU) into separate jobs (#36945)  2026-03-13 14:16:02 -07:00
__init__.py                     [V1] Implement Cascade Attention (#11635)  2025-01-01 21:56:46 +09:00
test_hybrid_chunked_prefill.py  [Bugfix] Fix NemotronH MTP + Chunked Prefill (#35447)  2026-03-17 07:07:33 +01:00