This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
822e250ab74899af4bc28aa5d738ec4c0e8c646e
vllm
/
vllm
/
v1
/
worker
/
gpu
/
spec_decode
History
Lucas Wilkinson
483463f735
[MRV2] Extensible CG dispatch rework (
#35959
)
...
Signed-off-by: Lucas Wilkinson <
lwilkins@redhat.com
>
2026-03-09 13:58:45 -07:00
..
eagle
[MRV2] Extensible CG dispatch rework (
#35959
)
2026-03-09 13:58:45 -07:00
__init__.py
[Model Runner V2] Support Eagle3 (no CUDA graph) (
#35029
)
2026-02-21 12:55:24 -08:00
rejection_sample.py
[Model Runner V2] Misc code simplification (
#35941
)
2026-03-04 15:26:35 -08:00
utils.py
[Core] Don't schedule spec tokens with prefill chunks (
#33652
)
2026-02-04 23:40:22 +00:00