This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
4c69e228b32220ac9159dfdcf0df13ea776e630d
vllm
/
vllm
/
v1
/
spec_decode
History
Woosuk Kwon
99abb8b650
[V1][Spec Decode] Optimize Rejection Sampler with Triton Kernels (
#14930
)
...
Signed-off-by: Woosuk Kwon <
woosuk.kwon@berkeley.edu
>
2025-03-18 14:31:54 -07:00
..
__init__.py
[V1][BugFix] Add __init__.py to v1/spec_decode/ (
#13359
)
2025-02-16 09:39:08 -08:00
metadata.py
[V1][Spec Decode] Optimize Rejection Sampler with Triton Kernels (
#14930
)
2025-03-18 14:31:54 -07:00
ngram_proposer.py
[V1][Spec Decode] Optimize N-gram matching with Numba (
#13365
)
2025-02-18 13:19:58 -08:00
utils.py
[V1][Spec Decode] Optimize Rejection Sampler with Triton Kernels (
#14930
)
2025-03-18 14:31:54 -07:00