This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
df1430265c0bda5fe02c43005352cce7a8aa9562
vllm
/
vllm
/
v1
/
sample
History
Hyesoo Yang
47195057e9
[V1][TPU] Speed up top-k on TPU by using torch.topk (
#15242
)
...
Signed-off-by: Hyesoo Yang <
hyeygit@gmail.com
>
2025-03-20 19:19:40 -07:00
..
ops
[V1][TPU] Speed up top-k on TPU by using torch.topk (
#15242
)
2025-03-20 19:19:40 -07:00
tpu
[V1][TPU] Support V1 Sampler for ragged attention (
#14227
)
2025-03-19 21:00:39 -07:00
__init__.py
[V1] Implement vLLM V1 [1/N] (
#9289
)
2024-10-22 01:24:07 -07:00
metadata.py
[V1] Support bad_words in sampler (
#13376
)
2025-03-08 14:50:26 -08:00
rejection_sampler.py
[V1][Spec Decode] Optimize Rejection Sampler with Triton Kernels (
#14930
)
2025-03-18 14:31:54 -07:00
sampler.py
[V1] Ensure using int64 for sampled token ids (
#15065
)
2025-03-18 23:52:19 -07:00