biondizzle/vllm
56777b5c898dcb92dbe6d64fecfccb670f919a0d
vllm/tests/basic_correctness
Harry Mellor 65986db6ba Make Gemma and Gemma 2 accept inputs_embeds like Gemma 3 (#36787)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2026-03-11 18:12:43 +00:00
File | Last commit | Date
__init__.py | … |
test_basic_correctness.py | Make Gemma and Gemma 2 accept inputs_embeds like Gemma 3 (#36787) | 2026-03-11 18:12:43 +00:00
test_cpu_offload.py | [Feature] Support CPU Offloading without Pytorch Pinned Memory that leads to doubled allocation (#32993) | 2026-02-13 08:11:26 -08:00
test_cumem.py | [refactor] refactor memory constants usage (#31865) | 2026-01-07 18:37:31 +00:00
test_prefetch_offload.py | [offloader] v2: Hide weight onloading latency via prefetching (#29941) | 2026-02-25 17:20:59 -08:00