This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
f242cfcdd5f1db4e005503a02a1317369d2a8e3d
vllm
/
tests
/
v1
/
spec_decode
History
rasmith
3999442f1c
[CI/Build][AMD] Add check for flash_att_varlen_func to test_tree_attention.py (
#29252
)
...
Signed-off-by: Randall Smith <
ransmith@amd.com
> Co-authored-by: Randall Smith <
ransmith@amd.com
>
2025-11-23 04:45:08 +00:00
..
test_eagle.py
Revert "[Redo]
#26368
(
#28771
)" (
#29121
)
2025-11-20 21:27:45 -08:00
test_max_len.py
[Bugfix] Spec decode + structured output + spec model max len edge case (
#28298
)
2025-11-08 19:44:25 +00:00
test_mtp.py
Add support for Eagle with separate lm-head and embed_tokens layers (
#28549
)
2025-11-15 06:12:02 -08:00
test_ngram.py
Revert "[Redo]
#26368
(
#28771
)" (
#29121
)
2025-11-20 21:27:45 -08:00
test_speculators_eagle3.py
[Speculators] Move tests + fix integration (
#27308
)
2025-10-29 00:54:21 -07:00
test_tree_attention.py
[CI/Build][AMD] Add check for flash_att_varlen_func to test_tree_attention.py (
#29252
)
2025-11-23 04:45:08 +00:00