biondizzle/vllm
1cbbcfe8a334bab004c43a60be201c8ab528e0d2
vllm/tests/v1/spec_decode
Fynn Schmitt-Ulms  04bf5a35fa  [Spec Decode] Update extract_hidden_states to use deferred kv_connector clear (#37013)  2026-03-16 14:53:45 +01:00
..
__init__.py                    …
test_acceptance_length.py      [Hardware] Replace torch.cuda.device_count/current_device/set_device API (#36145)  2026-03-12 07:57:47 -07:00
test_eagle_step_kernel.py      feat(spec_decode): fuse EAGLE step slot mapping and metadata updates (#33503)  2026-03-11 04:35:33 +00:00
test_eagle.py                  Reapply [Attention] Refactor check_and_update_config (#35122)  2026-03-09 07:17:14 -07:00
test_extract_hidden_states.py  [Spec Decode] Update extract_hidden_states to use deferred kv_connector clear (#37013)  2026-03-16 14:53:45 +01:00
test_max_len.py                …
test_mtp.py                    [BugFix] Add support for MTP num_speculative_tokens > 1 with sparse MLA (#34552)  2026-03-03 07:21:57 -08:00
test_ngram.py                  …
test_speculators_eagle3.py     …
test_tree_attention.py         …