vllm/tests/v1/spec_decode at 9fc81ec765aa0daa6f704023c0f902a0da653b72 - vllm

Files

Jialin Ouyang 186352b270 [Core] Performance: Use list[np.ndarray] instead of list[list[int]] for output tokens for GC optimization (#26368)

Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>

2025-11-14 16:04:04 -08:00

test_eagle.py

2025-11-14 16:04:04 -08:00

test_max_len.py

2025-11-08 19:44:25 +00:00

test_mtp.py

2025-11-11 07:40:44 -05:00

test_ngram.py

2025-11-14 16:04:04 -08:00

test_speculators_eagle3.py

2025-10-29 00:54:21 -07:00

test_tree_attention.py

2025-11-11 07:40:44 -05:00