vllm/tests/v1/spec_decode at f05d474c8a08659cc1610a85de7e7a7095494a52 - vllm

Files

Jialin Ouyang 186352b270 [Core] Performance: Use list[np.ndarray] instead of list[list[int]] for output tokens for GC optimization (#26368)

Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>

2025-11-14 16:04:04 -08:00

test_eagle.py                2025-11-14 16:04:04 -08:00
test_max_len.py              2025-11-08 19:44:25 +00:00
test_mtp.py                  2025-11-11 07:40:44 -05:00
test_ngram.py                2025-11-14 16:04:04 -08:00
test_speculators_eagle3.py   2025-10-29 00:54:21 -07:00
test_tree_attention.py       2025-11-11 07:40:44 -05:00