vllm/tests/v1/worker at 7c04779afa7d0811dba3e1ec98c0ac1bc56570be - vllm

Files

Chenguang Zheng d765cf01fe [Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests (#22711 )

Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.io>

2025-08-25 00:41:17 -07:00

__init__.py

[V1] Adding min tokens/repetition/presence/frequence penalties to V1 sampler (#10681 )

2024-12-26 19:02:58 +09:00

test_gpu_input_batch.py

[Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests (#22711 )

2025-08-25 00:41:17 -07:00

test_gpu_model_runner.py

[Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests (#22711 )

2025-08-25 00:41:17 -07:00