vllm/vllm/v1/worker at 0578e5a462dff347ee475913da7c2f91f60c9bc3 - vllm

Files

Chengji Yao 0578e5a462 [Hardware][TPU]Enable ragged paged attention kernel and resolve recompilation issue (#14310 )

Signed-off-by: Chengji Yao <chengjiyao@google.com>

2025-03-06 23:31:05 +00:00

__init__.py

2024-10-22 01:24:07 -07:00

block_table.py

2025-03-02 17:34:51 -08:00

gpu_input_batch.py

2025-03-05 17:10:13 -08:00

gpu_model_runner.py

2025-03-05 17:10:13 -08:00

gpu_worker.py

2025-03-02 17:34:51 -08:00

lora_model_runner_mixin.py

2025-03-02 17:34:51 -08:00

tpu_model_runner.py

2025-03-06 23:31:05 +00:00

tpu_worker.py

2025-03-04 19:58:48 -05:00

worker_base.py

2025-02-18 12:33:45 +08:00