vllm/tests/v1/tpu at 44fa4d556c37cf3538090960bb3e07654193df5e - vllm

Files

Nicolò Lucchesi b3f2fddd17 [TPU][V1] Fix exponential padding when max-num-batched-tokens is not a power of 2 (#16596)

Signed-off-by: NickLucche <nlucches@redhat.com>

2025-04-14 17:01:05 +00:00

__init__.py                 2025-03-08 08:19:38 -05:00
test_basic.py               2025-03-31 13:25:20 -04:00
test_mha_attn.py            2025-03-21 08:50:39 -07:00
test_pallas.py              2025-04-09 14:46:32 +08:00
test_perf.py                2025-03-31 13:25:20 -04:00
test_sampler.py             2025-04-10 17:05:44 -04:00
test_topk_topp_sampler.py   2025-04-02 17:18:08 -07:00