biondizzle/vllm
c55d8046723325e09521a24ac076a8a7e64eaa52
vllm/tests/tpu
Chengji Yao a1cc9f33a3 [TPU] remove transpose ops in moe kernel (#18923)
Signed-off-by: Chengji Yao <chengjiyao@google.com>
2025-05-29 23:00:11 +00:00
Name | Last commit | Date
lora | [Hardware][TPU][V1] Multi-LoRA Optimisations for the V1 TPU backend (#15655) | 2025-05-28 19:59:09 +00:00
__init__.py | [torch.compile] avoid Dynamo guard evaluation overhead (#7898) | 2024-08-28 16:10:12 -07:00
test_compilation.py | [TPU][V1] Refine tpu_model_runner to mitigate future recompilation issues (#16275) | 2025-04-09 18:51:51 -06:00
test_custom_dispatcher.py | [V1] TPU - Fix CI/CD runner (#14974) | 2025-03-17 21:07:07 +00:00
test_moe_pallas.py | [TPU] remove transpose ops in moe kernel (#18923) | 2025-05-29 23:00:11 +00:00
test_quantization_accuracy.py | Correct capitalisation: VLLM -> vLLM (#14562) | 2025-03-10 16:36:21 +00:00