vllm/benchmarks/kernels at 9013e24f7b09a19405c6856b88c004afd4e3fc57 - vllm

Files

History

Cyrus Leung 7e7eae338d [Misc] Standardize RoPE handling for Qwen2-VL (#9250 )

2024-10-16 13:56:17 +08:00

benchmark_aqlm.py

2024-06-20 17:00:13 -06:00

benchmark_layernorm.py

2024-09-18 10:38:11 +00:00

benchmark_machete.py

2024-09-23 13:46:26 -04:00

benchmark_marlin.py

2024-08-02 13:51:58 -07:00

benchmark_moe.py

2024-09-18 10:38:11 +00:00

benchmark_paged_attention.py

2024-09-18 10:38:11 +00:00

benchmark_quant.py

2024-09-18 10:38:11 +00:00

benchmark_rope.py

2024-10-16 13:56:17 +08:00

benchmark_shapes.py

2024-05-16 09:36:49 -04:00

graph_machete_bench.py

2024-09-18 11:00:56 +00:00

requirements.txt

2024-09-23 13:46:26 -04:00

weight_shapes.py

2024-08-20 07:09:33 -06:00