vllm/benchmark at aa50b17ca776f8c69a793787d0ce06dfa4671884

biondizzle/vllm

Woosuk Kwon 0f4b32199e Support various block sizes & Change default block size to 16 (#38)

2023-04-15 09:03:24 -07:00

benchmark_attention.py

Add query stride to multi_query_cached_kv_attention & Add kernel benchmark script (#27)

2023-04-08 13:36:09 -07:00

benchmark_cache.py

Memcpy kernel for flash attention (#29)

2023-04-10 18:22:49 -07:00

benchmark_latency.py

Collect system stats in scheduler & Add scripts for experiments (#30)

2023-04-12 15:03:49 -07:00

benchmark_text_completion.py

Support various block sizes & Change default block size to 16 (#38)

2023-04-15 09:03:24 -07:00

trace.py

Collect system stats in scheduler & Add scripts for experiments (#30)

2023-04-12 15:03:49 -07:00