Signed-off-by: Huamin Li <3ericli@gmail.com>
vllm.utils.platform_utils.py
GPUModelRunner.prepare_kernel_block_sizes
vllm/attention
vllm.utils.mem_utils