GPUModelRunner.prepare_kernel_block_sizes
Signed-off-by: NickLucche <nlucches@redhat.com>
vllm.utils.platform_utils.py
vllm/attention
vllm.utils.mem_utils