.buildkite/scripts/hardware_ci/run-cpu-test.sh

#!/bin/bash

# This script build the CPU docker image and run the offline inference inside the container.
# It serves a sanity check for compilation and basic model usage.
set -euox pipefail

# allow to bind to different cores
CORE_RANGE=${CORE_RANGE:-48-95}
NUMA_NODE=${NUMA_NODE:-1}
IMAGE_NAME="cpu-test-$NUMA_NODE"
TIMEOUT_VAL=$1
TEST_COMMAND=$2

# building the docker image
echo "--- :docker: Building Docker image"
docker build --progress plain --tag "$IMAGE_NAME" --target vllm-test -f docker/Dockerfile.cpu .

# Run the image, setting --shm-size=4g for tensor parallel.
docker run --rm --cpuset-cpus="$CORE_RANGE" --cpuset-mems="$NUMA_NODE" -v ~/.cache/huggingface:/root/.cache/huggingface --privileged=true -e HF_TOKEN -e VLLM_CPU_KVCACHE_SPACE=16 -e VLLM_CPU_CI_ENV=1 -e VLLM_CPU_SIM_MULTI_NUMA=1 --shm-size=4g "$IMAGE_NAME" \
        timeout "$TIMEOUT_VAL" bash -c "set -euox pipefail; echo \"--- Print packages\"; pip list; echo \"--- Running tests\"; ${TEST_COMMAND}"
[CI/Build] Add shell script linting using shellcheck (#7925) Signed-off-by: Russell Bryant <rbryant@redhat.com> 2024-11-07 13:17:29 -05:00			`#!/bin/bash`

[Hardware][Intel] Add CPU inference backend (#3634) Co-authored-by: Kunshang Ji <kunshang.ji@intel.com> Co-authored-by: Yuan Zhou <yuan.zhou@intel.com> 2024-04-02 13:07:30 +08:00			`# This script build the CPU docker image and run the offline inference inside the container.`
			`# It serves a sanity check for compilation and basic model usage.`
[CI/Build] Parallelize CPU CI tests (#33778) Signed-off-by: jiang1.li <jiang1.li@intel.com> 2026-02-05 13:53:48 +08:00			`set -euox pipefail`
[Hardware][Intel] Add CPU inference backend (#3634) Co-authored-by: Kunshang Ji <kunshang.ji@intel.com> Co-authored-by: Yuan Zhou <yuan.zhou@intel.com> 2024-04-02 13:07:30 +08:00
[CI][CPU]refactor CPU tests to allow to bind with different cores (#10222) Signed-off-by: Yuan Zhou <yuan.zhou@intel.com> 2024-11-12 18:07:32 +08:00			`# allow to bind to different cores`
			`CORE_RANGE=${CORE_RANGE:-48-95}`
			`NUMA_NODE=${NUMA_NODE:-1}`
[CI/Build] Parallelize CPU CI tests (#33778) Signed-off-by: jiang1.li <jiang1.li@intel.com> 2026-02-05 13:53:48 +08:00			`IMAGE_NAME="cpu-test-$NUMA_NODE"`
			`TIMEOUT_VAL=$1`
			`TEST_COMMAND=$2`
[CI][CPU]refactor CPU tests to allow to bind with different cores (#10222) Signed-off-by: Yuan Zhou <yuan.zhou@intel.com> 2024-11-12 18:07:32 +08:00
[CI/Build] Parallelize CPU CI tests (#33778) Signed-off-by: jiang1.li <jiang1.li@intel.com> 2026-02-05 13:53:48 +08:00			`# building the docker image`
			`echo "--- :docker: Building Docker image"`
			`docker build --progress plain --tag "$IMAGE_NAME" --target vllm-test -f docker/Dockerfile.cpu .`
[CPU][CI] Improve CPU Dockerfile (#15690) Signed-off-by: jiang1.li <jiang1.li@intel.com> 2025-03-28 16:36:31 +08:00
[Hardware] [Intel] Enable Multiprocessing and tensor parallel in CPU backend and update documentation (#6125) 2024-07-27 04:50:10 +08:00			`# Run the image, setting --shm-size=4g for tensor parallel.`
[CI][BugFix] ShellCheck cleanup to remove baseline and preserve runtime behavior (#34514) Signed-off-by: junuxyz <216036880+junuxyz@users.noreply.github.com> 2026-02-17 21:22:56 +09:00			`docker run --rm --cpuset-cpus="$CORE_RANGE" --cpuset-mems="$NUMA_NODE" -v ~/.cache/huggingface:/root/.cache/huggingface --privileged=true -e HF_TOKEN -e VLLM_CPU_KVCACHE_SPACE=16 -e VLLM_CPU_CI_ENV=1 -e VLLM_CPU_SIM_MULTI_NUMA=1 --shm-size=4g "$IMAGE_NAME" \`
			`timeout "$TIMEOUT_VAL" bash -c "set -euox pipefail; echo \"--- Print packages\"; pip list; echo \"--- Running tests\"; ${TEST_COMMAND}"`