.buildkite/scripts/hardware_ci/run-cpu-test-ppc64le.sh

#!/bin/bash

# This script build the CPU docker image and run the offline inference inside the container.
# It serves a sanity check for compilation and basic model usage.
set -ex

# Setup cleanup
remove_docker_container() {
  if [[ -n "$container_id" ]]; then
      podman stop --all -t0
      podman rm -f "$container_id" || true
  fi
  podman system prune -f
}
trap remove_docker_container EXIT
remove_docker_container

# Try building the docker image
podman build -t cpu-test-ubi9-ppc -f docker/Dockerfile.ppc64le .

# Run the image
container_id=$(podman run -itd --entrypoint /bin/bash -v /tmp/:/root/.cache/huggingface --privileged=true --network host -e HF_TOKEN cpu-test-ubi9-ppc)

function cpu_tests() {

  # offline inference
  podman exec -it "$container_id" bash -c "
    export TORCH_COMPILE_DISABLE=1
    set -xve
    python3 examples/offline_inference/basic/generate.py --model facebook/opt-125m" >> "$HOME"/test_basic.log

  # Run basic model test
  podman exec -it "$container_id" bash -c "
    export TORCH_COMPILE_DISABLE=1
    set -evx
    pip install pytest pytest-asyncio einops peft Pillow soundfile transformers_stream_generator matplotlib
    pip install sentence-transformers datamodel_code_generator tblib 

    # Note: disable Bart until supports V1
    # pytest -v -s tests/models/language/generation/test_bart.py -m cpu_model
    pytest -v -s tests/models/language/generation/test_common.py::test_models[False-False-5-32-openai-community/gpt2]
    pytest -v -s tests/models/language/generation/test_common.py::test_models[False-False-5-32-facebook/opt-125m]
    pytest -v -s tests/models/language/generation/test_common.py::test_models[False-False-5-32-google/gemma-1.1-2b-it]
    pytest -v -s tests/models/language/pooling/test_classification.py::test_models[float-jason9693/Qwen2.5-1.5B-apeach]
    # TODO: Below test case tests/models/language/pooling/test_embedding.py::test_models[True-ssmits/Qwen2-7B-Instruct-embed-base] fails on ppc64le. Disabling it for time being.
    # pytest -v -s tests/models/language/pooling/test_embedding.py -m cpu_model" >> "$HOME"/test_rest.log
}

# All of CPU tests are expected to be finished less than 40 mins.

export container_id
export -f cpu_tests
timeout 120m bash -c cpu_tests
[CI/Build] Add shell script linting using shellcheck (#7925) Signed-off-by: Russell Bryant <rbryant@redhat.com> 2024-11-07 13:17:29 -05:00			`#!/bin/bash`

ppc64le: Dockerfile fixed, and a script for buildkite (#8026) 2024-09-07 23:48:40 +05:30			`# This script build the CPU docker image and run the offline inference inside the container.`
			`# It serves a sanity check for compilation and basic model usage.`
			`set -ex`

			`# Setup cleanup`
Updating builkite job for IBM Power (#17111) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-24 22:36:17 +05:30			`remove_docker_container() {`
			`if [[ -n "$container_id" ]]; then`
Fixed ppc build when it runs on non-RHEL based linux distros (#18422) Signed-off-by: Nishidha Panpaliya <nishidha.panpaliya@partner.ibm.com> Signed-off-by: Md. Shafi Hussain <Md.Shafi.Hussain@ibm.com> Signed-off-by: npanpaliya <nishidha.panpaliya@partner.ibm.com> Co-authored-by: Md. Shafi Hussain <Md.Shafi.Hussain@ibm.com> 2025-06-07 00:24:26 +05:30			`podman stop --all -t0`
Updating builkite job for IBM Power (#17111) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-24 22:36:17 +05:30			`podman rm -f "$container_id" \|\| true`
			`fi`
			`podman system prune -f`
			`}`
ppc64le: Dockerfile fixed, and a script for buildkite (#8026) 2024-09-07 23:48:40 +05:30			`trap remove_docker_container EXIT`
			`remove_docker_container`

For ppc64le, disabled tests for now and addressed space issues (#10538) 2024-11-23 15:03:53 +05:30			`# Try building the docker image`
Adding vllm buildkite job for IBM Power (#16679) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-17 08:17:47 +05:30			`podman build -t cpu-test-ubi9-ppc -f docker/Dockerfile.ppc64le .`

			`# Run the image`
Updating builkite job for IBM Power (#17111) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-24 22:36:17 +05:30			`container_id=$(podman run -itd --entrypoint /bin/bash -v /tmp/:/root/.cache/huggingface --privileged=true --network host -e HF_TOKEN cpu-test-ubi9-ppc)`
Adding vllm buildkite job for IBM Power (#16679) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-17 08:17:47 +05:30
			`function cpu_tests() {`

			`# offline inference`
Updating builkite job for IBM Power (#17111) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-24 22:36:17 +05:30			`podman exec -it "$container_id" bash -c "`
Update Dockerfile to use gcc-toolset-14 and fix test case failures on power (ppc64le) (#28957) Signed-off-by: Bhagyashri <Bhagyashri.Gaikwad2@ibm.com> 2025-11-21 17:54:09 +05:30			`export TORCH_COMPILE_DISABLE=1`
[CI/Build] Fix ppc64le CPU build and tests (#22443) Signed-off-by: Nishidha Panpaliya <nishidha.panpaliya@partner.ibm.com> 2025-10-11 10:34:42 +05:30			`set -xve`
[CI][BugFix] ShellCheck cleanup to remove baseline and preserve runtime behavior (#34514) Signed-off-by: junuxyz <216036880+junuxyz@users.noreply.github.com> 2026-02-17 21:22:56 +09:00			`python3 examples/offline_inference/basic/generate.py --model facebook/opt-125m" >> "$HOME"/test_basic.log`
Adding vllm buildkite job for IBM Power (#16679) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-17 08:17:47 +05:30
			`# Run basic model test`
Updating builkite job for IBM Power (#17111) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-24 22:36:17 +05:30			`podman exec -it "$container_id" bash -c "`
Update Dockerfile to use gcc-toolset-14 and fix test case failures on power (ppc64le) (#28957) Signed-off-by: Bhagyashri <Bhagyashri.Gaikwad2@ibm.com> 2025-11-21 17:54:09 +05:30			`export TORCH_COMPILE_DISABLE=1`
[CI/Build] Fix ppc64le CPU build and tests (#22443) Signed-off-by: Nishidha Panpaliya <nishidha.panpaliya@partner.ibm.com> 2025-10-11 10:34:42 +05:30			`set -evx`
Adding vllm buildkite job for IBM Power (#16679) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-17 08:17:47 +05:30			`pip install pytest pytest-asyncio einops peft Pillow soundfile transformers_stream_generator matplotlib`
Update Dockerfile to use gcc-toolset-14 and fix test case failures on power (ppc64le) (#28957) Signed-off-by: Bhagyashri <Bhagyashri.Gaikwad2@ibm.com> 2025-11-21 17:54:09 +05:30			`pip install sentence-transformers datamodel_code_generator tblib`
[CI/Build] Fix ppc64le CPU build and tests (#22443) Signed-off-by: Nishidha Panpaliya <nishidha.panpaliya@partner.ibm.com> 2025-10-11 10:34:42 +05:30
			`# Note: disable Bart until supports V1`
			`# pytest -v -s tests/models/language/generation/test_bart.py -m cpu_model`
Update Dockerfile to use gcc-toolset-14 and fix test case failures on power (ppc64le) (#28957) Signed-off-by: Bhagyashri <Bhagyashri.Gaikwad2@ibm.com> 2025-11-21 17:54:09 +05:30			`pytest -v -s tests/models/language/generation/test_common.py::test_models[False-False-5-32-openai-community/gpt2]`
			`pytest -v -s tests/models/language/generation/test_common.py::test_models[False-False-5-32-facebook/opt-125m]`
			`pytest -v -s tests/models/language/generation/test_common.py::test_models[False-False-5-32-google/gemma-1.1-2b-it]`
Correcting testcases in builkite job for IBM Power (#17675) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-05-12 13:41:55 +05:30			`pytest -v -s tests/models/language/pooling/test_classification.py::test_models[float-jason9693/Qwen2.5-1.5B-apeach]`
[CI/Build] Fix ppc64le CPU build and tests (#22443) Signed-off-by: Nishidha Panpaliya <nishidha.panpaliya@partner.ibm.com> 2025-10-11 10:34:42 +05:30			`# TODO: Below test case tests/models/language/pooling/test_embedding.py::test_models[True-ssmits/Qwen2-7B-Instruct-embed-base] fails on ppc64le. Disabling it for time being.`
[CI][BugFix] ShellCheck cleanup to remove baseline and preserve runtime behavior (#34514) Signed-off-by: junuxyz <216036880+junuxyz@users.noreply.github.com> 2026-02-17 21:22:56 +09:00			`# pytest -v -s tests/models/language/pooling/test_embedding.py -m cpu_model" >> "$HOME"/test_rest.log`
Adding vllm buildkite job for IBM Power (#16679) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-17 08:17:47 +05:30			`}`

			`# All of CPU tests are expected to be finished less than 40 mins.`
Updating builkite job for IBM Power (#17111) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-24 22:36:17 +05:30
			`export container_id`
Adding vllm buildkite job for IBM Power (#16679) Signed-off-by: Aaruni Aggarwal <aaruniagg@gmail.com> 2025-04-17 08:17:47 +05:30			`export -f cpu_tests`
[CI/Build] Fix ppc64le CPU build and tests (#22443) Signed-off-by: Nishidha Panpaliya <nishidha.panpaliya@partner.ibm.com> 2025-10-11 10:34:42 +05:30			`timeout 120m bash -c cpu_tests`
[CI/Build] Adding timeout in CPU CI to avoid CPU test queue blocking (#6892) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> 2024-11-09 11:27:11 +08:00