From 31f4489d1f33af80f368de3fa747d26f2343d6dc Mon Sep 17 00:00:00 2001 From: Rajesh Shashi Kumar <35628747+rajesh-s@users.noreply.github.com> Date: Wed, 24 Sep 2025 01:43:49 -0500 Subject: [PATCH] Update README.md --- README.md | 13 ++++++++++++- 1 file changed, 12 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 3054251..1afb0d2 100644 --- a/README.md +++ b/README.md @@ -1 +1,12 @@ -# containers \ No newline at end of file +# Building containers for GH200 + +Currently, prebuilt wheels for `vLLM` and `LMcache` are not available for `aarch64`. This can make setup tedious when working on modern `aarch64` platforms such as NVIDIA GH200. + +This repository provides a Dockerfile to build a container with vLLM and all its dependencies pre-installed to try out various things such as KV offloading. + +If you prefer not to build the image yourself, you can pull the ready-to-use image directly from Docker Hub: + +`docker pull rajesh550/gh200-vllm:0.10.2` + +👉 [Docker Hub](https://hub.docker.com/repository/docker/rajesh550/gh200-vllm/general) +