Files
vllm/docs/models/hardware_supported_models/cpu.md
Louie Tsai 719ac592ed Update CPU doc according to feedback (#32963)
Signed-off-by: Tsai, Louie <louie.tsai@intel.com>
Signed-off-by: Louie Tsai <louie.tsai@intel.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2026-01-24 16:02:44 +00:00

2.0 KiB

CPU - Intel® Xeon®

Validated Hardware

Hardware
Intel® Xeon® 6 Processors
Intel® Xeon® 5 Processors

Text-only Language Models

Model Architecture Supported
meta-llama/Llama-3.1-8B-Instruct LlamaForCausalLM
meta-llama/Llama-3.2-3B-Instruct LlamaForCausalLM
ibm-granite/granite-3.2-2b-instruct GraniteForCausalLM
Qwen/Qwen3-1.7B Qwen3ForCausalLM
Qwen/Qwen3-4B Qwen3ForCausalLM
Qwen/Qwen3-8B Qwen3ForCausalLM
zai-org/glm-4-9b-hf GLMForCausalLM
google/gemma-7b GemmaForCausalLM

Multimodal Language Models

Model Architecture Supported
Qwen/Qwen2.5-VL-7B-Instruct Qwen2VLForConditionalGeneration
openai/whisper-large-v3 WhisperForConditionalGeneration

Runs and optimized.
🟨 Runs and correct but not optimized to green yet.
Does not pass accuracy test or does not run.