vllm/vllm/platforms at 78434b923c80e435bcae9ad846471a48d8e3bb4e - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Md. Mekayel Anik d521dcdbcc docs: clarify SMT and OMP acronyms in CpuPlatform (#39085 )

2026-04-07 17:42:07 -07:00

..

__init__.py

In-Tree AMD Zen CPU Backend via zentorch [1/N] (#35970 )

2026-03-15 23:35:35 +00:00

cpu.py

docs: clarify SMT and OMP acronyms in CpuPlatform (#39085 )

2026-04-07 17:42:07 -07:00

cuda.py

[1/N][Cleanup] Standardize on use of is_quantized_kv_cache (#38659 )

2026-04-01 04:08:01 +00:00

interface.py

[vLLM IR] add import_ir_kernels() to support OOT platforms (#38807 )

2026-04-03 17:25:19 +00:00

rocm.py

[NVFP4] Support NVFP4 dense models from modelopt and compressed-tensors on AMD Instinct MI300, MI355X and Hopper through emulation (#35733 )

2026-04-06 16:18:27 -06:00

tpu.py

[Refactor][TPU] Remove torch_xla path and use tpu-inference (#30808 )

2026-01-07 16:07:16 +08:00

xpu.py

[XPU] Initial support for GDN attention on Qwen3-next/Qwen3.5 (#33657 )

2026-04-03 08:59:11 +08:00

zen_cpu.py

In-Tree AMD Zen CPU Backend via zentorch [1/N] (#35970 )

2026-03-15 23:35:35 +00:00