This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
78434b923c80e435bcae9ad846471a48d8e3bb4e
vllm
/
vllm
/
platforms
History
Md. Mekayel Anik
d521dcdbcc
docs: clarify SMT and OMP acronyms in CpuPlatform (
#39085
)
2026-04-07 17:42:07 -07:00
..
__init__.py
In-Tree AMD Zen CPU Backend via zentorch [1/N] (
#35970
)
2026-03-15 23:35:35 +00:00
cpu.py
docs: clarify SMT and OMP acronyms in CpuPlatform (
#39085
)
2026-04-07 17:42:07 -07:00
cuda.py
[1/N][Cleanup] Standardize on use of
is_quantized_kv_cache
(
#38659
)
2026-04-01 04:08:01 +00:00
interface.py
[vLLM IR] add
import_ir_kernels()
to support OOT platforms (
#38807
)
2026-04-03 17:25:19 +00:00
rocm.py
[NVFP4] Support NVFP4 dense models from
modelopt
and
compressed-tensors
on AMD Instinct MI300, MI355X and Hopper through emulation (
#35733
)
2026-04-06 16:18:27 -06:00
tpu.py
[Refactor][TPU] Remove torch_xla path and use tpu-inference (
#30808
)
2026-01-07 16:07:16 +08:00
xpu.py
[XPU] Initial support for GDN attention on Qwen3-next/Qwen3.5 (
#33657
)
2026-04-03 08:59:11 +08:00
zen_cpu.py
In-Tree AMD Zen CPU Backend via zentorch [1/N] (
#35970
)
2026-03-15 23:35:35 +00:00