vllm/cmake at 93f71673ce1a6cd4ac6217c6ca8f7a74c920bcc0 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Lucas Wilkinson c7852a6d9b [Build] Allow shipping PTX on a per-file basis (#18155 )

Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>

2025-05-15 16:41:55 -07:00

..

external_projects

[Perf]Optimize rotary_emb implementation to use Triton operator for improved inference performance (#16457 )

2025-04-25 14:52:28 +08:00

cpu_extension.cmake

[Hardware][Power] Enable compressed tensor W8A8 INT8 quantization for POWER (#17153 )

2025-05-07 22:35:03 -07:00

hipify.py

[Misc] Fix improper placement of SPDX header in scripts (#12694 )

2025-02-03 11:16:59 -08:00

utils.cmake

[Build] Allow shipping PTX on a per-file basis (#18155 )

2025-05-15 16:41:55 -07:00