vllm/csrc at d27f4bae393214b4e7715fc3cb5754d4bf801bce - vllm

Yanming W e0c6f556e8 [Build] Avoid building too many extensions (#1624)

2023-11-23 16:31:19 -08:00

2023-10-31 15:19:30 -07:00

Support SqueezeLLM (#1326)

2023-10-21 23:14:59 -07:00

activation_kernels.cu      2023-11-03 14:12:48 -07:00
cache_kernels.cu           2023-10-31 15:19:30 -07:00
cache.h                    2023-11-23 16:31:19 -08:00
cuda_utils_kernels.cu      2023-09-26 22:27:13 -07:00
cuda_utils.h               2023-11-23 16:31:19 -08:00
dispatch_utils.h           2023-09-02 14:59:47 +09:00
layernorm_kernels.cu       2023-11-18 18:18:02 -08:00
ops.h                      2023-11-23 16:31:19 -08:00
pos_encoding_kernels.cu    2023-11-03 14:12:48 -07:00
pybind.cpp                 2023-11-23 16:31:19 -08:00
reduction_utils.cuh        2023-06-17 03:07:40 -07:00