vllm/csrc/quantization at 739b61a348afa5da297a80ff15f4e39d6e524b53 - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Alexander Matveev 396d92d5e0 [Kernel][Core] Add AWQ support to the Marlin kernel (#6612 )

2024-07-21 19:41:42 -04:00

..

[Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (#5047 )

2024-06-09 16:23:30 -04:00

[Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (#5047 )

2024-06-09 16:23:30 -04:00

compressed_tensors

[Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (#5047 )

2024-06-09 16:23:30 -04:00

[Kernel] Turn off CUTLASS scaled_mm for Ada Lovelace (#6384 )

2024-07-14 13:37:19 +00:00

[Kernel][Core] Add AWQ support to the Marlin kernel (#6612 )

2024-07-21 19:41:42 -04:00

[Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (#5047 )

2024-06-09 16:23:30 -04:00

[Kernel][Core] Add AWQ support to the Marlin kernel (#6612 )

2024-07-21 19:41:42 -04:00

[Kernel][Core] Add AWQ support to the Marlin kernel (#6612 )

2024-07-21 19:41:42 -04:00

[Kernel][Misc] Use TORCH_LIBRARY instead of PYBIND11_MODULE for custom ops (#5047 )

2024-06-09 16:23:30 -04:00