vllm/csrc/cutlass_extensions at 2dfdfed8a0fe5517f8d4050740c251b1c1d35eeb - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Harry Mellor cf069aa8aa Update deprecated Python 3.8 typing (#13971 )

2025-03-02 17:34:51 -08:00

..

[Kernel][Build/CI] Bump CUTLASS to 3.8 and add initializers for cutlass epilogues (#13797 )

2025-02-25 18:52:03 -08:00

[Kernel] Update cutlass_scaled_mm to support 2d group (blockwise) scaling (#11868 )

2025-01-30 18:33:00 -08:00

common.cpp

[Kernel]: Cutlass 2:4 Sparsity + FP8/Int8 Quant Support (#10995 )

2024-12-18 09:57:16 -05:00

common.hpp

[Kernel] Update cutlass_scaled_mm to support 2d group (blockwise) scaling (#11868 )

2025-01-30 18:33:00 -08:00

cute_utils.cuh

[Kernel] Initial Machete W4A8 support + Refactors (#9855 )

2024-11-18 12:59:29 -07:00

torch_utils.hpp

[MISC] Replace c10::optional with std::optional (#11730 )

2025-01-05 10:20:34 +09:00

vllm_collective_builder.cuh

[Kernel] Update cutlass_scaled_mm to support 2d group (blockwise) scaling (#11868 )

2025-01-30 18:33:00 -08:00

vllm_custom_types.cuh

[Kernel] (1/N) Machete - Hopper Optimized Mixed Precision Linear Kernel (#7174 )

2024-08-20 07:09:33 -06:00

vllm_cutlass_library_extension.py

Update deprecated Python 3.8 typing (#13971 )

2025-03-02 17:34:51 -08:00

vllm_numeric_conversion.cuh

[Kernel] Initial Machete W4A8 support + Refactors (#9855 )

2024-11-18 12:59:29 -07:00

vllm_type_utils.cuh

[Kernel] Initial Machete W4A8 support + Refactors (#9855 )

2024-11-18 12:59:29 -07:00