biondizzle/vllm
vllm/csrc/quantization/cutlass_w8a8/moe @ 61059bee40511b6f6c044053cf921da81cf89985
Latest commit: 61059bee40 by Chiyue Wei, 2025-06-05 09:48:26 -07:00
[Hardware][NVIDIA] FP4 MoE kernel optimization (#19110)
...
Signed-off-by: Chiyue Wei <chiyuew@nvidia.com>
Co-authored-by: Chiyue Wei <chiyuew@nvidia.com>
get_group_starts.cuh  [Kernel] CUTLASS grouped gemm fp8 MoE kernel (#13972)  2025-03-27 00:54:44 +00:00
grouped_mm_c3x.cu     [Kernel] CUTLASS grouped gemm fp8 MoE kernel (#13972)  2025-03-27 00:54:44 +00:00
grouped_mm_c3x.cuh    [Kernel] CUTLASS grouped gemm fp8 MoE kernel (#13972)  2025-03-27 00:54:44 +00:00
moe_data.cu           [Hardware][NVIDIA] FP4 MoE kernel optimization (#19110)  2025-06-05 09:48:26 -07:00