This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
6,265
Commits
2
Branches
140
Tags
e3d0a1d190678b49d541fea2bd3db8d3ce9f0430
Commit Graph
2 Commits
Author
SHA1
Message
Date
rasmith
e3d0a1d190
[Quantizaton] [AMD] Add support for running DeepSeek int8 w8a8 MoE on ROCm (
#17558
)
...
Signed-off-by: Randall Smith <
Randall.Smith@amd.com
>
2025-05-02 21:41:10 -07:00
Michael Goin
f41647ee6b
[Kernel] Support W8A8 channel-wise weights and per-token activations in triton fused_moe_kernel (
#16366
)
...
Signed-off-by: mgoin <
mgoin64@gmail.com
>
2025-04-11 17:54:08 +00:00