biondizzle/vllm
fa3bba2a538de76c630f75de160bfb43e4e1cd4b
vllm/tests/kernels/test_int8_kernel.py
Michael Goin
f41647ee6b
[Kernel] Support W8A8 channel-wise weights and per-token activations in triton fused_moe_kernel (#16366)
...
Signed-off-by: mgoin <mgoin64@gmail.com>
2025-04-11 17:54:08 +00:00
5.1 KiB