biondizzle/vllm
4965ec42d28830f0c30756dea19e14b45cdbe5b1
vllm/vllm/model_executor/layers/quantization/kernels
TJian 4965ec42d2 [FEAT] [ROCm] Add AITER int8 scaled gemm kernel (#15433)
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
2025-03-29 03:33:56 -07:00
..
mixed_precision  Revert "Fix non-contiguous input passed to Marlin kernel (#15319)" (#15398)  2025-03-24 20:43:51 -07:00
scaled_mm  [FEAT] [ROCm] Add AITER int8 scaled gemm kernel (#15433)  2025-03-29 03:33:56 -07:00
__init__.py  [TPU][Quantization] TPU W8A8 (#11785)  2025-01-08 19:33:29 +00:00