This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
28a6d5423db63ba9c4df13608f6151a484bdb7c9
vllm
/
vllm
/
model_executor
/
layers
/
quantization
/
kernels
History
Michael Goin
28a6d5423d
[Bugfix] Fix Machete zero point issue for GPTQ models on SM90 (
#21066
)
...
Signed-off-by: mgoin <
mgoin64@gmail.com
>
2025-07-16 19:54:45 -07:00
..
mixed_precision
[Bugfix] Fix Machete zero point issue for GPTQ models on SM90 (
#21066
)
2025-07-16 19:54:45 -07:00
scaled_mm
Use w8a8 quantized matmul Pallas kernel (
#19170
)
2025-07-15 03:06:33 +00:00
__init__.py
[TPU][Quantization] TPU
W8A8
(
#11785
)
2025-01-08 19:33:29 +00:00