This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
4,258
Commits
2
Branches
140
Tags
68ad4e3a8d8a66fb2a43be57471ee13a8bec4ec0
Commit Graph
3 Commits
Author
SHA1
Message
Date
kliuae
7c25fe45a6
[AMD] Add support for GGUF quantization on ROCm (
#10254
)
2024-11-22 21:14:49 -08:00
Isotr0py
fc990f9795
[Bugfix][Kernel] Add
IQ1_M
quantization implementation to GGUF kernel (
#8357
)
2024-09-15 16:51:44 -06:00
Isotr0py
360bd67cf0
[Core] Support loading GGUF model (
#5191
)
...
Co-authored-by: Michael Goin <
michael@neuralmagic.com
>
2024-08-05 17:54:23 -06:00