This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
14,087
Commits
2
Branches
140
Tags
5719a4e4e601fb91274294d25370b7aad656d629
Commit Graph
2 Commits
Author
SHA1
Message
Date
Roberto L. Castro
fcb9df99bd
[Perf][Kernel] Optimize FP4 quantization kernels (SM100F) (
#32520
)
...
Signed-off-by: LopezCastroRoberto <
rocastro@redhat.com
>
2026-01-24 18:45:27 -07:00
Michael Goin
06d490282f
[NVFP4][Perf] Tune NVFP4 input quant kernel for small batch size (
#30897
)
...
Signed-off-by: mgoin <
mgoin64@gmail.com
>
2025-12-21 09:41:57 -08:00