This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
2e9034c998e231634a75ab74c5812e0dba2cf3a1
vllm
/
vllm
/
model_executor
/
kernels
/
linear
History
Maral
2e9034c998
[W8A8 Block Linear Refactor][2/N] Remove W8A8Fp8BlockLinearOp and adopt Fp8 block linear kernel selections. (
#33892
)
...
Signed-off-by: maral <
maralbahari.98@gmail.com
> Signed-off-by: Maral <
maralbahari.98@gmail.com
>
2026-04-09 08:50:39 +08:00
..
mixed_precision
[CPU] Support CT W4A16 on CPU MP kernel (
#38219
)
2026-03-27 14:15:28 +08:00
scaled_mm
[W8A8 Block Linear Refactor][2/N] Remove W8A8Fp8BlockLinearOp and adopt Fp8 block linear kernel selections. (
#33892
)
2026-04-09 08:50:39 +08:00
__init__.py
[W8A8 Block Linear Refactor][2/N] Remove W8A8Fp8BlockLinearOp and adopt Fp8 block linear kernel selections. (
#33892
)
2026-04-09 08:50:39 +08:00
base.py
[W8A8 Block Linear Refactor][2/N] Remove W8A8Fp8BlockLinearOp and adopt Fp8 block linear kernel selections. (
#33892
)
2026-04-09 08:50:39 +08:00