This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
2263d44b688902aa3fd384ecdd7e3db3460b01e0
vllm
/
tests
/
evals
/
gsm8k
/
configs
/
moe-refactor
History
Robert Shaw
0fa8dd24d2
[Bugfix] Fix Typo from NVFP4 Refactor (
#31977
)
...
Signed-off-by: Robert Shaw <
robshaw@redhat.com
> Co-authored-by: Robert Shaw <
robshaw@redhat.com
>
2026-01-08 16:18:50 -08:00
..
config-b200.txt
[MoE Refactor][15/N] Apply Refactor to Fp8 (
#31415
)
2026-01-07 19:42:33 -05:00
config-h100.txt
[MoE Refactor][15/N] Apply Refactor to Fp8 (
#31415
)
2026-01-07 19:42:33 -05:00
config-test.txt
[Bugfix] Fix Typo from NVFP4 Refactor (
#31977
)
2026-01-08 16:18:50 -08:00
Llama-4-Scout-Fp8-CT-vllm-cutlass.yaml
[MoE Refactor][15/N] Apply Refactor to Fp8 (
#31415
)
2026-01-07 19:42:33 -05:00
Llama-4-Scout-Fp8-ModelOpt-fi-cutlass.yaml
…
Llama-4-Scout-Fp8-ModelOpt-fi-trtllm.yaml
…
Llama-4-Scout-Fp8-ModelOpt-marlin.yaml
…
Llama-4-Scout-Fp8-ModelOpt-triton.yaml
…
Mixtral-8x7B-Fp8-AutoFp8-fi-cutlass.yaml
…
Mixtral-8x7B-Fp8-AutoFp8-triton.yaml
…
Qwen3-30B-A3B-Fp8-AutoFp8-deepgemm.yaml
…
Qwen3-30B-A3B-Fp8-AutoFp8-fi-cutlass.yaml
…
Qwen3-30B-A3B-Fp8-AutoFp8-fi-trtllm.yaml
…
Qwen3-30B-A3B-Fp8-AutoFp8-marlin.yaml
…
Qwen3-30B-A3B-Fp8-AutoFp8-triton.yaml
…
Qwen3-30B-A3B-Fp8-CT-Block-deepgemm.yaml
…
Qwen3-30B-A3B-Fp8-CT-Block-fi-cutlass.yaml
…
Qwen3-30B-A3B-Fp8-CT-Block-marlin.yaml
…
Qwen3-30B-A3B-Fp8-CT-Block-triton.yaml
[MoE Refactor][15/N] Apply Refactor to Fp8 (
#31415
)
2026-01-07 19:42:33 -05:00
Qwen3-30B-A3B-Fp8-CT-Channel-marlin.yaml
…
Qwen3-30B-A3B-Fp8-CT-Channel-vllm-cutlass.yaml
…
Qwen3-30B-A3B-NvFp4-CT-fi-cutlass-dp-ep.yaml
…
Qwen3-30B-A3B-NvFp4-CT-fi-cutlass.yaml
…
Qwen3-30B-A3B-NvFp4-CT-fi-trtllm.yaml
…
Qwen3-30B-A3B-NvFp4-CT-marlin.yaml
…
Qwen3-30B-A3B-NvFp4-CT-vllm-cutlass.yaml
…
Qwen3-30B-A3B-NvFp4-ModelOpt-fi-cutlass-dp-ep.yaml
…
Qwen3-30B-A3B-NvFp4-ModelOpt-fi-cutlass.yaml
…
Qwen3-30B-A3B-NvFp4-ModelOpt-fi-trtllm.yaml
…
Qwen3-30B-A3B-NvFp4-ModelOpt-marlin.yaml
…
Qwen3-30B-A3B-NvFp4-ModelOpt-vllm-cutlass.yaml
…