biondizzle/deepseek-v4-quant
d88793dee6714dd266959a460fa8fbe5e8962cbc
deepseek-v4-quant/scripts
Latest commit: 0d74b97fb2 by biondizzle, 2026-05-10 08:23:11 +00:00
Config patches doc + compress_ratios runtime patch in serve script
dequant_fp8_to_bf16.py    Add resume capability to dequant script (skip already-done shards)    2026-05-08 02:58:24 +00:00
quantize_nvfp4.py    8 patches covering full export chain - no more whack-a-mole    2026-05-09 22:50:58 +00:00
serve_vllm.py    Config patches doc + compress_ratios runtime patch in serve script    2026-05-10 08:23:11 +00:00