This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
deepseek-v4-quant
Watch
1
Star
0
Fork
0
You've already forked deepseek-v4-quant
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
f63eed5cfd9c14c759ff3a38874049d26cc7012e
deepseek-v4-quant
/
scripts
History
biondizzle
f63eed5cfd
Purge INT4 references — expert weights are FP4 (E2M1), not INT4
...
All docs and scripts updated. Historical memory entries annotated.
2026-05-08 02:33:46 +00:00
..
dequant_fp8_to_bf16.py
Fix: expert weights are FP4 (E2M1), not INT4 - verified with nibble analysis
2026-05-08 02:25:43 +00:00
model_opt_nvfp4_experts_only.py
Update nvfp4_experts_only to use dequantized BF16 model
2026-05-07 16:34:37 +00:00
model_opt_nvfp4_full.py
Purge INT4 references — expert weights are FP4 (E2M1), not INT4
2026-05-08 02:33:46 +00:00
run_modelopt_nvfp4.sh
Add ModelOpt NVFP4 pipeline: patch, run script, README
2026-05-07 07:22:54 +00:00
upcast_to_bf16.py
Add BF16 upcast script and Blackwell DeepGEMM patch
2026-05-07 14:25:30 +00:00