biondizzle/deepseek-v4-quant
f63eed5cfd9c14c759ff3a38874049d26cc7012e
deepseek-v4-quant/scripts
biondizzle f63eed5cfd Purge INT4 references — expert weights are FP4 (E2M1), not INT4
All docs and scripts updated. Historical memory entries annotated.
2026-05-08 02:33:46 +00:00
dequant_fp8_to_bf16.py
Fix: expert weights are FP4 (E2M1), not INT4 - verified with nibble analysis
2026-05-08 02:25:43 +00:00
model_opt_nvfp4_experts_only.py
Update nvfp4_experts_only to use dequantized BF16 model
2026-05-07 16:34:37 +00:00
model_opt_nvfp4_full.py
Purge INT4 references — expert weights are FP4 (E2M1), not INT4
2026-05-08 02:33:46 +00:00
run_modelopt_nvfp4.sh
Add ModelOpt NVFP4 pipeline: patch, run script, README
2026-05-07 07:22:54 +00:00
upcast_to_bf16.py
Add BF16 upcast script and Blackwell DeepGEMM patch
2026-05-07 14:25:30 +00:00
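
The commit messages above state that the expert weights are FP4 (E2M1) rather than INT4. As a rough illustration of why the same 4-bit pattern reads differently under the two interpretations, here is a minimal Python sketch. It is not taken from the repository's scripts: the function names are hypothetical, and the low-nibble-first packing order is an assumption that may not match the actual checkpoint layout.

```python
# Illustrative only: decode one 4-bit value as FP4 (E2M1) versus signed INT4.
# E2M1 layout: 1 sign bit, 2 exponent bits (bias 1), 1 mantissa bit.

def decode_e2m1(nibble: int) -> float:
    """Interpret a 4-bit pattern as an FP4 E2M1 value."""
    sign = -1.0 if nibble & 0x8 else 1.0
    exponent = (nibble >> 1) & 0x3
    mantissa = nibble & 0x1
    if exponent == 0:
        # Subnormal: no implicit leading 1, so the value is 0 or 0.5.
        magnitude = 0.5 * mantissa
    else:
        magnitude = (1.0 + 0.5 * mantissa) * 2.0 ** (exponent - 1)
    return sign * magnitude


def decode_int4(nibble: int) -> int:
    """Interpret the same 4-bit pattern as a two's-complement INT4 value."""
    return nibble - 16 if nibble & 0x8 else nibble


def unpack_nibbles(byte: int) -> tuple[int, int]:
    """Split a byte into (low, high) nibbles. The ordering is an assumption."""
    return byte & 0xF, (byte >> 4) & 0xF


if __name__ == "__main__":
    for n in range(16):
        print(f"{n:04b}  e2m1={decode_e2m1(n):5.1f}  int4={decode_int4(n):3d}")
```

Under E2M1 the representable magnitudes are 0, 0.5, 1, 1.5, 2, 3, 4, and 6, whereas INT4 covers the integers from -8 to 7, so the distribution of nibble values in a tensor can hint at which encoding was used.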