Logo
Explore Help
Register Sign In
biondizzle/deepseek-v4-quant
1
0
Fork 0
You've already forked deepseek-v4-quant
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
3d38e1d5cd0f84fbc351c676f64528a331929064
deepseek-v4-quant/scripts
History
biondizzle 3d38e1d5cd nvfp4_full: drop calib to 128, gpu_max_mem to 0.7 for VRAM headroom
2026-05-08 06:24:45 +00:00
..
dequant_fp8_to_bf16.py
Add resume capability to dequant script (skip already-done shards)
2026-05-08 02:58:24 +00:00
model_opt_nvfp4_experts_only.py
Update nvfp4_experts_only to use dequantized BF16 model
2026-05-07 16:34:37 +00:00
model_opt_nvfp4_full.py
nvfp4_full: drop calib to 128, gpu_max_mem to 0.7 for VRAM headroom
2026-05-08 06:24:45 +00:00
run_modelopt_nvfp4.sh
Add ModelOpt NVFP4 pipeline: patch, run script, README
2026-05-07 07:22:54 +00:00
upcast_to_bf16.py
Add BF16 upcast script and Blackwell DeepGEMM patch
2026-05-07 14:25:30 +00:00
Powered by Gitea Version: 1.25.2 Page: 65ms Template: 6ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API