This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
deepseek-v4-quant
Watch
1
Star
0
Fork
0
You've already forked deepseek-v4-quant
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
075da675dcc27a0e1c6a932ed778c8de697a2979
deepseek-v4-quant
/
scripts
History
biondizzle
075da675dc
fix: update HF token, echo it at runtime, export both HF_TOKEN and HUGGING_FACE_HUB_TOKEN
2026-05-08 16:57:32 +00:00
..
dequant_fp8_to_bf16.py
Add resume capability to dequant script (skip already-done shards)
2026-05-08 02:58:24 +00:00
model_opt_nvfp4_experts_only.py
Update nvfp4_experts_only to use dequantized BF16 model
2026-05-07 16:34:37 +00:00
model_opt_nvfp4_full.py
fix: update HF token, echo it at runtime, export both HF_TOKEN and HUGGING_FACE_HUB_TOKEN
2026-05-08 16:57:32 +00:00
run_modelopt_nvfp4.sh
Add ModelOpt NVFP4 pipeline: patch, run script, README
2026-05-07 07:22:54 +00:00
upcast_to_bf16.py
Add BF16 upcast script and Blackwell DeepGEMM patch
2026-05-07 14:25:30 +00:00