deepseek-v4-quant/.gitignore at f2656dcf6d4013adf8ee7e05ab5be36dc3ef4604 - deepseek-v4-quant - Gitea: Git with a cup of tea

biondizzle/deepseek-v4-quant

Files

biondizzle a0bacb3cf6 Replace shell wrapper with in-process quantize script

- New scripts/quantize_nvfp4.py: runs full ModelOpt pipeline in-process
- Saves calibrated state after calibration (insurance against export crashes)
- Patches modelopt for V4: ModuleList quantizers, stale GPU tensor safety
- --export-only flag to retry export from saved calibration state
- Removed old model_opt_nvfp4_full.py (shell wrapper)
- Updated README with new pipeline docs and bug #5/#6

2026-05-09 06:07:22 +00:00

11 lines

137 B

Plaintext

Raw Blame History

 # Dequantized BF16 weights (3TB)
 DeepSeek-V4-Pro-BF16/
 # Calibration state (huge, not for git)
 *.pt
 # Python
 __pycache__/
 *.pyc
 .venv/