deepseek-v4-quant

Files

biondizzle f63eed5cfd Purge INT4 references — expert weights are FP4 (E2M1), not INT4

All docs and scripts updated. Historical memory entries annotated.

2026-05-08 02:33:46 +00:00

dequant_fp8_to_bf16.py

2026-05-08 02:25:43 +00:00

model_opt_nvfp4_experts_only.py

2026-05-07 16:34:37 +00:00

model_opt_nvfp4_full.py

2026-05-08 02:33:46 +00:00

run_modelopt_nvfp4.sh

2026-05-07 07:22:54 +00:00

upcast_to_bf16.py

2026-05-07 14:25:30 +00:00