Logo
Explore Help
Register Sign In
biondizzle/deepseek-v4-quant
1
0
Fork 0
You've already forked deepseek-v4-quant
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
17 Commits 4 Branches 0 Tags
b8bdd00d1965b0472f0039ef8a382902efc7f62c
Commit Graph

5 Commits

Author SHA1 Message Date
biondizzle
b8bdd00d19 Lower GPU max_memory to 100GiB, add CPU-only fallback for low_memory_mode 2026-05-07 02:49:24 +00:00
biondizzle
717151b98c Add CPU offloading and max_memory caps for FP8 model loading 2026-05-07 02:40:48 +00:00
biondizzle
aff12c6951 Fix forward_loop: pass as callable, not via create_forward_loop 2026-05-07 02:08:09 +00:00
biondizzle
492e44c0f6 Fix dataloader API: max_sample_length not seq_len, proper create_forward_loop 2026-05-07 02:04:54 +00:00
biondizzle
b32bb2e84d NVIDIA Model Optimizer branch: nvfp4_experts_only PTQ for DeepSeek V4 Pro 2026-05-07 00:11:31 +00:00
Powered by Gitea Version: 1.25.2 Page: 24ms Template: 2ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API