|
|
a0bcabac5a
|
NVFP4-everything: quantize all 2D Linear weights including attention and lm_head
|
2026-05-07 03:38:02 +00:00 |
|
|
|
c40607053b
|
Fix remaining gate_proj/up_proj -> w1/w3 references in paired_names
|
2026-05-07 00:05:55 +00:00 |
|
|
|
771e42cef3
|
Fix expert pair dict keys: w1/w3 not gate_proj/up_proj
|
2026-05-07 00:05:25 +00:00 |
|
|
|
5f35a5d2b3
|
Gracefully handle missing scale tensors (BF16 weights with stale index entries)
|
2026-05-07 00:04:29 +00:00 |
|
|
|
4470653e15
|
Fix V4 tensor naming: .scale companions, w1/w3 expert pairs, ffn.gate, hc_* preserve
|
2026-05-07 00:03:20 +00:00 |
|
|
|
2b7f063e39
|
7 commit
|
2026-05-06 23:51:54 +00:00 |
|
|
|
be16bd023e
|
sixth commit
|
2026-05-06 23:50:51 +00:00 |
|
|
|
97e7638abc
|
sixth commit
|
2026-05-06 23:49:34 +00:00 |
|
|
|
75503a1190
|
fifth commit
|
2026-05-06 23:49:02 +00:00 |
|
|
|
2eeeefcf8f
|
fourth commit
|
2026-05-06 23:48:38 +00:00 |
|
|
|
31a4302ab6
|
third commit
|
2026-05-06 23:48:25 +00:00 |
|
|
|
18ba8e057f
|
second commit
|
2026-05-06 23:47:38 +00:00 |
|
|
|
4708cdebb2
|
init commit
|
2026-05-06 23:47:07 +00:00 |
|