biondizzle
  • Joined on 2025-12-10
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 08:19:45 +00:00
1e77dfcaa0 Fix prompt encoding: remove \n\n before content per official DSV4 spec; add --chat-mode
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 08:18:57 +00:00
2a42686e8e Test 1 v2: diff hand-rolled vs official DSV4 encoding
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 08:18:02 +00:00
11c2d5fe53 Add degeneration test 2: falsify mHC residual growth root cause
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 08:17:19 +00:00
c77b83fffc Add degeneration test 1: chat-template token-ID diff
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 08:14:12 +00:00
c5a131c358 more doc clean up again
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 08:12:56 +00:00
019a3a34b7 Clean up L0 B1 verify noise (gate on VERBOSE), update FINAL_STRETCH.md
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 07:56:32 +00:00
5e09be08af Fix non-contiguous tensor in quantize_nvfp4_gpu_fused (T>1 prefill)
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 07:39:41 +00:00
60309ef124 Batched prefill: replace T=1 token-by-token with chunked T≤128 batch processing
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 07:37:17 +00:00
0bf276f8c9 more doc cleanup
d463ac8512 doc cleanup
Compare 2 commits »
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 07:31:04 +00:00
7450ebc67a CORRECTNESS_BACKLOG.md: comprehensive production pipeline verification results — all tested and confirmed findings from PART A diagnostics
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 07:03:52 +00:00
9dbfac9dfa PART A: verify kv_norm_w loaded correctly
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 06:57:25 +00:00
a682c6adf4 PART A: add raw compressor output diagnostic
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 06:34:08 +00:00
f2c1b3afd5 PART A: fix KV diagnostics — compute q_a before indexer, add Q_heads magnitude check
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 06:25:38 +00:00
86e59c16c5 PART A: add KV gather diagnostics at blowup layer
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 06:10:34 +00:00
262f844e2e PART A: add detailed blowup diagnostics — capture mHC intermediate values when |X| > 1e6
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 05:41:34 +00:00
6459fbca9a fix: import forward_attention
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 05:33:25 +00:00
91dfac34d8 PART A: simplified to production-only diagnostics — track per-layer |X| during prefill and decode, detect blowup early
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 05:23:02 +00:00
d99503732d fix: add BF16 gate weight fallback for dense routers (missing from test)
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 05:15:54 +00:00
801bfc9a83 add router mode debug print
biondizzle pushed to master at biondizzle/nvfp4-megamoe-kernel 2026-06-03 05:06:42 +00:00
b385ecc05e PART A: decode diagnostics test — production vs reference per-layer X comparison at decode step