This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
0 Followers
·
0 Following
Joined on
2025-12-10
Block a user
Blocking a user prevents them from interacting with repositories, such as opening or commenting on pull requests or issues. Learn more about blocking a user.
User to block:
Optional note:
The note is not visible to the blocked user.
Cancel
Block
Repositories
25
Projects
Packages
Public Activity
Starred Repositories
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 08:19:45 +00:00
1e77dfcaa0
Fix prompt encoding: remove \n\n before content per official DSV4 spec; add --chat-mode
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 08:18:57 +00:00
2a42686e8e
Test 1 v2: diff hand-rolled vs official DSV4 encoding
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 08:18:02 +00:00
11c2d5fe53
Add degeneration test 2: falsify mHC residual growth root cause
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 08:17:19 +00:00
c77b83fffc
Add degeneration test 1: chat-template token-ID diff
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 08:14:12 +00:00
c5a131c358
more doc clean up again
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 08:12:56 +00:00
019a3a34b7
Clean up L0 B1 verify noise (gate on VERBOSE), update FINAL_STRETCH.md
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 07:56:32 +00:00
5e09be08af
Fix non-contiguous tensor in quantize_nvfp4_gpu_fused (T>1 prefill)
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 07:39:41 +00:00
60309ef124
Batched prefill: replace T=1 token-by-token with chunked T≤128 batch processing
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 07:37:17 +00:00
0bf276f8c9
more doc cleanup
d463ac8512
doc cleanup
Compare 2 commits »
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 07:31:04 +00:00
7450ebc67a
CORRECTNESS_BACKLOG.md: comprehensive production pipeline verification results — all tested and confirmed findings from PART A diagnostics
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 07:03:52 +00:00
9dbfac9dfa
PART A: verify kv_norm_w loaded correctly
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 06:57:25 +00:00
a682c6adf4
PART A: add raw compressor output diagnostic
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 06:34:08 +00:00
f2c1b3afd5
PART A: fix KV diagnostics — compute q_a before indexer, add Q_heads magnitude check
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 06:25:38 +00:00
86e59c16c5
PART A: add KV gather diagnostics at blowup layer
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 06:10:34 +00:00
262f844e2e
PART A: add detailed blowup diagnostics — capture mHC intermediate values when |X| > 1e6
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 05:41:34 +00:00
6459fbca9a
fix: import forward_attention
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 05:33:25 +00:00
91dfac34d8
PART A: simplified to production-only diagnostics — track per-layer |X| during prefill and decode, detect blowup early
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 05:23:02 +00:00
d99503732d
fix: add BF16 gate weight fallback for dense routers (missing from test)
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 05:15:54 +00:00
801bfc9a83
add router mode debug print
biondizzle
pushed to
master
at
biondizzle/nvfp4-megamoe-kernel
2026-06-03 05:06:42 +00:00
b385ecc05e
PART A: decode diagnostics test — production vs reference per-layer X comparison at decode step
First
Previous
...
4
5
6
7
8
...
Next
Last