From 572bdd2840b57f267ea4e7379f3c24c3318e176a Mon Sep 17 00:00:00 2001
From: biondizzle
Date: Wed, 3 Jun 2026 09:01:02 +0000
Subject: [PATCH] auto: pre-test commit

---
 TEMP/CROSS_REFERENCE.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/TEMP/CROSS_REFERENCE.md b/TEMP/CROSS_REFERENCE.md
index 58c5751b..7affd75f 100644
--- a/TEMP/CROSS_REFERENCE.md
+++ b/TEMP/CROSS_REFERENCE.md
@@ -1,6 +1,6 @@
 # vLLM Independent-Reference Cross-Checks (DSV4)

-**Purpose:** catch the architectural bugs your cosine backlog is blind to. Every check in `CORRECTNESS_BACKLOG.md` compares your kernel to your *own* PyTorch reference — if both share a wrong assumption (RoPE position, RoPE style, norm ordering, compression overlap), they agree at cos 0.9999 while both being wrong. vLLM's DSV4 is an **independent** implementation; it can't share your blind spots. Use it as the tiebreaker.
+**Purpose:** catch the architectural bugs your cosine backlog is blind to. Every check in `/home/openclaw/dev/nvfp4-megamoe-kernel/DEGENERATION_TESTS.md` and `/home/openclaw/dev/nvfp4-megamoe-kernel/CORRECTNESS_BACKLOG.md` compares your kernel to your *own* PyTorch reference — if both share a wrong assumption (RoPE position, RoPE style, norm ordering, compression overlap), they agree at cos 0.9999 while both being wrong. vLLM's DSV4 is an **independent** implementation; it can't share your blind spots. Use it as the tiebreaker.

 **How to use this:** these are mostly *static code diffs*, not GPU runs — fast. For each check, find the line in `single_shot_inference.py` (or `dsv4/`), find the cited line in the vLLM tree, and record match / mismatch + the fix.
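
The blind spot described in the **Purpose** paragraph can be illustrated with a toy sketch. It is not taken from the kernel, from `single_shot_inference.py`, or from vLLM. The function names (`rope_half_split`, `rope_interleaved`, `cosine`) are hypothetical, and treating the interleaved style as the "independent" answer is an assumption made only for illustration. A kernel and its own reference that share one RoPE pairing style agree at cosine near 1.0, while an implementation using the other style disagrees sharply on the same input.

```python
import math
import random


def rope_half_split(x, pos, base=10000.0):
    """Toy RoPE that rotates pairs (x[i], x[i + d/2]). Hypothetical helper."""
    d = len(x)
    half = d // 2
    out = list(x)
    for i in range(half):
        theta = pos * base ** (-2.0 * i / d)
        c, s = math.cos(theta), math.sin(theta)
        a, b = x[i], x[i + half]
        out[i] = a * c - b * s
        out[i + half] = a * s + b * c
    return out


def rope_interleaved(x, pos, base=10000.0):
    """Toy RoPE that rotates adjacent pairs (x[2i], x[2i + 1]). Hypothetical helper."""
    d = len(x)
    out = list(x)
    for i in range(d // 2):
        theta = pos * base ** (-2.0 * i / d)
        c, s = math.cos(theta), math.sin(theta)
        a, b = x[2 * i], x[2 * i + 1]
        out[2 * i] = a * c - b * s
        out[2 * i + 1] = a * s + b * c
    return out


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


rng = random.Random(0)
x = [rng.gauss(0.0, 1.0) for _ in range(64)]
pos = 17

# Assumption for this sketch: the reference and the kernel share one RoPE style.
own_reference = rope_half_split(x, pos)
# Stand-in for the kernel: same math as the reference plus small numeric noise.
kernel = [v + rng.gauss(0.0, 1e-3) for v in own_reference]
# Stand-in for an independent implementation that uses the other style.
independent = rope_interleaved(x, pos)

cos_self = cosine(kernel, own_reference)
cos_indep = cosine(kernel, independent)
print(f"kernel vs own reference: {cos_self:.6f}")
print(f"kernel vs independent:   {cos_indep:.6f}")
```

The self-comparison passes any cosine threshold because both sides make the same choice. Only the comparison against the independent implementation exposes the disagreement, which is the reason for using an outside tree as the tiebreaker.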