docs(vcr): traceability for parallel perf tracks B (clamp lowering) + C (R10 pool) (#242)#282
Merged
Merged
Conversation
…l, C) — traceability leads (#242) Per the measured ROI reframing (#209): the allocator (VCR-RA-001) is the north-star but register-pool-constrained, so its win is modest. gale's #209 decomposition puts the biggest single chunk in clamp lowering (18 IT-blocks vs native 6 — VCR-SEL-002, track B), and freeing R10 when bounds-checking is off is a cheap +1-register pressure multiplier (VCR-RA-002, track C). B + C run in parallel with A, disjoint code, gated by RESULT-identity (flight_seam 0x07FDF307) + measured delta, not byte-identity. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Codecov Report✅ All modified and coverable lines are covered by tests. 📢 Thoughts on this report? Let us know! |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Traceability leads — parallel ROI tracks for the codegen perf work
Per the measured reframing on #209: the allocator (VCR-RA-001, track A) is the north-star but register-pool-constrained, so its measured win is modest. This files the two complementary, parallelizable levers gale's #209 decomposition surfaced:
[-127,127]lower to 18 IT-blocks vs native's 6 — gale's biggest single chunk. ARMSSATdoesn't apply ([-127,127] ≠ [-128,127]); the lever is a tighterSelect/conditional-move lowering. Disjoint from A.BoundsCheckConfig::{Software,Mpu,Masking}(verified). InNonemode it's a wasted reserve — a cheap +1 register (pool 9→10). Composes with A (which reads the pool from config).Parallelism: B, C, A touch disjoint code. B/C are optimizations (bytes change), so the gate is result-identity (
flight_seam 0x07FDF307) + measured delta, not byte-identity — which removes the only would-be conflict (no shared fixture re-freeze). Bothproposed; verification artifacts land with the oracle-gated code.rivet-clean (only the expected
proposed-requirement WARN, no ERROR).🤖 Generated with Claude Code