feat: add S7 pinned-inspection scenario with H9-H12 hypotheses by blove · Pull Request #1 · cacheplane/pretable

blove · 2026-04-20T21:45:17Z

Summary

Add S7 benchmark scenario (40 cols, 3 pinned left, 3 wrapped, variable-height, multilingual) with same row counts as S2
Wire S7 into bench app query parsing, bench-runner validation, and matrix runner
Refactor evaluateH1/H6-H8 to accept explicit scenarioId parameter; add H9-H12 thin wrappers evaluating S7 with identical thresholds
29 bench-matrix tests passing (7 new for H9-H12), plus new tests in scenario-data, query-state, and bench-runner

Test plan

CI passes (test, typecheck, lint, format, build)
Dev-scale matrix run with S7: pnpm bench:matrix -- --project=chromium --adapters=pretable --scenarios=S7 --scripts=scroll,sort,filter-metadata,filter-text --scale=dev --repeats=3
Hypothesis-scale comparative: pnpm bench:matrix -- --project=chromium --adapters=pretable,gridalpha,gridbeta,gridgamma --scenarios=S7 --scripts=scroll --scale=hypothesis --repeats=3
Inspect *.hypotheses.json: H9-H12 present, 9 hypotheses total

New benchmark scenario with 3 pinned columns on variable-height inspection content, plus H9-H12 hypotheses mirroring S2's proof surface. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

6 tasks: scenario-data definition, bench app wiring, bench-runner validation, hypothesis refactor (scenarioId param), H9-H12 tests, full verification. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

40 cols, 3 pinned left, 3 wrapped, variable-height, multilingual. Same row counts as S2. Exercises pinned-column layout overhead.

…tion scripts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

evaluateH1, evaluateH6-H8 now accept explicit scenarioId. Add H9-H12 wrappers for S7. Report grows from 5 to 9 hypotheses. Add S7 to DEFAULT_SCENARIOS.

Tests cover satisfied, failing, and insufficient states for composite scroll quality (H9) and interaction hypotheses (H10-H12) on the pinned-inspection scenario. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Remove dead `compareValues` function from derived-rows.ts and replace destructuring-based key removal with delete to avoid unused variable lint error. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Suppress react-hooks/refs and react-hooks/set-state-in-effect false positives for legitimate patterns (sync ref updates for callbacks, DOM measurement in useLayoutEffect). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* chore(bench): high-repeat S2/scroll milestone for B2 follow-up perf diagnostic * docs(research): pretable vs MUI scroll perf diagnostic memo Phase C of B2 follow-up #1. Verdict: gap is noise. The high-repeat S2/hypothesis/scroll rerun shows no meaningful MUI advantage and recommends tightening H1-sensitive repeat protocol instead of scoping a perf-fix PR. Spec: docs/superpowers/specs/2026-05-09-b2-followup-perf-diagnostic-design.md Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * chore(bench): format perf diagnostic artifacts --------- Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>

blove and others added 10 commits April 20, 2026 13:16

docs: add pinned-column inspection scenario (S7) design spec

0d9f450

New benchmark scenario with 3 pinned columns on variable-height inspection content, plus H9-H12 hypotheses mirroring S2's proof surface. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

docs: add pinned-inspection scenario (S7) implementation plan

7f1946c

6 tasks: scenario-data definition, bench app wiring, bench-runner validation, hypothesis refactor (scenarioId param), H9-H12 tests, full verification. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(scenario-data): add S7 pinned-inspection scenario

c043a0d

40 cols, 3 pinned left, 3 wrapped, variable-height, multilingual. Same row counts as S2. Exercises pinned-column layout overhead.

feat(bench): wire S7 into query parsing and type system

aa9646e

feat(bench-runner): allow S7 in P0a validation for scroll and interac…

4306d73

…tion scripts Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

refactor(bench-matrix): parameterize hypothesis evaluation by scenarioId

ed7dc5f

evaluateH1, evaluateH6-H8 now accept explicit scenarioId. Add H9-H12 wrappers for S7. Report grows from 5 to 9 hypotheses. Add S7 to DEFAULT_SCENARIOS.

test(bench-matrix): add H9-H12 test coverage for S7 hypotheses

6ee8471

Tests cover satisfied, failing, and insufficient states for composite scroll quality (H9) and interaction hypotheses (H10-H12) on the pinned-inspection scenario. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

style: format files with prettier

440e509

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

fix(grid-core): resolve pre-existing lint errors

314ea22

Remove dead `compareValues` function from derived-rows.ts and replace destructuring-based key removal with delete to avoid unused variable lint error. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

blove merged commit e0bb24c into main Apr 20, 2026
5 checks passed

blove mentioned this pull request May 1, 2026

feat(website): landing content rewrite — performance + AI-era for product leaders #38

Merged

5 tasks

blove mentioned this pull request May 9, 2026

docs(research): pretable vs MUI scroll perf diagnostic #124

Merged

7 tasks

blove mentioned this pull request May 13, 2026

docs(research): pretable wrapped-text filter perf diagnostic #142

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add S7 pinned-inspection scenario with H9-H12 hypotheses#1

feat: add S7 pinned-inspection scenario with H9-H12 hypotheses#1
blove merged 10 commits into
mainfrom
feat/pinned-inspection-scenario

blove commented Apr 20, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

blove commented Apr 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

blove commented Apr 20, 2026 •

edited

Loading