This file is the single source of truth for autonomous maintenance. Read by Claude on every /loop iteration.
Phase 0 — Benchmark (EVERY iteration)
Run the quality scorecard first. If any metric regresses, fix it before doing anything else.
bunx biome check packages/api/src/ 2>&1 | tail -1 # lint
bunx tsc --noEmit -p packages/api/tsconfig.json 2>&1 | wc -l # typecheck (0 = clean)
(cd packages/api && bunx vitest run 2>&1 | grep -E 'passed|failed') # tests
(cd packages/dashboard && bun run build 2>&1 | grep -c 'prerendered') # dashboard build
curl -s -o /dev/null -w '%{http_code}' https://agentstate.app/api # live API (200)
curl -s -o /dev/null -w '%{time_total}' https://agentstate.app/api # response time (<0.1s)
gh run list --repo duyet/agentstate --workflow CI --limit 1 --json conclusion -q '.[0].conclusion' # CI
git status --short # clean tree
Quality Targets
| Metric | Target |
| --- | --- |
| Lint errors | 0 |
| Type errors | 0 |
| Tests | 100% pass |
| Dashboard build | clean static export |
| API status | 200 |
| API latency | <100ms |
| CI | success |
| Working tree | clean |
| Dead code | 0 unused exports |
| Security | 0 hardcoded secrets |
| Docs | match actual code |
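One way to turn the numeric targets into a pass/fail check is sketched below. The function name `check_targets` and its argument order are hypothetical and not part of this repo. It assumes the six values were already collected by the Phase 0 commands, and it covers only the targets that reduce to a number or a string.

```shell
#!/usr/bin/env bash
# Hypothetical helper: compare measured values against the Quality Targets.
# Arguments (assumed to come from the Phase 0 commands):
#   $1 lint errors   $2 type errors   $3 failed tests
#   $4 HTTP status   $5 latency in seconds   $6 CI conclusion
check_targets() {
  local lint=$1 types=$2 failed=$3 status=$4 latency=$5 ci=$6
  local bad=0

  [ "$lint" -eq 0 ]     || { echo "REGRESSION lint errors: $lint"; bad=1; }
  [ "$types" -eq 0 ]    || { echo "REGRESSION type errors: $types"; bad=1; }
  [ "$failed" -eq 0 ]   || { echo "REGRESSION failed tests: $failed"; bad=1; }
  [ "$status" = "200" ] || { echo "REGRESSION API status: $status"; bad=1; }
  # The shell cannot compare decimals with [ ], so delegate to awk.
  awk -v t="$latency" 'BEGIN { exit !(t + 0 < 0.1) }' \
                        || { echo "REGRESSION API latency: ${latency}s"; bad=1; }
  [ "$ci" = "success" ] || { echo "REGRESSION CI: $ci"; bad=1; }

  # Exit status 0 means every target was met, 1 means at least one regressed.
  return "$bad"
}
```

A call such as `check_targets 0 0 0 200 0.042 success` prints nothing and returns 0. Each regressed metric prints one `REGRESSION` line.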
Save scores to memory after each run.
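This file does not say what form "memory" takes. As a minimal sketch, assuming a plain tab-separated log file is acceptable, each run could append one timestamped row. The function name `save_scores` and the file path passed to it are hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: append one scorecard row to a TSV log.
#   $1    path of the log file (an assumption, not defined by this repo)
#   $2..  metric values in a fixed order chosen by the caller
save_scores() {
  local file=$1
  shift
  local stamp
  stamp=$(date -u +%Y-%m-%dT%H:%M:%SZ)   # UTC timestamp for the run

  printf '%s' "$stamp" >> "$file"
  printf '\t%s' "$@" >> "$file"          # format is reused for every value
  printf '\n' >> "$file"
}
```

Keeping one row per run makes it possible to compare the latest row with the previous one and spot a regression.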
Phase 1 — Health check (run directly)
bunx biome check packages/api/src/
bunx tsc --noEmit -p packages/api/tsconfig.json
cd packages/api && bunx vitest run
curl -s https://agentstate.app/api
gh run list --repo duyet/agentstate --workflow CI --limit 1
git status --short
If anything fails → fix immediately before Phase 2.
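The "fix immediately" rule can be enforced with a fail-fast wrapper. The sketch below is an illustration, and `run_checks` is a hypothetical name. It runs each command string in order and stops at the first non-zero exit status.

```shell
#!/usr/bin/env bash
# Hypothetical helper: run each check in order and stop at the first failure.
# Every argument is one command line, executed with sh -c.
run_checks() {
  local cmd
  for cmd in "$@"; do
    echo "==> $cmd"
    if ! sh -c "$cmd"; then
      echo "FAILED: $cmd (fix before Phase 2)"
      return 1
    fi
  done
  echo "All checks passed"
}

# Example invocation from the repo root (not executed here):
#   run_checks \
#     "bunx biome check packages/api/src/" \
#     "bunx tsc --noEmit -p packages/api/tsconfig.json" \
#     "(cd packages/api && bunx vitest run)" \
#     "curl -sf -o /dev/null https://agentstate.app/api" \
#     "test -z \"\$(git status --short)\""
```

Two of the Phase 1 commands exit 0 even when the check should fail. `curl -s` succeeds on HTTP error responses, and `git status --short` succeeds on a dirty tree. The example therefore uses `curl -sf` and wraps `git status` in `test -z` so the wrapper can detect both cases.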