diff --git a/AGENTS.md b/AGENTS.md index 097f7f0..e799b95 100644 --- a/AGENTS.md +++ b/AGENTS.md @@ -92,31 +92,21 @@ should carry this so any vendor's agent surfaces pending items at session start: /full-eval /design /adjudicate`; subagents incl. `design-checker` + `onboarding-tester` (substance in `guides/`, thin wrappers in `adapters/claude/`, symlinked into `~/.claude`). Dogfoods its own standard. Latest `/roundup`: `STATUS.md` 2026-06-16. -- **`/adjudicate` built this session (ROADMAP item 10), live.** Debates each parked P2/P3 backlog - item on a repo's ROADMAP to a verdict so the owner ratifies instead of researching: per item, - investigator (grounds it in the code + classifies blast radius) → build- ∥ drop-advocate → judge - (rubric = `how-i-work.md` + repo `AGENTS.md`, biased to DROP on ties). Verdicts: **DROP** (auto, - ratified in one batch), **DO** (low blast radius → annotated plan, recommend-only), **ESCALATE** - (HIGH blast radius / low confidence → balanced brief for the owner). Autonomy gated by blast - radius, not priority; ROADMAP-only (nudges `/triage` first, never reads raw inbox). **v1 is - recommend-only** — never executes; v2 (narrow auto-execution of the safe DO class) deferred until - trust is built. -- **`onboarding-tester` built this session (ROADMAP item 9), live.** Docs-only adopter agent: walks - a product's published docs as a literal newcomer (never reading source), reports doc gaps, and on - a fully clean run emits a publishable "all it took was X, Y, Z" walkthrough. First target: keysat - SDK integration, **staged** — Stage 1 (Path 1, manual issuance under keysat's new `merchant-onboard` - key, `d5885d1`) is **unblocked**, runs in a keysat session; Stage 2 (Path 2, buyer-pays on regtest) - is **gated** on keysat's greenlit `payment_providers:write` scope + network gate + sandbox flag. -- **Design system (ROADMAP item 8) shipped** — `/design` → `design/DESIGN.md` + DTCG tokens; - `design-checker`; `/new-project` scaffolds `design/`. Pilot: keysat import (Case A). **Done:** - the first **Case B (Extract) run on recap** — document-as-is; it exercised the previously-untested - extract→reconcile path, the contract was distilled, and the conformance cleanup shipped in two - phases (recap app 0.2.161, 2026-06-17). Extract-phase Phase-D learnings landed in - `guides/design.md`; cleanup-execution learnings in `guides/design-checker.md`. **Still open - (decoupled):** a fresh Claude Design run to confirm export internals + tune Phase-C — deliberately - kept out of the recap run. -- **Next steps:** (1) run the Stage-1 `onboarding-tester` harness in a keysat session (item 9); - (2) cross-repo quality-gate standard + `/harden` (item 1); (3) non-git-folder sweep under `~/Projects` (~13). +- **`/adjudicate` shipped this session (ROADMAP item 10).** Debates parked P2/P3 backlog to + DROP/DO/ESCALATE verdicts; recommend-only v1, autonomy gated by blast radius, ROADMAP-only. + **Untested on a real backlog** — the first run should eyeball the judge's drop-bias before + trusting it (tune the tie-break rule in `guides/adjudicate.md` if it over-drops). Best first + targets: keysat, recap. +- **`onboarding-tester` live; first harness still pending (item 9).** Stage 1 (Path 1, manual + issuance under keysat's `merchant-onboard` key) is **unblocked** — build the harness + first run + in a keysat session. Stage 2 (Path 2, buyer-pays on regtest) is **gated** on keysat's + `payment_providers:write` scope + network gate + sandbox flag. +- **Design system live (item 8).** recap Case B (Extract) backfill done (app 0.2.161). **Open + (decoupled):** a fresh Claude Design run to confirm export internals + tune Phase-C. +- **Next steps:** (1) first real `/adjudicate` run on keysat/recap to calibrate the drop-bias + (item 10); (2) Stage-1 `onboarding-tester` harness in a keysat session (item 9); (3) cross-repo + quality-gate standard + `/harden` (item 1); (4) non-git-folder sweep under `~/Projects` (item 6 + residual). - Queued in `INBOX.md` for other repos' `/triage`: keysat design cleanup (P2) + onboarding Path-2 (P3); `ten31-transcripts` mini-retrofit; `ten31-database` networking/icon/intake; (standards) operator-onboarding agent (P3).