Files
Keysat 637ac3e7c2 handoff: prune Current state to lean session status
Drop the finished build narratives (durable record lives in ROADMAP
items 8/9/10), fix the stale "built this session" on onboarding-tester,
and surface the /adjudicate first-run drop-bias calibration as the top
next step.
2026-06-17 22:50:40 -05:00

113 lines
6.9 KiB
Markdown

# Standards — AGENTS.md
The home of my agent-operating standards **and** the live global fleet that serves them.
This repo is both the documentation of how any coding agent works in my repos and the
actual source of the global agents, commands, and `how-i-work.md` that every project
inherits. Edit here and every repo on this machine feels it.
For a human-facing tour of the four standards documents, read `README.md` — this file is
the agent-facing orientation and does not restate it.
> **Inbox check (this repo):** At session start, if `INBOX.md` has unchecked items tagged
> `(standards)`, surface them before proposing next steps; triage with `/triage`.
## What this repo is the source of
The global layer lives here and is wired into `~/.claude` by **directory symlinks**, so a
file added under `adapters/` is live immediately — no per-file linking:
- `~/.claude/commands``adapters/claude/commands/` — global slash commands (`/retrofit`,
`/handoff`, `/full-eval`, `/capture`, `/triage`, `/roundup`, `/new-project`, `/design`,
`/adjudicate`).
- `~/.claude/agents``adapters/claude/agents/` — global subagents (reviewer, evaluator,
security-auditor, doc-auditor, exerciser, researcher, janitor, portability-checker,
start9-spec-checker, design-checker, onboarding-tester).
- `~/.claude/CLAUDE.md``how-i-work.md` — my universal preferences, loaded every session.
(Distinct from this repo's *root* `CLAUDE.md`, which → `AGENTS.md`: same filename, different
scopes — global preferences vs. this repo's orientation.)
- `~/.claude/statusline.sh``adapters/claude/statusline.sh` — the Claude Code CLI terminal
status-line script (invoked by `~/.claude/settings.json`).
Because these are *global*, decisions here are never scoped to this repo alone. The test
for anything proposed ("should we adopt linters / hooks / CI?", "is this skill worth
building?") is **whether it's generally best-in-class across all the repos I build** — this
repo is just where that cross-repo standard is authored and stored.
## Layout
- `README.md` — human index of the four standards docs.
- `how-i-work.md` — universal preferences (served as `~/.claude/CLAUDE.md`).
- `portability.md` — the vendor-neutral / hot-swap protocol.
- `retrofit-playbook.md` — terminal runbook for moving a project's brain onto disk.
- `subagents-handbook.md` — designing and running delegated agents.
- `guides/`**neutral substance**: the self-contained operating guide for each command
and agent, written as plain prose any vendor's harness could follow. Also holds shared
reference docs that aren't tied to one command — e.g. `placement.md` (where a new project
should live), which `/new-project` and `how-i-work.md` both point at.
- `adapters/claude/{commands,agents}/`**thin Claude wrappers**: frontmatter + a pointer
to the matching `guides/` file. Substance never lives in a wrapper.
- `INBOX.md` — cross-project capture buffer for untriaged ideas/bugs (see below).
- `ROADMAP.md` — longer-term backlog for this repo (future agents, commands, standards).
- `STATUS.md` — latest cross-project roundup snapshot, overwritten + committed by `/roundup`
each run (git history is the diff over time).
## Conventions when editing
- **Substance in `guides/`, machinery in `adapters/`.** To add or change what a
command/agent *does*, edit its `guides/<name>.md`. The wrapper only carries frontmatter
and the "read your guide, then follow it exactly; if you can't read it, stop — don't
improvise" pointer. Match the existing wrappers exactly.
- **Portability first.** Knowledge lives in vendor-neutral files; vendor-named paths are
relative symlinks into them. Full protocol in `portability.md`; the `portability-checker`
agent verifies it.
- **Keep this file lean.** Whole-repo, every-session facts only. Subsystem detail goes in a
guide. Near-term status goes in `## Current state` below; longer-term backlog in
`ROADMAP.md`.
- Follow `how-i-work.md` for collaboration, git, and debugging defaults.
## The capture → triage → roadmap loop
A frictionless path from "random thought about some repo" to "on the right list," so ideas
stop scattering into phone notes:
- **`/capture`** appends one structured line to `INBOX.md` from *any* repo (you need not be
in the target repo) and commits+pushes the standards repo so the note is durable. Capture
is deliberately dumb and uniform — no routing decision at capture time.
- **`/triage`**, run *inside* a project, drains that project's `INBOX.md` items and routes
each — `Current state`, the repo's `ROADMAP.md`, a guide, or discard — with your
approval, then clears them from the inbox.
- **Inbox is upstream of every repo's ROADMAP.** ROADMAP is triaged, owned backlog for one
repo; the inbox is the raw, cross-project buffer that feeds the ROADMAPs.
**Inbox-check line (the portable surfacing mechanism).** Every project repo's `AGENTS.md`
should carry this so any vendor's agent surfaces pending items at session start:
> **Inbox check:** At session start, if `~/Projects/standards/INBOX.md` exists, scan it for
> items tagged `(this-repo)` and surface them before proposing next steps; triage with
> `/triage`.
## Current state
- **Fleet built and live** — commands `/capture /triage /roundup /new-project /handoff /retrofit
/full-eval /design /adjudicate`; subagents incl. `design-checker` + `onboarding-tester`
(substance in `guides/`, thin wrappers in `adapters/claude/`, symlinked into `~/.claude`).
Dogfoods its own standard. Latest `/roundup`: `STATUS.md` 2026-06-16.
- **`/adjudicate` shipped this session (ROADMAP item 10).** Debates parked P2/P3 backlog to
DROP/DO/ESCALATE verdicts; recommend-only v1, autonomy gated by blast radius, ROADMAP-only.
**Untested on a real backlog** — the first run should eyeball the judge's drop-bias before
trusting it (tune the tie-break rule in `guides/adjudicate.md` if it over-drops). Best first
targets: keysat, recap.
- **`onboarding-tester` live; first harness still pending (item 9).** Stage 1 (Path 1, manual
issuance under keysat's `merchant-onboard` key) is **unblocked** — build the harness + first run
in a keysat session. Stage 2 (Path 2, buyer-pays on regtest) is **gated** on keysat's
`payment_providers:write` scope + network gate + sandbox flag.
- **Design system live (item 8).** recap Case B (Extract) backfill done (app 0.2.161). **Open
(decoupled):** a fresh Claude Design run to confirm export internals + tune Phase-C.
- **Next steps:** (1) first real `/adjudicate` run on keysat/recap to calibrate the drop-bias
(item 10); (2) Stage-1 `onboarding-tester` harness in a keysat session (item 9); (3) cross-repo
quality-gate standard + `/harden` (item 1); (4) non-git-folder sweep under `~/Projects` (item 6
residual).
- Queued in `INBOX.md` for other repos' `/triage`: keysat design cleanup (P2) + onboarding Path-2
(P3); `ten31-transcripts` mini-retrofit; `ten31-database` networking/icon/intake; (standards)
operator-onboarding agent (P3).