Add doc-auditor agent and wire it into full-eval

Read-only documentation drift auditor: checks every README/instruction/HTML claim against the code and reports what no longer matches, modeled on the janitor/reviewer wrappers. Added to the full-eval suite (always-run) and to the subagents handbook roster and length budgets.
2026-06-13 00:06:43 -05:00
parent 25dab4204b
commit 5f7a51199c
5 changed files with 148 additions and 10 deletions
@@ -88,6 +88,7 @@ least one.
 | exerciser | containment | Black-box QA: run it, feed it normal + hostile inputs | ✅ in kit |
 | researcher | compression | Multi-source web research → cited brief | ✅ in kit |
 | janitor | compression | Spring-clean docs/artifacts: report stale, orphaned, superseded files | ✅ in kit |
+| doc-auditor | independence | Doc drift: every README/instruction/HTML claim checked against the code | ✅ in kit |
 | test-runner | compression | Run the suite, return only failures + causes | build when a repo has a real suite |
 | docs-reader | compression | Read a library's docs, return just the calls you need | build on 3rd manual-trawling task |
 | fresh-debugger | independence | Gets symptoms only (never your theories), hunts the bug | build next time you're stuck >1hr |
@@ -194,9 +195,9 @@ softened.

 ### Length budgets

-Reviewer ≤ 70 lines · exerciser, spec-checker, portability-checker & janitor ≤ 80 ·
-researcher & security-auditor ≤ 100 · evaluator ≤ 120. Tighten freely; loosen only with
-cause.
+Reviewer ≤ 70 lines · exerciser, spec-checker, portability-checker, janitor & doc-auditor
+≤ 80 · researcher & security-auditor ≤ 100 · evaluator ≤ 120. Tighten freely; loosen only
+with cause.

 ---

@@ -257,8 +258,8 @@ Three real options:

 1. **Slash command (this kit's choice): `/full-eval`.** A command file is just a stored
   prompt for the *main thread* — and the main thread *can* fan out. `/full-eval` makes
-   it launch evaluator + security-auditor + exerciser (+ start9-spec-checker when
-   relevant) in parallel, then synthesize one deduplicated, prioritized report. Zero new
+   it launch evaluator + security-auditor + exerciser + doc-auditor (+ start9-spec-checker
+   when relevant) in parallel, then synthesize one deduplicated, prioritized report. Zero new
   machinery, works in any session.
 2. **`claude --agent <name>` (advanced).** A session launched this way takes the agent's
   system prompt as its main thread — and a main-thread agent *can* spawn subagents,