Add doc-auditor agent and wire it into full-eval

Read-only documentation drift auditor: checks every README/instruction/HTML claim against the code and reports what no longer matches, modeled on the janitor/reviewer wrappers. Added to the full-eval suite (always-run) and to the subagents handbook roster and length budgets.
This commit is contained in:
Keysat
2026-06-13 00:06:43 -05:00
parent 25dab4204b
commit 5f7a51199c
5 changed files with 148 additions and 10 deletions
+6 -5
View File
@@ -88,6 +88,7 @@ least one.
| exerciser | containment | Black-box QA: run it, feed it normal + hostile inputs | ✅ in kit |
| researcher | compression | Multi-source web research → cited brief | ✅ in kit |
| janitor | compression | Spring-clean docs/artifacts: report stale, orphaned, superseded files | ✅ in kit |
| doc-auditor | independence | Doc drift: every README/instruction/HTML claim checked against the code | ✅ in kit |
| test-runner | compression | Run the suite, return only failures + causes | build when a repo has a real suite |
| docs-reader | compression | Read a library's docs, return just the calls you need | build on 3rd manual-trawling task |
| fresh-debugger | independence | Gets symptoms only (never your theories), hunts the bug | build next time you're stuck >1hr |
@@ -194,9 +195,9 @@ softened.
### Length budgets
Reviewer ≤ 70 lines · exerciser, spec-checker, portability-checker & janitor ≤ 80 ·
researcher & security-auditor ≤ 100 · evaluator ≤ 120. Tighten freely; loosen only with
cause.
Reviewer ≤ 70 lines · exerciser, spec-checker, portability-checker, janitor & doc-auditor
≤ 80 · researcher & security-auditor ≤ 100 · evaluator ≤ 120. Tighten freely; loosen only
with cause.
---
@@ -257,8 +258,8 @@ Three real options:
1. **Slash command (this kit's choice): `/full-eval`.** A command file is just a stored
prompt for the *main thread* — and the main thread *can* fan out. `/full-eval` makes
it launch evaluator + security-auditor + exerciser (+ start9-spec-checker when
relevant) in parallel, then synthesize one deduplicated, prioritized report. Zero new
it launch evaluator + security-auditor + exerciser + doc-auditor (+ start9-spec-checker
when relevant) in parallel, then synthesize one deduplicated, prioritized report. Zero new
machinery, works in any session.
2. **`claude --agent <name>` (advanced).** A session launched this way takes the agent's
system prompt as its main thread — and a main-thread agent *can* spawn subagents,