Add doc-auditor agent and wire it into full-eval
Read-only documentation drift auditor: checks every README/instruction/HTML claim against the code and reports what no longer matches, modeled on the janitor/reviewer wrappers. Added to the full-eval suite (always-run) and to the subagents handbook roster and length budgets.
This commit is contained in:
@@ -88,6 +88,7 @@ least one.
|
||||
| exerciser | containment | Black-box QA: run it, feed it normal + hostile inputs | ✅ in kit |
|
||||
| researcher | compression | Multi-source web research → cited brief | ✅ in kit |
|
||||
| janitor | compression | Spring-clean docs/artifacts: report stale, orphaned, superseded files | ✅ in kit |
|
||||
| doc-auditor | independence | Doc drift: every README/instruction/HTML claim checked against the code | ✅ in kit |
|
||||
| test-runner | compression | Run the suite, return only failures + causes | build when a repo has a real suite |
|
||||
| docs-reader | compression | Read a library's docs, return just the calls you need | build on 3rd manual-trawling task |
|
||||
| fresh-debugger | independence | Gets symptoms only (never your theories), hunts the bug | build next time you're stuck >1hr |
|
||||
@@ -194,9 +195,9 @@ softened.
|
||||
|
||||
### Length budgets
|
||||
|
||||
Reviewer ≤ 70 lines · exerciser, spec-checker, portability-checker & janitor ≤ 80 ·
|
||||
researcher & security-auditor ≤ 100 · evaluator ≤ 120. Tighten freely; loosen only with
|
||||
cause.
|
||||
Reviewer ≤ 70 lines · exerciser, spec-checker, portability-checker, janitor & doc-auditor
|
||||
≤ 80 · researcher & security-auditor ≤ 100 · evaluator ≤ 120. Tighten freely; loosen only
|
||||
with cause.
|
||||
|
||||
---
|
||||
|
||||
@@ -257,8 +258,8 @@ Three real options:
|
||||
|
||||
1. **Slash command (this kit's choice): `/full-eval`.** A command file is just a stored
|
||||
prompt for the *main thread* — and the main thread *can* fan out. `/full-eval` makes
|
||||
it launch evaluator + security-auditor + exerciser (+ start9-spec-checker when
|
||||
relevant) in parallel, then synthesize one deduplicated, prioritized report. Zero new
|
||||
it launch evaluator + security-auditor + exerciser + doc-auditor (+ start9-spec-checker
|
||||
when relevant) in parallel, then synthesize one deduplicated, prioritized report. Zero new
|
||||
machinery, works in any session.
|
||||
2. **`claude --agent <name>` (advanced).** A session launched this way takes the agent's
|
||||
system prompt as its main thread — and a main-thread agent *can* spawn subagents,
|
||||
|
||||
Reference in New Issue
Block a user