WO-001 Dry Run Report — Production Line Validation

Date: 2026-03-18
Product: Enterprise Architecture Agent for STM
Reference implementation: C:/Projects/agent-ea/
Production line: digital-talent (Stages 4-7)
Purpose: Stress-test the production line, not redesign agent-ea


Executive Summary

The production line stages are structurally sound — the right activities happen in the right order with the right owners. But when applied to a real product, significant operational gaps emerge: missing templates, undefined handoff formats, ambiguous instructions, and no automation for verification. The biggest theme: the stages describe what to do but not how to do it, leaving each factory worker to reinvent the wheel.

Verdict: The line can produce a digital talent, but it would take 2x the estimated time because Pablo/Quinn/Diego/Dana would constantly need to improvise. The fixes below would cut that overhead significantly.


Stage 4 (Build) — Gap/Lessons List

What Works

  • Assembly.md phases A-H cover the right ground
  • The phase ordering is correct (repo → config → skills → templates → refs → orchestration → docs → verify)
  • Agent-ea passes most structural checks — it was well-built

Gaps Found

| # | Gap | Severity | Fix |
|---|---|---|---|
| B-01 | No CLAUDE.md template. Assembly says "write CLAUDE.md with required sections" but provides no starter template. Pablo must invent formatting each time. | HIGH | Create production-lines/digital-talent/templates/CLAUDE-template.md with all 9 required sections as fillable stubs |
| B-02 | No per-request folder structure guidance. Work order lists demandes-ae/ as a deliverable, but assembly.md never mentions creating request folder structures. | HIGH | Add Phase A.2: "Create request folder structure per work order" with standard pattern |
| B-03 | No skill template file. Assembly says "each skill follows the standard skill template" but the template doesn't exist in the production line. | HIGH | Create production-lines/digital-talent/templates/skill-template.md with frontmatter + required sections |
| B-04 | No documentation templates. Phase G says "user guide, config guide, skill reference card" but provides no templates. Remy's collaboration model is undefined. | HIGH | Create doc templates in production-lines/digital-talent/templates/ |
| B-05 | Quick-start cheat sheet missing from Phase G. Stage 7 expects it as a deliverable but Phase G doesn't mention it. | MEDIUM | Add to Phase G list |
| B-06 | Assembly checklist is too generic. Final checklist is just 8 checkboxes. Needs to be a fill-in form with file counts, skill counts, test results. | MEDIUM | Create detailed assembly checklist template |
| B-07 | No consistency check tooling. CLAUDE.md-to-commands/ sync is manual. Agent-ea has 5 undocumented skills + 2 phantom references. | MEDIUM | Add a verification script or at minimum a documented manual procedure |
| B-08 | content-in/ subdirectory guidance is vague. Assembly says "adjust based on domain" but doesn't give examples. | LOW | Add a table of common domain → subdirectory mappings |
| B-09 | Reference material format requirements missing. Agent-ea has .jpg, .doc, .pdf mixed with .md. No guidance on what formats agents can actually read. | LOW | Add note: prefer markdown; document a fallback for binary files |
| B-10 | Scope creep handling undefined. Agent-ea grew from 15 → 20 skills during development. Assembly doesn't address how to handle extra skills that emerge. | LOW | Add note in Phase C: document extras in assembly checklist, update WO |
| B-11 | settings.json vs settings.local.json ambiguity. Assembly says one, agent-ea uses the other. | LOW | Clarify which file and why |
| B-12 | No guidance on .claude/skills/ directory. Agent-ea has dev-only skills there (skill-creator, improve-agent) that are separate from product skills. | LOW | Address in Phase A or Phase H (strip dev skills before handoff) |
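
The verification script suggested for B-07 could start as small as the sketch below. It assumes that each product skill is one .md file under .claude/commands/ and that CLAUDE.md mentions skills as slash commands such as /ea-note. The function name, the regular expression, and the sample inputs are hypothetical illustrations, not existing factory tooling.

```python
import re

# Hypothetical pattern: "/" followed by a lowercase skill name, not preceded
# by a word character, slash, or dot (so file paths are not picked up).
SLASH_COMMAND = re.compile(r"(?<![\w/.])/([a-z][a-z0-9-]+)")

def check_skill_sync(claude_md_text, command_files, builtins=("help",)):
    """Compare slash commands mentioned in CLAUDE.md with command file names.

    command_files: iterable of file names such as "do-work.md".
    Returns (undocumented, phantom) as sorted lists of skill names.
    """
    documented = set(SLASH_COMMAND.findall(claude_md_text)) - set(builtins)
    available = {name[:-3] for name in command_files if name.endswith(".md")}
    undocumented = sorted(available - documented)  # file exists, never mentioned
    phantom = sorted(documented - available)       # mentioned, no file behind it
    return undocumented, phantom

# Made-up sample inputs for illustration only.
sample_claude_md = "Run /ea-note first, then /do-work. See docs/guide.md for details."
sample_commands = ["do-work.md", "ea-archi-extraction.md"]
undocumented, phantom = check_skill_sync(sample_claude_md, sample_commands)
```

With these sample inputs, ea-archi-extraction is reported as undocumented and ea-note as a phantom reference, which are the two kinds of drift B-07 describes.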

Agent-ea Quick Fixes (Side Effects)

| # | Fix | Effort |
|---|---|---|
| A-01 | Remove /ea-note and /ea-validate phantom references from CLAUDE.md workflow text (they're aliases that don't resolve) | 5 min |
| A-02 | Document the 5 extra skills in CLAUDE.md skills table (ae-creer-diagramme-hautniveau, bmad-add-creer-note, bmad-add-revoir-note, do-work, ea-archi-extraction) OR remove them if they're dev artifacts | 15 min |
| A-03 | Remove bmad-v6 reference from CLAUDE.md lines 4-5 ("Source de verite: bmad-v6"); it is a factory-internal path that shouldn't be in a client product | 5 min |
| A-04 | Strip "WORKSPACE GELE" freeze notice from CLAUDE.md; this is a dev state, not a product feature | 5 min |

Stage 5 (QA) — Gap/Lessons List

What Works

  • Test categories are correct (structural, functional, edge case, documentation)
  • Scoring formula is clear and the thresholds are reasonable
  • WO Section 8 provides excellent product-specific test cases

Gaps Found

| # | Gap | Severity | Fix |
|---|---|---|---|
| Q-01 | No test execution protocol. Does Quinn run each skill manually in a fresh Claude session? With which model? In what order? No guidance on test environment setup. | HIGH | Add "Test Environment Setup" section: fresh session, specified model, clean test folder |
| Q-02 | No test data specification. Stage says Quinn gets test data from Pablo but doesn't define what constitutes adequate test data. F7 (repository validation) needs synthetic data with known planted gaps. | HIGH | Define test data requirements per test type. Add to Pablo's Stage 4 handoff checklist. |
| Q-03 | No QA certification template. Stage says "QA certification" output but provides no template for the document. | HIGH | Create production-lines/digital-talent/templates/qa-certification-template.md |
| Q-04 | Isolation vs integration testing undefined. F4 (solution design) and F8 (orchestrator) require multi-step pipelines as prerequisites. Stage doesn't say whether Quinn tests skills in isolation or runs the full pipeline. | MEDIUM | Add guidance: test skills in isolation first (unit), then test pipeline (integration) |
| Q-05 | Edge case inputs not operationalized. "Empty input" for a slash command is ambiguous: empty file? Missing file? Empty argument? | MEDIUM | Define concrete edge case scenarios per input type (file-based, argument-based, multi-file) |
| Q-06 | WO-specific vs stage-generic threshold ambiguity. WO says 8/8 + 5/6, stage says 100% + 80%. They're equivalent here but could diverge. | LOW | Clarify: WO thresholds override stage defaults |
| Q-07 | Remediation loop undefined. Stage says FAIL → back to Pablo, but no guidance on how Pablo addresses issues, re-submits, or what a re-test looks like. | MEDIUM | Add remediation workflow: Pablo fixes → re-submits with delta → Quinn re-tests failed cases only |
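
The equivalence noted in Q-06 can be checked with a little arithmetic: 100% of 8 cases is 8 passes, and 80% of 6 cases is 4.8, which rounds up to 5 passes. The sketch below shows that check together with the proposed "WO overrides stage defaults" rule. The function names are hypothetical, and the report does not say which test category each threshold belongs to, so the code leaves them unlabeled.

```python
def min_passes(total_cases, percent):
    """Smallest pass count that meets a percentage threshold (integer ceiling)."""
    return -(-total_cases * percent // 100)

def effective_threshold(stage_percent, total_cases, wo_min_passes=None):
    """Return the required pass count; a work-order value overrides the stage default."""
    if wo_min_passes is not None:
        return wo_min_passes
    return min_passes(total_cases, stage_percent)

# Values quoted in Q-06: WO says 8/8 and 5/6, stage says 100% and 80%.
strict = effective_threshold(100, 8, wo_min_passes=8)
relaxed = effective_threshold(80, 6, wo_min_passes=5)
stage_only = (min_passes(8, 100), min_passes(6, 80))
```

Here the stage defaults and the WO counts agree. They would diverge as soon as the case count changes: with 7 cases, 80% requires 6 passes, so a WO that still said 5 would be looser than the stage default.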

Stage 6 (Deploy) — Gap/Lessons List

What Works

  • Three deployment options cover realistic scenarios
  • Rollback plan concept is right
  • Verification checks are mostly right

Gaps Found

| # | Gap | Severity | Fix |
|---|---|---|---|
| D-01 | No packaging manifest. No list of what ships to client vs what's internal. Agent-ea has ~30 top-level items. Someone must decide: do TODO.md, install-agent.ps1, ARCHITECTURE-*.md, notes/, prompts/, emp/, Agent-ae/, .claude/skills/ ship? | HIGH | Add Phase 6.0: "Create packaging manifest" listing include/exclude patterns. Or add to assembly.md as Phase H output. |
| D-02 | No sanitization step. CLAUDE.md contains factory-internal paths (bmad-v6), dev freeze notices, dev scripts. These must be stripped before client delivery. No stage mentions this. | HIGH | Add Step 6.1b: "Sanitize agent configuration: remove factory references, dev state, internal paths" |
| D-03 | /help verification check is wrong. /help is a Claude Code built-in command, not an agent feature. Testing it proves Claude Code works, not the agent. | MEDIUM | Replace with: "Agent responds correctly when asked about its capabilities (checks CLAUDE.md role definition is active)" |
| D-04 | No deployment checklist template. Diego has nothing to fill in as evidence. | MEDIUM | Create production-lines/digital-talent/templates/deployment-manifest-template.md |
| D-05 | No guidance on .mcp.json. Agent-ea has one referencing ea-apps MCP server. Does this ship? Does client need it? | MEDIUM | Add to verification checklist: "Review .mcp.json: verify all referenced servers are available in client environment or remove" |
| D-06 | No guidance on dev-only artifacts. .claude/skills/ contains dev tools (skill-creator, improve-agent, dev-html) that shouldn't ship. | MEDIUM | Add to packaging manifest: exclude .claude/skills/ dev tools |
| D-07 | "Naming conventions active" check is vague. Naming is enforced by CLAUDE.md instructions, not code. What does "active" mean? | LOW | Rewrite: "Run a skill, verify output file follows naming convention from CLAUDE.md" |
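
One way the packaging manifest from D-01 might work is as a list of exclude patterns applied to the repository's relative paths. The sketch below is a minimal illustration under that assumption. The exclude list is only an example (deciding the real list is exactly what D-01 asks for), and file names such as ARCHITECTURE-v2.md and notes/meeting.md are made up for the demonstration.

```python
from fnmatch import fnmatchcase

# Illustrative exclude patterns only; the real list is the decision D-01 asks for.
EXCLUDE_PATTERNS = ["TODO.md", "ARCHITECTURE-*.md", "notes/*", ".claude/skills/*"]

def split_package(paths, exclude_patterns=EXCLUDE_PATTERNS):
    """Partition relative paths into (ship, hold_back) using glob patterns."""
    ship, hold_back = [], []
    for path in paths:
        # fnmatchcase lets "*" match across "/" and is case-sensitive on every OS.
        excluded = any(fnmatchcase(path, pattern) for pattern in exclude_patterns)
        (hold_back if excluded else ship).append(path)
    return ship, hold_back

# Hypothetical file listing for illustration.
ship, hold_back = split_package([
    "CLAUDE.md",
    "TODO.md",
    "ARCHITECTURE-v2.md",
    "notes/meeting.md",
    ".claude/commands/do-work.md",
    ".claude/skills/skill-creator/SKILL.md",
])
```

Keeping the patterns in one reviewed list would also cover D-06, since excluding .claude/skills/ dev tools becomes a single manifest entry rather than a separate manual step.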

Stage 7 (Deliver) — Gap/Lessons List

What Works

  • Handover session structure is well-designed (6 agenda items with timing)
  • Client operational capability confirmation is concrete (3+ skills, create request, understand halts)
  • Initial feedback collection is a good practice

Gaps Found

| # | Gap | Severity | Fix |
|---|---|---|---|
| V-01 | No demo script per product type. Dana needs to know which 3 skills to demo. For EA: note-revue → intrant → diagramme. But the stage doesn't reference the WO for recommended demos. | MEDIUM | Add: "Consult work order for recommended demo scenarios. If none specified, demo: intake skill, core capability skill, output skill." |
| V-02 | WO-specific feedback questions not referenced. WO Section 12 has detailed EA-specific feedback questions, but Stage 7 only asks generic ones. | MEDIUM | Add: "Supplement generic feedback with product-specific questions from the work order (Section 12 or equivalent)." |
| V-03 | Quick-start cheat sheet expected but never built. Stage 7 lists it as a deliverable; assembly.md Phase G doesn't mention it. | MEDIUM | Add to assembly.md Phase G. Create template. |
| V-04 | Delivery confirmation format undefined. "Signed by client": by email? Markdown? Confluence page? | LOW | Define standard format (suggest: markdown template with client signature line, stored in order records) |
| V-05 | Documentation delivery blocked. User guide, config guide, and skill reference card don't exist for agent-ea. Stage 7 can't deliver what Stage 4 didn't produce. | HIGH (but root cause is B-04) | Fix B-04 in Stage 4; this resolves downstream |
| V-06 | No handover notes template. Dana needs to capture what happened in the session but has no structure for it. | LOW | Create template with: attendees, demo results, questions asked, action items, client confidence score |

Cross-Stage Issues

| # | Issue | Stages Affected | Fix |
|---|---|---|---|
| X-01 | No artifact templates anywhere. The line defines activities and outputs but provides zero templates for any output document (assembly checklist, QA certification, deployment manifest, delivery confirmation, handover notes). Every factory worker improvises. | 4, 5, 6, 7 | Create a production-lines/digital-talent/templates/ directory with all templates |
| X-02 | Handoff protocol between stages is implicit. How does Pablo hand off to Quinn? Git tag? Slack message? Markdown file in the order directory? Each transition is undocumented. | 4→5, 5→6, 6→7 | Define standard handoff: create stage-N-complete.md in the order directory with checklist + links |
| X-03 | Work order is underutilized. Stages 5-7 should reference WO sections for product-specific details (test cases, feedback questions, demo scenarios) but they don't link back to the WO. | 5, 6, 7 | Add "Consult work order Section X for product-specific details" to each stage |
| X-04 | No order directory tracking. There's no convention for where stage artifacts (checklist, QA cert, deployment manifest, delivery confirmation) are stored. | 4-7 | Define: production-lines/orders/{order-id}/stage-artifacts/ |
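
The handoff file proposed in X-02 could be generated rather than hand-written. The sketch below assumes the file is stored under the X-04 directory convention, which combines two proposals this report keeps separate. The function names, the "Owner" field, and the section headings are hypothetical.

```python
from pathlib import PurePosixPath

def handoff_path(order_id, stage):
    """Location of the handoff note, combining the X-02 and X-04 proposals."""
    base = PurePosixPath("production-lines/orders") / order_id / "stage-artifacts"
    return base / f"stage-{stage}-complete.md"

def render_handoff(stage, owner, checklist, links):
    """Render the handoff note as markdown: a checklist followed by links.

    checklist: list of (item, done) pairs; links: list of strings.
    """
    lines = [f"# Stage {stage} complete", f"Owner: {owner}", "", "## Checklist"]
    lines += [f"- [{'x' if done else ' '}] {item}" for item, done in checklist]
    lines += ["", "## Links"]
    lines += [f"- {link}" for link in links]
    return "\n".join(lines) + "\n"

# Hypothetical Stage 4 → 5 handoff from Pablo to Quinn.
path = handoff_path("WO-001", 4)
note = render_handoff(
    4,
    "Pablo",
    [("Assembly checklist filled in", True), ("Test data provided", False)],
    ["C:/Projects/agent-ea/"],
)
```

An unchecked item in the rendered note, such as the missing test data here, gives the receiving stage an explicit reason to push back before starting work.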

Priority Summary

Must Fix Before Next Production Run (HIGH)

  1. B-01: CLAUDE.md template
  2. B-02: Per-request folder structure in assembly
  3. B-03: Skill template file
  4. B-04: Documentation templates (user guide, config guide, skill card, cheat sheet); this also unblocks V-05
  5. Q-01: Test execution protocol
  6. Q-02: Test data specification
  7. Q-03: QA certification template
  8. D-01: Packaging manifest
  9. D-02: Sanitization step
  10. X-01: All artifact templates

Should Fix (MEDIUM)

  • B-05, B-06, B-07, Q-04, Q-05, Q-07, D-03, D-04, D-05, D-06, V-01, V-02, V-03, X-02, X-03, X-04

Nice to Have (LOW)

  • B-08, B-09, B-10, B-11, B-12, Q-06, D-07, V-04, V-06

Agent-ea Quick Fixes

These fell out naturally from the audit — low-effort fixes to the reference implementation:

| # | Fix | Effort | Priority |
|---|---|---|---|
| A-01 | Remove /ea-note and /ea-validate phantom references from CLAUDE.md | 5 min | Do now |
| A-02 | Document or remove 5 extra skills from .claude/commands/ | 15 min | Do now |
| A-03 | Remove bmad-v6 factory reference from CLAUDE.md | 5 min | Before deploy |
| A-04 | Strip "WORKSPACE GELE" dev freeze notice from CLAUDE.md | 5 min | Before deploy |
| A-05 | Create client documentation (user guide, config guide, skill card) | 2-3 hours | Before deliver |
| A-06 | Move dev-only skills from .claude/skills/ to a non-shipping location | 10 min | Before deploy |
| A-07 | Review .mcp.json: verify ea-apps server is needed for client | 10 min | Before deploy |
| A-08 | Clean up internal dev files (ARCHITECTURE-*.md, STRUCTURE-ANALYSIS-*.md, TODO.md, MEMORY-*.md, NEW-PROJECT-SETUP.md) | 15 min | Before deploy |
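
Fixes A-03 and A-04, and the sanitization step from D-02 more generally, lend themselves to an automated scan before packaging. The sketch below is a minimal illustration that uses the two markers named in this report. The function name and the sample CLAUDE.md text are hypothetical, and a real marker list would need to be maintained per product.

```python
# Markers taken from this report; a real list would be maintained per product.
FACTORY_MARKERS = ("bmad-v6", "WORKSPACE GELE")

def find_factory_references(text, markers=FACTORY_MARKERS):
    """Return (line_number, marker) pairs for every marker found (1-indexed)."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        for marker in markers:
            if marker.lower() in line.lower():  # case-insensitive match
                hits.append((number, marker))
    return hits

# Made-up CLAUDE.md excerpt for illustration.
sample = "# Agent EA\nSource de verite: bmad-v6\n\nWORKSPACE GELE\nRole: architect\n"
hits = find_factory_references(sample)
```

A deploy step could refuse to package while this list is non-empty, which would turn the sanitization requirement into a check Diego can record as evidence.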

Conclusion

The production line's design is right. The operational tooling is what's missing. The highest-leverage fix is creating the templates directory (X-01): this single action addresses B-01, B-03, B-04, B-06, Q-03, D-04, and V-06. The second-highest-leverage fix is the sanitization/packaging step (D-01 + D-02), which prevents shipping factory internals to clients.

The line is ready for a second dry run after templates are in place. The agent-ea quick fixes (A-01 through A-08) can be done in parallel.