WO-001 Dry Run Report — Production Line Validation

Date: 2026-03-18
Product: Enterprise Architecture Agent for STM
Reference implementation: C:/Projects/agent-ea/
Production line: digital-talent (Stages 4-7)
Purpose: Stress-test the production line, not redesign agent-ea

Executive Summary

The production line stages are structurally sound — the right activities happen in the right order with the right owners. But when applied to a real product, significant operational gaps emerge: missing templates, undefined handoff formats, ambiguous instructions, and no automation for verification. The biggest theme: the stages describe what to do but not how to do it, leaving each factory worker to reinvent the wheel.

Verdict: The line can produce a digital talent, but it would take 2x the estimated time because Pablo/Quinn/Diego/Dana would constantly need to improvise. The fixes below would cut that overhead significantly.

Stage 4 (Build) — Gap/Lessons List

What Works

Assembly.md phases A-H cover the right ground
The phase ordering is correct (repo → config → skills → templates → refs → orchestration → docs → verify)
Agent-ea passes most structural checks — it was well-built

Gaps Found

#	Gap	Severity	Fix
B-01	No CLAUDE.md template. Assembly says "write CLAUDE.md with required sections" but provides no starter template. Pablo must invent formatting each time.	HIGH	Create `production-lines/digital-talent/templates/CLAUDE-template.md` with all 9 required sections as fillable stubs
B-02	No per-request folder structure guidance. Work order lists `demandes-ae/` as a deliverable, but assembly.md never mentions creating request folder structures.	HIGH	Add Phase A.2: "Create request folder structure per work order" with standard pattern
B-03	No skill template file. Assembly says "each skill follows the standard skill template" but the template doesn't exist in the production line.	HIGH	Create `production-lines/digital-talent/templates/skill-template.md` with frontmatter + required sections
B-04	No documentation templates. Phase G says "user guide, config guide, skill reference card" but provides no templates. Remy's collaboration model undefined.	HIGH	Create doc templates in `production-lines/digital-talent/templates/`
B-05	Quick-start cheat sheet missing from Phase G. Stage 7 expects it as a deliverable but Phase G doesn't mention it.	MEDIUM	Add to Phase G list
B-06	Assembly checklist is too generic. Final checklist is just 8 checkboxes. Needs to be a fill-in form with file counts, skill counts, test results.	MEDIUM	Create detailed assembly checklist template
B-07	No consistency check tooling. CLAUDE.md-to-commands/ sync is manual. Agent-ea has 5 undocumented skills + 2 phantom references.	MEDIUM	Add a verification script or at minimum a documented manual procedure
B-08	content-in/ subdirectory guidance vague. Assembly says "adjust based on domain" but doesn't give examples.	LOW	Add a table of common domain → subdirectory mappings
B-09	Reference material format requirements missing. Agent-ea has .jpg, .doc, .pdf mixed with .md. No guidance on what formats agents can actually read.	LOW	Add note: prefer markdown, document fallback for binary files
B-10	Scope creep handling undefined. Agent-ea grew from 15 → 20 skills during development. Assembly doesn't address how to handle extra skills that emerge.	LOW	Add note in Phase C: document extras in assembly checklist, update WO
B-11	settings.json vs settings.local.json ambiguity. Assembly says one, agent-ea uses the other.	LOW	Clarify which file and why
B-12	No guidance on .claude/skills/ directory. Agent-ea has dev-only skills there (skill-creator, improve-agent) that are separate from product skills.	LOW	Address in Phase A or Phase H (strip dev skills before handoff)
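
The check behind B-07 could be scripted along the following lines. This is a minimal sketch, not the factory's tooling: it assumes CLAUDE.md refers to skills as `/skill-name` tokens and that each skill is a markdown file named `<skill-name>.md` under `.claude/commands/`. The function names and the regular expression are hypothetical.

```python
import re
from pathlib import Path

# Assumption: CLAUDE.md refers to skills as /skill-name tokens, and each
# skill is a markdown file named <skill-name>.md under .claude/commands/.
# The lookarounds skip slashes that sit inside file paths.
SKILL_REF = re.compile(r"(?<![\w/:.])/([a-z0-9]+(?:-[a-z0-9]+)*)(?![\w/-])")


def documented_skills(claude_md_text: str) -> set[str]:
    """Return skill names referenced as slash commands in CLAUDE.md text."""
    return set(SKILL_REF.findall(claude_md_text))


def skills_on_disk(commands_dir: Path) -> set[str]:
    """Return skill names derived from the markdown files in commands_dir."""
    return {p.stem for p in commands_dir.glob("*.md")}


def compare(documented: set[str], on_disk: set[str]) -> dict[str, list[str]]:
    """Report references with no file (phantom) and files with no reference."""
    return {
        "phantom": sorted(documented - on_disk),
        "undocumented": sorted(on_disk - documented),
    }
```

Feeding `compare` the output of `documented_skills` (on the text of CLAUDE.md) and `skills_on_disk` (on `.claude/commands/`) would be expected to surface the kind of findings recorded in A-01 and A-02, provided the naming assumptions above hold for agent-ea.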

Agent-ea Quick Fixes (Side Effects)

#	Fix	Effort
A-01	Remove `/ea-note` and `/ea-validate` phantom references from CLAUDE.md workflow text (they're aliases that don't resolve)	5 min
A-02	Document the 5 extra skills in CLAUDE.md skills table (ae-creer-diagramme-hautniveau, bmad-add-creer-note, bmad-add-revoir-note, do-work, ea-archi-extraction) OR remove them if they're dev artifacts	15 min
A-03	Remove bmad-v6 reference from CLAUDE.md lines 4-5 ("Source de verite: bmad-v6"); it is a factory-internal path that shouldn't be in a client product	5 min
A-04	Strip "WORKSPACE GELE" freeze notice from CLAUDE.md — this is a dev state, not a product feature	5 min

Stage 5 (QA) — Gap/Lessons List

What Works

Test categories are correct (structural, functional, edge case, documentation)
Scoring formula is clear and the thresholds are reasonable
WO Section 8 provides excellent product-specific test cases

Gaps Found

#	Gap	Severity	Fix
Q-01	No test execution protocol. Does Quinn run each skill manually in a fresh Claude session? With which model? In what order? No guidance on test environment setup.	HIGH	Add "Test Environment Setup" section: fresh session, specified model, clean test folder
Q-02	No test data specification. Stage says Quinn gets test data from Pablo but doesn't define what constitutes adequate test data. F7 (repository validation) needs synthetic data with known planted gaps.	HIGH	Define test data requirements per test type. Add to Pablo's Stage 4 handoff checklist.
Q-03	No QA certification template. Stage says "QA certification" output but provides no template for the document.	HIGH	Create `production-lines/digital-talent/templates/qa-certification-template.md`
Q-04	Isolation vs integration testing undefined. F4 (solution design) and F8 (orchestrator) require multi-step pipelines as prerequisites. Stage doesn't say whether Quinn tests skills in isolation or runs the full pipeline.	MEDIUM	Add guidance: test skills in isolation first (unit), then test pipeline (integration)
Q-05	Edge case inputs not operationalized. "Empty input" for a slash command is ambiguous — empty file? Missing file? Empty argument?	MEDIUM	Define concrete edge case scenarios per input type (file-based, argument-based, multi-file)
Q-06	WO-specific vs stage-generic threshold ambiguity. WO says 8/8 + 5/6, stage says 100% + 80%. They're equivalent here but could diverge.	LOW	Clarify: WO thresholds override stage defaults
Q-07	Remediation loop undefined. Stage says FAIL → back to Pablo, but no guidance on how Pablo addresses issues, re-submits, or what a re-test looks like.	MEDIUM	Add remediation workflow: Pablo fixes → re-submits with delta → Quinn re-tests failed cases only
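
The threshold ambiguity in Q-06 can be made concrete with a little arithmetic. The sketch below is illustrative only: the group names `strict` and `tolerant` are hypothetical labels, since the stage text quoted here does not say which test category each threshold belongs to. It shows that 5/6 and 80% agree on a six-test suite but diverge on a ten-test one.

```python
from fractions import Fraction
from typing import Optional

# Stage defaults quoted in Q-06: 100% for one group, 80% for the other.
# The keys "strict" and "tolerant" are hypothetical labels.
STAGE_DEFAULTS = {"strict": Fraction(1), "tolerant": Fraction(4, 5)}


def passes(passed: int, total: int, minimum: Fraction) -> bool:
    """True when the pass rate meets the minimum (exact rational arithmetic)."""
    return Fraction(passed, total) >= minimum


def effective_thresholds(wo_override: Optional[dict] = None) -> dict:
    """WO thresholds, when present, replace the stage defaults (fix proposed in Q-06)."""
    return {**STAGE_DEFAULTS, **(wo_override or {})}


# WO-001 expresses the second threshold as 5/6 rather than 80%.
wo = effective_thresholds({"tolerant": Fraction(5, 6)})

# Six tests: 5 passes satisfy both rules, 4 passes satisfy neither.
same_on_six = passes(5, 6, STAGE_DEFAULTS["tolerant"]) and passes(5, 6, wo["tolerant"])

# Ten tests: 8 passes meet 80% but fall short of 5/6.
diverge_on_ten = passes(8, 10, STAGE_DEFAULTS["tolerant"]) and not passes(8, 10, wo["tolerant"])
```

Using `Fraction` avoids floating-point edge cases at the boundary, which matters when a single test decides PASS or FAIL.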

Stage 6 (Deploy) — Gap/Lessons List

What Works

Three deployment options cover realistic scenarios
Rollback plan concept is right
Verification checks are mostly right

Gaps Found

#	Gap	Severity	Fix
D-01	No packaging manifest. No list of what ships to client vs what's internal. Agent-ea has ~30 top-level items. Someone must decide: do `TODO.md`, `install-agent.ps1`, `ARCHITECTURE-*.md`, `notes/`, `prompts/`, `emp/`, `Agent-ae/`, `.claude/skills/` ship?	HIGH	Add Phase 6.0: "Create packaging manifest" — list include/exclude patterns. Or add to assembly.md as Phase H output.
D-02	No sanitization step. CLAUDE.md contains factory-internal paths (bmad-v6), dev freeze notices, dev scripts. These must be stripped before client delivery. No stage mentions this.	HIGH	Add Step 6.1b: "Sanitize agent configuration — remove factory references, dev state, internal paths"
D-03	`/help` verification check is wrong. `/help` is a Claude Code built-in command, not an agent feature. Testing it proves Claude Code works, not the agent.	MEDIUM	Replace with: "Agent responds correctly when asked about its capabilities (checks CLAUDE.md role definition is active)"
D-04	No deployment checklist template. Diego has nothing to fill in as evidence.	MEDIUM	Create `production-lines/digital-talent/templates/deployment-manifest-template.md`
D-05	No guidance on .mcp.json. Agent-ea has one referencing `ea-apps` MCP server. Does this ship? Does client need it?	MEDIUM	Add to verification checklist: "Review .mcp.json — verify all referenced servers are available in client environment or remove"
D-06	No guidance on dev-only artifacts. `.claude/skills/` contains dev tools (skill-creator, improve-agent, dev-html) that shouldn't ship.	MEDIUM	Add to packaging manifest: exclude `.claude/skills/` dev tools
D-07	"Naming conventions active" check is vague. Naming is enforced by CLAUDE.md instructions, not code. What does "active" mean?	LOW	Rewrite: "Run a skill, verify output file follows naming convention from CLAUDE.md"
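
A packaging manifest of the kind proposed in D-01 and D-06 could be as simple as a list of exclude patterns applied to the repository's relative paths. The sketch below is an assumption-laden illustration: the patterns are examples drawn from the items named in D-01, not decisions about what actually ships, and `split_package` is a hypothetical helper.

```python
from fnmatch import fnmatchcase

# Illustrative exclude patterns only. Whether these items ship is exactly
# the decision D-01 says someone still has to make.
EXCLUDE = ["TODO.md", "ARCHITECTURE-*.md", "notes/*", ".claude/skills/*"]


def split_package(paths, exclude=EXCLUDE):
    """Partition relative POSIX paths into (ships, internal) by exclude patterns."""
    ships, internal = [], []
    for path in paths:
        # fnmatchcase's "*" also matches "/", so "notes/*" covers nested files.
        is_internal = any(fnmatchcase(path, pattern) for pattern in exclude)
        (internal if is_internal else ships).append(path)
    return ships, internal
```

Diego could attach the two resulting lists to the deployment evidence, which would also give D-04's checklist something concrete to record.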

Stage 7 (Deliver) — Gap/Lessons List

What Works

Handover session structure is well-designed (6 agenda items with timing)
Client operational capability confirmation is concrete (3+ skills, create request, understand halts)
Initial feedback collection is a good practice

Gaps Found

#	Gap	Severity	Fix
V-01	No demo script per product type. Dana needs to know which 3 skills to demo. For EA: note-revue → intrant → diagramme. But the stage doesn't reference the WO for recommended demos.	MEDIUM	Add: "Consult work order for recommended demo scenarios. If none specified, demo: intake skill, core capability skill, output skill."
V-02	WO-specific feedback questions not referenced. WO Section 12 has detailed EA-specific feedback questions, but Stage 7 only asks generic ones.	MEDIUM	Add: "Supplement generic feedback with product-specific questions from the work order (Section 12 or equivalent)."
V-03	Quick-start cheat sheet expected but never built. Stage 7 lists it as a deliverable; assembly.md Phase G doesn't mention it.	MEDIUM	Add to assembly.md Phase G. Create template.
V-04	Delivery confirmation format undefined. "Signed by client" — email? Markdown? Confluence page?	LOW	Define standard format (suggest: markdown template with client signature line, stored in order records)
V-05	Documentation delivery blocked. User guide, config guide, and skill reference card don't exist for agent-ea. Stage 7 can't deliver what Stage 4 didn't produce.	HIGH (but root cause is B-04)	Fix B-04 in Stage 4; this resolves downstream
V-06	No handover notes template. Dana needs to capture what happened in the session but has no structure for it.	LOW	Create template with: attendees, demo results, questions asked, action items, client confidence score

Cross-Stage Issues

#	Issue	Stages Affected	Fix
X-01	No artifact templates anywhere. The line defines activities and outputs but provides zero templates for any output document (assembly checklist, QA certification, deployment manifest, delivery confirmation, handover notes). Every factory worker improvises.	4, 5, 6, 7	Create a `production-lines/digital-talent/templates/` directory with all templates
X-02	Handoff protocol between stages is implicit. How does Pablo hand off to Quinn? Git tag? Slack message? Markdown file in the order directory? Each transition is undocumented.	4→5, 5→6, 6→7	Define standard handoff: create `stage-N-complete.md` in the order directory with checklist + links
X-03	Work order is underutilized. Stages 5-7 should reference WO sections for product-specific details (test cases, feedback questions, demo scenarios) but they don't link back to the WO.	5, 6, 7	Add "Consult work order Section X for product-specific details" to each stage
X-04	No order directory tracking. There's no convention for where stage artifacts (checklist, QA cert, deployment manifest, delivery confirmation) are stored.	4-7	Define: `production-lines/orders/{order-id}/stage-artifacts/`
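
To make the handoff proposed in X-02 tangible, here is one possible shape for generating the `stage-N-complete.md` file. It is a sketch under assumptions: the file is placed directly in the order directory from X-04, the order id is passed in by the caller, and the markdown layout (headings, checkbox syntax) is hypothetical rather than an agreed format.

```python
from pathlib import PurePosixPath


def handoff_path(order_id: str, stage: int) -> PurePosixPath:
    """Location of the stage-N-complete.md file inside the order directory."""
    return PurePosixPath("production-lines/orders") / order_id / f"stage-{stage}-complete.md"


def render_handoff(stage: int, owner: str, checklist: dict[str, bool], links: list[str]) -> str:
    """Render checklist + links as markdown; refuses while any item is open."""
    open_items = [item for item, done in checklist.items() if not done]
    if open_items:
        raise ValueError(f"Stage {stage} not complete: {open_items}")
    lines = [f"# Stage {stage} complete", f"Owner: {owner}", "", "## Checklist"]
    lines += [f"- [x] {item}" for item in checklist]
    lines += ["", "## Links"] + [f"- {link}" for link in links]
    return "\n".join(lines) + "\n"
```

Refusing to render while checklist items are open is a design choice, not a requirement from the stages: it turns the presence of the file into the signal that the next owner can start.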

Priority Summary

Must Fix Before Next Production Run (HIGH)

B-01: CLAUDE.md template
B-02: Per-request folder structure in assembly
B-03: Skill template file
B-04: Documentation templates (user guide, config guide, skill card, cheat sheet); also resolves V-05
Q-01: Test execution protocol
Q-02: Test data specification
Q-03: QA certification template
D-01: Packaging manifest
D-02: Sanitization step
X-01: All artifact templates

Should Fix (MEDIUM)

B-05, B-06, B-07, Q-04, Q-05, Q-07, D-03, D-04, D-05, D-06, V-01, V-02, V-03, X-02, X-03, X-04

Nice to Have (LOW)

B-08, B-09, B-10, B-11, B-12, Q-06, D-07, V-04, V-06

Agent-ea Quick Fixes

These fell out naturally from the audit — low-effort fixes to the reference implementation:

#	Fix	Effort	Priority
A-01	Remove `/ea-note` and `/ea-validate` phantom references from CLAUDE.md	5 min	Do now
A-02	Document the 5 extra skills found in .claude/commands/ in CLAUDE.md, or remove them	15 min	Do now
A-03	Remove bmad-v6 factory reference from CLAUDE.md	5 min	Before deploy
A-04	Strip "WORKSPACE GELE" dev freeze notice from CLAUDE.md	5 min	Before deploy
A-05	Create client documentation (user guide, config guide, skill card)	2-3 hours	Before deliver
A-06	Move dev-only skills from .claude/skills/ to a non-shipping location	10 min	Before deploy
A-07	Review .mcp.json — verify ea-apps server is needed for client	10 min	Before deploy
A-08	Clean up internal dev files (ARCHITECTURE-*.md, STRUCTURE-ANALYSIS-*.md, TODO.md, MEMORY-*.md, NEW-PROJECT-SETUP.md)	15 min	Before deploy
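
Fixes A-03 and A-04, and the sanitization step proposed in D-02, lend themselves to a simple automated scan. The following is a minimal sketch that assumes factory residue can be detected by literal marker strings; the two markers come from this report, while the function name and the case-insensitive matching are hypothetical choices.

```python
# Markers taken from A-03 and A-04; a real list would likely be longer.
FACTORY_MARKERS = ("bmad-v6", "WORKSPACE GELE")


def find_factory_residue(text: str, markers=FACTORY_MARKERS):
    """Return (line_number, marker) pairs for each marker found, 1-indexed."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        for marker in markers:
            # Case-insensitive substring match on each line.
            if marker.lower() in line.lower():
                hits.append((number, marker))
    return hits
```

An empty result on CLAUDE.md would be one piece of evidence that sanitization was done; it would not prove the absence of other internal references that the marker list does not cover.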

Conclusion

The production line's design is right; the operational tooling is what's missing. The highest-leverage fix is creating the templates directory (X-01): this single action addresses B-01, B-03, B-04, B-06, Q-03, D-04, and V-06. The second-highest-leverage fix is the sanitization/packaging step (D-01 + D-02), which prevents shipping factory internals to clients.

The line is ready for a second dry run after templates are in place. The agent-ea quick fixes (A-01 through A-08) can be done in parallel.