Model & Routing Catalog

Maintained by: Ada (Agentic Pattern Designer) + Riley (R&D Analyst) Last updated: 2026-03-29 Decision: TFD-0014 (Component Taxonomy — Category 3)

Overview

This catalog documents which LLM handles which task across all digital talent production. Model selection balances cost, quality, privacy, and latency — the right model for the right job.

Every skill shipped with a digital talent includes a model recommendation. This catalog provides the selection framework that drives those recommendations.

Available Models

Claude Family (Primary — Anthropic)

Model	Tier	Strengths	Cost	Context	When to Use
Claude Opus	Premium	Complex reasoning, orchestration, multi-step planning, nuanced judgment	$$$	200K	Orchestrators, planners, complex analysis, architecture decisions
Claude Sonnet	Standard	Balanced analysis, writing, code generation, standard workflows	$$	200K	Most skill execution, document generation, standard analysis
Claude Haiku	Economy	Fast execution, template filling, validation, simple lookups	$	200K	File operations, format validation, simple transformations, evaluators

Model Capability Boundaries

Capability	Haiku	Sonnet	Opus
Template filling	Excellent	Excellent	Overkill
Document generation	Basic	Excellent	Excellent
Multi-step reasoning	Limited	Good	Excellent
Architecture analysis	Not recommended	Good	Excellent
Orchestration/planning	Not recommended	Acceptable	Excellent
Code generation	Basic	Excellent	Excellent
Self-evaluation	Unreliable	Acceptable	Good (but see Principle 1)
Domain expertise	Limited	Good	Excellent

Local Models (Privacy Routing — Future)

Model	Use Case	Status
Local LLM (e.g., Llama, Mistral)	Sensitive data that cannot leave client infrastructure	R&D — not yet evaluated
On-prem deployment	Regulated industries (healthcare, finance, government)	R&D — requires AI Model Expert role

Gap: No formal evaluation of local models. Deferred pending AI Model Expert role creation (see deferred decisions).

Selection Criteria

Primary Decision: Task Complexity

Is this task simple execution (template fill, file ops, validation)?
  YES → Haiku
  NO  → Does it require multi-step reasoning or orchestration?
    YES → Opus
    NO  → Sonnet (default for most skills)

Secondary Factors

Factor	Guidance	Override Direction
Privacy	Sensitive client data → local model (when available); otherwise document risk	May force model change regardless of task complexity
Cost sensitivity	Client on budget tier → Sonnet ceiling, Haiku preferred	Downgrade from default
Latency	Interactive/streaming tasks → Haiku or Sonnet; batch OK with Opus	Faster model for UX
Quality threshold	Output directly faces client stakeholders → upgrade one tier	Upgrade from default
Evaluation tasks	Separate evaluator agent → can use Haiku with structured criteria	Downgrade: evaluators don't need to generate, just judge

Cost Optimization Patterns

Pattern	How It Works	When to Apply
Start high, downgrade	Begin with Sonnet for all skills, downgrade to Haiku after validation shows quality holds	Default approach during assembly
Opus for planning only	Reserve Opus for orchestrator/planner; generators use Sonnet, evaluators use Haiku	Multi-agent architectures (future)
Haiku for validation	Use Haiku for all verification/validation steps where criteria are explicit	Anywhere quality criteria can be formalized as a checklist
Batch processing	Use Opus for complex batch tasks where latency doesn't matter but quality does	Architecture reviews, compliance audits

Per-Skill Assignment Guide

During assembly (Phase C), each skill gets a model recommendation. Use this mapping:

Typical Assignments by Skill Type

Skill Type	Default Model	Upgrade To	Downgrade To
Orientation / analysis	Sonnet	Opus (if domain is complex)	—
Document generation	Sonnet	Opus (if high-stakes deliverable)	Haiku (if template-driven)
Diagram generation	Sonnet	—	Haiku (if template + data fill)
Validation / verification	Haiku	Sonnet (if judgment needed)	—
Orchestrator / workflow	Opus	—	Sonnet (if workflow is linear)
Data transformation	Haiku	Sonnet (if complex mapping)	—
Requirements capture	Sonnet	Opus (if client interviews)	Haiku (if form-based)
Search / lookup	Haiku	—	—
Quality evaluation	Haiku + criteria	Sonnet (if subjective judgment)	—
Client-facing report	Sonnet	Opus (if executive audience)	—

Work Order Override

Clients can specify model preferences in the work order (Section 7: Model Selection). Client preferences override the defaults above when:

Client pays for premium tier → Opus ceiling unlocked for all skills
Client on budget → Sonnet ceiling, Haiku preferred
Client has privacy requirements → local model routing (when available)

Model Routing Architecture

Current: Static Assignment

Today, model assignment is static — set once during assembly and baked into each skill's frontmatter. The CLAUDE.md skills table documents the assignment.

# In skill frontmatter
model: sonnet

# In CLAUDE.md skills table
| Skill | Command | Input | Output | Model |
|-------|---------|-------|--------|-------|
| Orientation | /orientation | request folder | analysis.md | sonnet |

Future: Dynamic Routing (R&D)

With Agent SDK, model routing could become dynamic — the orchestrator decides which model to use based on task complexity at runtime:

Scenario	Routing Decision
Simple request, known pattern	Haiku generates, Haiku validates
Standard request	Sonnet generates, Haiku validates
Complex request, novel domain	Opus plans, Sonnet generates, Sonnet validates
High-stakes deliverable	Opus plans, Opus generates, Sonnet validates

This requires Agent SDK evaluation (R&D intake logged 2026-03-29).

Update Triggers

Trigger	Action
New Claude model release (e.g., Claude 5)	Riley evaluates; Ada updates capability boundaries and selection criteria
Pricing change	Ada updates cost tier and optimization patterns
Client requests local model	Escalate to R&D; flag AI Model Expert role need
Production feedback (model underperforms)	Pablo reports; Ada adjusts assignment for that skill type
New model family available	Riley evaluates through R&D pipeline (TFD-0012)

References

Assembly guide Phase C, Step 5: production-lines/digital-talent/assembly.md
CLAUDE template Section 7: production-lines/digital-talent/templates/CLAUDE-template.md
Skill template: production-lines/digital-talent/templates/skill-template.md
Harness design principles: docs/superpowers/specs/2026-03-29-component-taxonomy-design.md (Section 3.2)
Technology radar: departments/executive/rd-analyst-riley/technology-radar.md