Model & Routing Catalog

Maintained by: Ada (Agentic Pattern Designer) + Riley (R&D Analyst) Last updated: 2026-03-29 Decision: TFD-0014 (Component Taxonomy — Category 3)

Overview

This catalog documents which LLM handles which task across all digital talent production. Model selection balances cost, quality, privacy, and latency — the right model for the right job.

Every skill shipped with a digital talent includes a model recommendation. This catalog provides the selection framework that drives those recommendations.


Available Models

Claude Family (Primary — Anthropic)

Model Tier Strengths Cost Context When to Use
Claude Opus Premium Complex reasoning, orchestration, multi-step planning, nuanced judgment $$$ 200K Orchestrators, planners, complex analysis, architecture decisions
Claude Sonnet Standard Balanced analysis, writing, code generation, standard workflows $$ 200K Most skill execution, document generation, standard analysis
Claude Haiku Economy Fast execution, template filling, validation, simple lookups $ 200K File operations, format validation, simple transformations, evaluators

Model Capability Boundaries

Capability Haiku Sonnet Opus
Template filling Excellent Excellent Overkill
Document generation Basic Excellent Excellent
Multi-step reasoning Limited Good Excellent
Architecture analysis Not recommended Good Excellent
Orchestration/planning Not recommended Acceptable Excellent
Code generation Basic Excellent Excellent
Self-evaluation Unreliable Acceptable Good (but see Principle 1)
Domain expertise Limited Good Excellent

Local Models (Privacy Routing — Future)

Model Use Case Status
Local LLM (e.g., Llama, Mistral) Sensitive data that cannot leave client infrastructure R&D — not yet evaluated
On-prem deployment Regulated industries (healthcare, finance, government) R&D — requires AI Model Expert role

Gap: No formal evaluation of local models. Deferred pending AI Model Expert role creation (see deferred decisions).


Selection Criteria

Primary Decision: Task Complexity

Is this task simple execution (template fill, file ops, validation)?
  YES → Haiku
  NO  → Does it require multi-step reasoning or orchestration?
    YES → Opus
    NO  → Sonnet (default for most skills)

Secondary Factors

Factor Guidance Override Direction
Privacy Sensitive client data → local model (when available); otherwise document risk May force model change regardless of task complexity
Cost sensitivity Client on budget tier → Sonnet ceiling, Haiku preferred Downgrade from default
Latency Interactive/streaming tasks → Haiku or Sonnet; batch OK with Opus Faster model for UX
Quality threshold Output directly faces client stakeholders → upgrade one tier Upgrade from default
Evaluation tasks Separate evaluator agent → can use Haiku with structured criteria Downgrade: evaluators don't need to generate, just judge

Cost Optimization Patterns

Pattern How It Works When to Apply
Start high, downgrade Begin with Sonnet for all skills, downgrade to Haiku after validation shows quality holds Default approach during assembly
Opus for planning only Reserve Opus for orchestrator/planner; generators use Sonnet, evaluators use Haiku Multi-agent architectures (future)
Haiku for validation Use Haiku for all verification/validation steps where criteria are explicit Anywhere quality criteria can be formalized as a checklist
Batch processing Use Opus for complex batch tasks where latency doesn't matter but quality does Architecture reviews, compliance audits

Per-Skill Assignment Guide

During assembly (Phase C), each skill gets a model recommendation. Use this mapping:

Typical Assignments by Skill Type

Skill Type Default Model Upgrade To Downgrade To
Orientation / analysis Sonnet Opus (if domain is complex)
Document generation Sonnet Opus (if high-stakes deliverable) Haiku (if template-driven)
Diagram generation Sonnet Haiku (if template + data fill)
Validation / verification Haiku Sonnet (if judgment needed)
Orchestrator / workflow Opus Sonnet (if workflow is linear)
Data transformation Haiku Sonnet (if complex mapping)
Requirements capture Sonnet Opus (if client interviews) Haiku (if form-based)
Search / lookup Haiku
Quality evaluation Haiku + criteria Sonnet (if subjective judgment)
Client-facing report Sonnet Opus (if executive audience)

Work Order Override

Clients can specify model preferences in the work order (Section 7: Model Selection). Client preferences override the defaults above when:

  • Client pays for premium tier → Opus ceiling unlocked for all skills
  • Client on budget → Sonnet ceiling, Haiku preferred
  • Client has privacy requirements → local model routing (when available)

Model Routing Architecture

Current: Static Assignment

Today, model assignment is static — set once during assembly and baked into each skill's frontmatter. The CLAUDE.md skills table documents the assignment.

# In skill frontmatter
model: sonnet

# In CLAUDE.md skills table
| Skill | Command | Input | Output | Model |
|-------|---------|-------|--------|-------|
| Orientation | /orientation | request folder | analysis.md | sonnet |

Future: Dynamic Routing (R&D)

With Agent SDK, model routing could become dynamic — the orchestrator decides which model to use based on task complexity at runtime:

Scenario Routing Decision
Simple request, known pattern Haiku generates, Haiku validates
Standard request Sonnet generates, Haiku validates
Complex request, novel domain Opus plans, Sonnet generates, Sonnet validates
High-stakes deliverable Opus plans, Opus generates, Sonnet validates

This requires Agent SDK evaluation (R&D intake logged 2026-03-29).


Update Triggers

Trigger Action
New Claude model release (e.g., Claude 5) Riley evaluates; Ada updates capability boundaries and selection criteria
Pricing change Ada updates cost tier and optimization patterns
Client requests local model Escalate to R&D; flag AI Model Expert role need
Production feedback (model underperforms) Pablo reports; Ada adjusts assignment for that skill type
New model family available Riley evaluates through R&D pipeline (TFD-0012)

References

  • Assembly guide Phase C, Step 5: production-lines/digital-talent/assembly.md
  • CLAUDE template Section 7: production-lines/digital-talent/templates/CLAUDE-template.md
  • Skill template: production-lines/digital-talent/templates/skill-template.md
  • Harness design principles: docs/superpowers/specs/2026-03-29-component-taxonomy-design.md (Section 3.2)
  • Technology radar: departments/executive/rd-analyst-riley/technology-radar.md