Design Space Map
Design Space Map — Morphological Analysis
This table lists every design dimension and its candidate options. The Chosen column records the option selected for each dimension.
| # | Design Dimension | Option A | Option B | Option C | Option D | Chosen |
|---|---|---|---|---|---|---|
| 1 | IR as Compilation Target | Direct Authoring → Runtime | Authoring → flowJson → Runtime | Authoring → IR → Multi-Target | Authoring → IR → Single-Target | C |
| 2 | Architecture Layers | Two Layers (IR → Runtime+Pipecat) | Three Layers (IR → Runtime → Pipecat) | Four Layers (IR → Policy → Runtime → Pipecat) | Pipecat + Controller Overlay | B |
| 3 | Agent Communication | Single report_observation | Multiple Specialized Functions | Free-Text + Runtime Parsing | Structured Output (JSON mode) | A |
| 4 | Evidence Signal Model | Discrete Signal Kinds Only | Continuous Confidence Only | Discrete + Confidence + Process Taxonomy | Rubric-Level Judgments | C |
| 5 | Policy Expression | Structured Data Objects | Code (Python/TS) | Prompt Instructions | Rule Engine (DSL) | A |
| 6 | State Management | Explicit State Machines Only | Event Sourcing Only | Hybrid (Explicit SM + Event Sourcing) | Reactive State | C |
| 7 | Agent Autonomy Gradient | Two Levels | Three Levels (Autonomous/Advisory/Controlled) | Five Levels | Full Autonomy + Post-Hoc | B |
| 8 | Event Protocol | Push Only | Pull Only | Event Sourcing Only | Hybrid (Push + Event Store) | D |
| 9 | Evidence Output | Separate EvidenceLedger | Embedded in Transcript | Post-Hoc Extraction | Hybrid (Real-Time + Enrichment) | A |
| 10 | Recovery Strategy | Fully Automated | Human-in-the-Loop | Categorized (Technical/Assessment) | Graduated (Severity-Based) | C |
| 11 | Transition Authority | Runtime Only | LLM Proposes + Runtime Approves | LLM Decides | Runtime + LLM Delay | B |
| 12 | Context Management | RESET Only | APPEND Only | Hybrid (RESET + Summary) | Policy-Driven (RESET default) | D |
Design Configuration Summary
The chosen options, listed in dimension order (1 to 12), define a single point in the design space:
{C, B, A, C, A, C, B, D, A, C, B, D}
This configuration is characterized by:
- Determinism over flexibility: Policies are structured data, transitions are runtime-approved, evidence is validated before recording
- Separation of concerns: The IR, Runtime, and Pipecat layers are cleanly separated, and evidence collection is kept apart from evidence evaluation
- Defense in depth: Multiple layers of constraint (IR policies, runtime guardrails, output validation, context policy)
- LLM as sensor, Runtime as judge: The LLM observes and reports; the Runtime validates and enforces
- Auditability as a first-class concern: Events, evidence signals, and policy decisions are all recorded with provenance
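
The "validated before recording" and "recorded with provenance" properties can be illustrated with a minimal Python sketch. Everything except the `EvidenceLedger` name is an assumption made for illustration: the signal kinds, the field names, and the validation rules are hypothetical and are not taken from the actual schema.

```python
from dataclasses import dataclass, field
from typing import Optional

# Hypothetical signal kinds; the real taxonomy is not listed in this section.
ALLOWED_KINDS = {"claim", "demonstration"}


@dataclass(frozen=True)
class EvidenceSignal:
    kind: str          # discrete signal kind
    confidence: float  # LLM-reported confidence, expected in [0, 1]
    source_turn: int   # provenance: transcript turn the observation came from


def validate(signal: EvidenceSignal) -> Optional[str]:
    """Return a rejection reason, or None if the signal is acceptable."""
    if signal.kind not in ALLOWED_KINDS:
        return "unknown signal kind"
    if not 0.0 <= signal.confidence <= 1.0:
        return "confidence out of range"
    return None


@dataclass
class EvidenceLedger:
    entries: list = field(default_factory=list)    # accepted signals only
    audit_log: list = field(default_factory=list)  # every decision, accepted or not

    def record(self, signal: EvidenceSignal) -> bool:
        """Validate first; append to the ledger only on success; always log the decision."""
        reason = validate(signal)
        accepted = reason is None
        self.audit_log.append({"signal": signal, "accepted": accepted, "reason": reason})
        if accepted:
            self.entries.append(signal)
        return accepted
```

In this sketch the LLM only supplies the signal. The ledger decides whether it is stored, and a rejected signal still leaves a trace in the audit log.
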
Cross-Dimension Dependencies
| Dimension Pair | Dependency | Nature |
|---|---|---|
| 1 (IR) → 2 (Layers) | IR enables three-layer architecture | IR provides the semantic richness that justifies a separate Runtime Controller |
| 2 (Layers) → 7 (Autonomy) | Three layers enable three autonomy levels | The Runtime Controller layer is the enforcement mechanism for the advisory/controlled boundary |
| 3 (Communication) → 11 (Transition) | Single function bundles transition proposals | evidenceSufficient in report_observation is the LLM’s transition proposal |
| 4 (Evidence) → 9 (Evidence Output) | Rich taxonomy requires first-class ledger | The eight signal kinds and provenance fields require a dedicated storage mechanism |
| 5 (Policy) → 6 (State) | Structured policies drive state transitions | CompletionPolicy and TransitionPolicy are evaluated by the state machine |
| 7 (Autonomy) → 11 (Transition) | Advisory level enables LLM proposals | The LLM’s evidenceSufficient signal is an advisory action |
| 8 (Events) → 9 (Evidence Output) | Event protocol delivers evidence signals | evidence_signal events flow through the event protocol to the ledger |
| 10 (Recovery) → 12 (Context) | Recovery affects context management | After recovery, context may need to be reconstructed from events |
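
The dependencies among dimensions 3, 5, 7, and 11 can be sketched in Python as follows: the LLM's `evidenceSufficient` flag acts as an advisory proposal, and the runtime approves it only if a structured policy agrees. This is a minimal sketch under stated assumptions. The fields of `CompletionPolicy` and of the observation payload (other than the `evidenceSufficient` flag) are hypothetical, as is the `approve_transition` helper.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class CompletionPolicy:
    # Hypothetical fields: the policy is plain structured data, not code or prompt text.
    min_signals: int
    min_confidence: float


@dataclass(frozen=True)
class Observation:
    # Assumed shape of a single report_observation payload.
    confidences: Tuple[float, ...]  # confidence of each signal reported so far
    evidence_sufficient: bool       # mirrors evidenceSufficient: the LLM's proposal


def approve_transition(obs: Observation, policy: CompletionPolicy) -> bool:
    """Runtime decision: the LLM proposes, the policy decides."""
    if not obs.evidence_sufficient:
        return False  # no proposal, so nothing to approve
    strong = [c for c in obs.confidences if c >= policy.min_confidence]
    return len(strong) >= policy.min_signals
```

Here the proposal is necessary but not sufficient: the runtime never transitions without one, and it can still refuse when the policy thresholds are not met.
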
Rejected Combinations
| Combination | Why Rejected |
|---|---|
| {A, _, _, _, _, _, _, _, _, _, _, _} | No IR = no versioning, no multi-target, no compile-time validation |
| {_, _, _, _, C, _, _, _, _, _, _, _} | Prompt-based policies are not enforceable (Greshake et al., 2023) |
| {_, _, _, _, _, _, _, _, _, _, C, _} | LLM-controlled transitions threaten fairness and reliability (Joughin, 1998) |
| {_, _, C, _, _, _, _, _, _, _, _, _} | Free-text parsing is unreliable for high-stakes assessment |
| {_, _, _, _, _, _, D, _, _, _, _, _} | Full LLM autonomy without runtime guardrails violates safety requirements |
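
Reading `_` as a wildcard, checking a configuration against the rejected combinations comes down to simple pattern matching. The sketch below encodes each rejection by its dimension number and option letter from the design space table (for example, Free-Text + Runtime Parsing is dimension 3, option C). The helper names `pattern`, `matches`, and `is_rejected` are illustrative and not part of any existing tooling.

```python
DIMENSIONS = 12

# Chosen option per dimension, in order 1..12.
CHOSEN = ("C", "B", "A", "C", "A", "C", "B", "D", "A", "C", "B", "D")


def pattern(position: int, option: str) -> tuple:
    """Build a wildcard pattern that fixes one dimension (1-indexed) to one option."""
    return tuple(option if i == position else "_" for i in range(1, DIMENSIONS + 1))


REJECTED = [
    pattern(1, "A"),   # Direct Authoring -> Runtime (no IR)
    pattern(5, "C"),   # Prompt Instructions as policy
    pattern(11, "C"),  # LLM Decides transitions
    pattern(3, "C"),   # Free-Text + Runtime Parsing
    pattern(7, "D"),   # Full Autonomy + Post-Hoc
]


def matches(config: tuple, pat: tuple) -> bool:
    """A config matches when every non-wildcard slot of the pattern agrees with it."""
    return all(p == "_" or p == c for c, p in zip(config, pat))


def is_rejected(config: tuple) -> bool:
    """True if the configuration matches any rejected pattern."""
    return any(matches(config, pat) for pat in REJECTED)
```

For example, `is_rejected(CHOSEN)` evaluates to `False`, while swapping dimension 5 to option C (prompt-based policies) yields a rejected configuration.
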
Interactive Visualization
For the full interactive design space visualization with QOC evaluation matrices, dependency graphs, and cross-cutting themes, see the Design Alternatives section.