What This Covers
This page documents two adjacent capabilities that ship together:
- Decision Perspective Gate (WWMD / WWTD) — the canonical handler the platform calls whenever a coworker hits an ambiguity or an open product question. Returns one of
recommend/arbitrate/escalate/deferwith a confidence score, the principles that governed the decision, and an audit ledger entry. - Persona Voice Layer — an optional audio modality. Speech-to-text (STT) is on by default for voice input; text-to-speech (TTS) is opt-in per profile and narrates decision rationales in the persona’s voice.
The naming: WWMD is “What Would Mark Do” — the first profile, seeded for the DPF platform itself. WWTD is “What Would They Do” — the generalized model that lets a customer organization encode an executive, a domain expert, or an organizational archetype as its own profile. WWWD is the customer-organization variant of WWMD; in this DPF portal instance, WWMD and WWWD point at the same profile because the business and the product are the same thing.
The full design lives in three specs:
docs/superpowers/specs/2026-05-17-wwmd-decision-perspective-kernel-design.md— the kerneldocs/superpowers/specs/2026-05-19-wwmd-mcp-exposure-design.md— the MCP tooldocs/superpowers/specs/2026-05-19-persona-voice-layer-wwtd-design.md— voice and WWTD profile kinds
What Decision Perspective Does
When a coworker or external MCP client hits a decision point that doesn’t have a deterministic answer in code, it calls the Decision Perspective Gate. The gate selects the active profile, retrieves the principles and prior decisions that bear on the question, aggregates them as a weighted vector across decision dimensions, and returns one of four outcomes:
| Outcome | Meaning | What happens next |
|---|---|---|
recommend |
Confident direction backed by source-traced principles. | The coworker proceeds; the recommendation is logged. |
arbitrate |
Two or more credible directions exist. The gate resolves with weighted vector aggregation and returns the chosen path plus the dissenting view. | The coworker proceeds with the chosen path; the dissent is preserved in the ledger. |
escalate |
The decision needs human leadership — confidence is low, sources contradict, or the principle hierarchy doesn’t cover the question. | The work pauses; an approval surface is raised to the operator. |
defer |
The active profile lacks enough material to frame even a recommended direction. | The question is captured as a profile gap so future curation can close it. |
Every invocation writes a DecisionInteraction row recording the active profile version, the source materials cited, the confidence score, the chosen outcome, and the rationale text. This is the audit ledger — auditors and operators reconstruct “what perspective governed this decision, and on what evidence” from it.
The Confidence Model
The gate’s confidence is a governed runtime state, not a model-self-reported number:
Confidence is earned in drops and lost in buckets.
Confidence rises slowly through repeated evidence-backed alignment — a recommendation made, observed in practice, and confirmed by a human, increases the profile’s confidence for that question domain. Confidence drops fast after misses: a contradicted rationale, a stale source, or an overconfident recommendation that turned out wrong pulls the profile back. The point is to make autonomy something the platform earns, not something it claims.
The Inheritance Chain
When the active profile can’t answer a question, the gate falls back through a fixed chain:
- Active profile (WWWD or customer-specific)
- DPF product doctrine (general platform principles)
- DPF organizational principles (TAK / GAID governance layer)
defer— insufficient coverage even for a framing recommendation; capture as a profile gap
The chain is wired into the data model from day one so the fallback path is auditable: every interaction row records which level in the chain produced the answer.
Profile Kinds
The DecisionPerspectiveProfile model supports several profile kinds today. Each profile is versioned — when a profile’s materials change, a new version is snapshotted so old decision interactions still resolve against the doctrine that was active when they ran.
| Kind | Description | Example | Status |
|---|---|---|---|
platform |
DPF platform doctrine (Mark / DPF Platform). The first profile, seeded with the founder’s writings and approved decisions. | Mark / DPF Platform | Shipped — the WWMD kernel for this portal instance |
organization |
Customer organization’s operating principles (the WWWD profile). | Acme Corp Operating Principles | Shipped surface, deferred content — points at the platform profile in this instance because product and business are the same |
customer |
Future customer-instance profile, isolated from the platform’s product-origin guidance. | Customer org doctrine | Deferred — not in V1 |
persona-real |
A real person who has given explicit documented consent. The voice layer can use their voice clone. | An executive at a customer org | Shipped surface, consent-gated |
persona-fictional |
A persona not derived from any real person — useful as an archetype profile. | “The Pragmatic Founder” | Shipped |
persona-synthetic |
An AI-synthesized persona built from curated training data with no real-person basis. | Industry archetype seeded by DPF | Shipped |
The non-negotiable boundary: a customer profile must not inherit platform-specific business judgment as authority by default. DPF product doctrine can be advisory product guidance for any profile; the customer’s own WWWD profile becomes authoritative for its business context once that profile exists.
Calling the Gate
In-product coworkers call the gate through the platform’s internal handler — they don’t need to know it exists. The gate is also exposed as an MCP tool so external clients (Claude Code, Claude Desktop, Codex CLI, custom orchestrators) face the same gate under the same governance rules.
An operator can also invoke the gate manually from the Decision Perspective Gate Panel in Build Studio. A typical use is reviewing an automated arbitrate outcome before letting the plan advance — the dissenting view is shown alongside the chosen path so the operator decides whether the resolution holds up.
Tool grants and HITL tiers apply: the gate is callable, but high-risk outcomes (escalate) raise a human approval surface even when invoked autonomously.
The Voice Layer
A persona has three independently configurable layers. None requires the others.
| Layer | What it is | Required for WWTD |
|---|---|---|
| Decision materials | The principles, decisions, and writings that encode how this person thinks. Stored as PerspectiveMaterial records linked to the profile. This is what the gate evaluates. |
Yes |
| Generation style | A persona prompt that shapes how the LLM expresses the rationale text — phrasing, cadence, vocabulary. Doesn’t change the decision logic. | Recommended |
| Voice timbre | A cloned voice that narrates the synthesized rationale audio. | Optional; opt-in per profile |
Materials alone produce a fully functional text-only gate. Adding generation style produces styled text rationales. Adding a voice timbre produces audio narration of the styled text. The decision logic is identical across all three configurations.
Speech-to-Text (input)
STT is on by default on every install. The bundled service is speaches — a local Docker service at dpf-stt:9000 running faster-whisper / distil-whisper. No GPU is required; speaches runs CPU-friendly. A 3-tier hardware ladder is available for installs that want to upgrade:
| Tier | Backend | When to use |
|---|---|---|
| CPU (default) | speaches on CPU |
Every install gets this. Adequate for normal admin and coworker dictation. |
| GPU (upgrade) | speaches with the CUDA image (DPF_STT_IMAGE env var) |
Faster, lower-latency transcription on hosts with an NVIDIA GPU. |
| Hosted | Groq / Deepgram / AssemblyAI / OpenAI Whisper | Customer-supplied fallback when local STT isn’t desired, or when local hardware is constrained. |
The mic button is wired into the coworker chat surface; transcripts feed the existing coworker message pipeline. Errors surface inline rather than silently failing.
Text-to-Speech (output)
TTS narrates decision rationales returned by the gate. Synthesis runs asynchronously after the gate writes the DecisionInteraction row — a TTS provider outage cannot block plan advancement, because audio is always enrichment and text is always the primary output.
Provider options:
| Tier | Provider | GPU required | Notes |
|---|---|---|---|
| Cloud (default) | Cartesia Sonic 3 | None | Lowest first-audio latency (~90ms), 3-second minimum sample for voice cloning, streaming-native, professional voice clones without contacting sales. |
| Quality / self-hosted | Fish Audio S2 | 1× RTX 4090 (24GB VRAM) | Highest measured naturalness, enterprise RBAC built in, self-hostable via the open-source repo for customers with data-residency requirements. |
| Fallback | ElevenLabs / XTTS v2 (Coqui) | None (ElevenLabs) / 1× RTX 4090 (XTTS) | Stability fallback for non-real-time pre-rendering, or fully self-hosted budget option. |
The default deployment path adds zero hardware cost beyond the existing platform — cloud TTS is API-only. GPU is only relevant when a customer specifically requires on-premises voice synthesis.
Voice Profile Admin
Voice profiles are managed in the platform’s wiki area at the persona admin page (/wiki/personas/[id]/voice). Operator workflow:
- Upload audio or video samples on the profile admin page
- The platform extracts audio from video (FFmpeg, server-side)
- Optional vocal isolation removes background noise, music, and other speakers
- Consent capture — for any
persona-realprofile, a structured consent record must be created and confirmed before training runs. This is a non-negotiable gate - The provider training API is called with the processed audio
- A
VoiceTrainingJobrow tracks status:pending→processing→ready→failed - On
ready,VoiceProfile.providerVoiceIdis set andvoiceEnabledcan be turned on
Re-training is allowed when sample quality improves or the existing voice degrades. Each training run creates a new VoiceProfile version; the profile points at the current active version.
What an Operator Sees Today
- Build Studio Decision Perspective Gate Panel — the primary surface; shows the active profile, the gate outcome, the rationale text (with audio player when voice is enabled), the cited materials, and any dissenting view
- Decision Canvas — a read surface for a single
DecisionInteraction; shows the question, options, recommendation, confidence, material pulls, evidence sources, and audit identifiers without exposing internal tool names in the default view - Material Backlinks — a bounded local neighborhood for the cited principle or profile material; shows related stances and heuristics, citations, and prior decisions when the material has been approved and promoted
- Founder / owner review queue — unresolved decisions grouped by human-readable gap reason. WWMD decisions use founder-review wording; WWWD and custom profiles use owner/operator wording.
/wiki/personas/[id]— profile detail with materials, generation style, and voice configuration/wiki/personas/[id]/voice— voice training, consent record, training job status, provider voice IDDecisionInteractionledger — every gate invocation; queryable for “what did the gate decide for this build, and on what basis?”- Operations Map — decision-pressure overlays appear alongside cost-pressure overlays so an operator can see when escalation rates rise on a particular profile
Canvas, Backlinks, and Capture
The decision is the primary artifact. A Decision Canvas is the operator-readable projection of a recorded decision: what was asked, which options were available, what the active profile recommended or deferred, which materials pulled the result, and what action comes next.
A material backlink is supporting context, not a second decision. It answers “what else is connected to this cited principle or material?” by showing related stances, heuristics, citations, and prior decisions. Draft or candidate material is held as review material and is not presented as active doctrine.
Research capture is proposal-only. Captured notes, web clips, or Markdown become draft source/material candidates for an explicit target profile. They do not become published wiki pages, promoted profile material, or operational doctrine until reviewed.
Use founder review when a WWMD platform decision needs a missing principle, a founder judgment call, or a platform-governance resolution. Use owner/operator review when a WWWD or custom profile needs organization-local policy, customer-commitment guidance, or an accountable business owner.
Boundaries
Three constraints are non-negotiable:
-
Voice is a presentation layer. It does not change the decision logic, the confidence model, the escalation rules, or the governance authority. A WWTD profile with a celebrity voice clone follows the same
recommend / arbitrate / escalate / deferrules as a text-only profile. Voice makes the output richer; it does not make the profile more authoritative. -
Consent is required for any
persona-realvoice. A structured consent record is stored before any training job can run. Without that record, the training pipeline refuses to start. This is enforced at the data layer, not the UI layer. -
Customer profiles do not inherit platform-specific business judgment as authority. Mark / DPF Platform doctrine can be advisory product guidance for any profile, but a customer’s WWWD profile is authoritative for its own business context once it exists.
What’s In Progress
Tracked under EP-WWMD and EP-VOICE-LAYER:
- Customer WWWD profiles — the surface exists; the data and onboarding flow that lets a customer organization seed its own profile is the next deliverable
- Real-time voice conversation — live back-and-forth using streaming TTS in a sub-400ms loop fed by the existing STT pipeline. V2 target
- Voice in coworker thread messages — narrating coworker replies beyond gate rationale audio. V2
- Standalone “Ask WWMD by voice” — a dedicated voice surface to query the kernel directly without going through a Build Studio gate
- Streaming partial transcripts — the STT path currently posts full transcripts; partials are deferred to a later voice slice
- GPU auto-detection for STT — the CUDA upgrade currently requires an
.envchange; auto-detection of the host GPU is tracked as Slice 1.6
Related Routes
- Build Studio → Decision Perspective Gate Panel — primary surface for the gate
/wiki/personas/[id]— profile detail/wiki/personas/[id]/voice— voice training and consent/platform/ai/operations— Operations Map with decision-pressure overlays- MCP — the
decisionPerspective.invoketool exposes the gate to external clients under the same grant model
Related Specs
docs/superpowers/specs/2026-05-17-wwmd-decision-perspective-kernel-design.md— the kernel design (V1 WWMD)docs/superpowers/specs/2026-05-19-wwmd-mcp-exposure-design.md— the MCP tool surfacedocs/superpowers/specs/2026-05-19-persona-voice-layer-wwtd-design.md— voice layer + WWTD profile kindsdocs/superpowers/specs/2026-05-16-voice-input-and-transcription-design.md— STT designdocs/superpowers/specs/2026-05-17-voice-input-slice-1-5-default-on-cpu.md— CPU-default STT + the 3-tier hardware ladder