Seraph Development Status

Seraph is an AI guardian that remembers, watches, and acts. This page is the fastest answer to what is real on develop right now.

Legend

[x] shipped on develop
[ ] not fully shipped on develop
in-flight branch work should be tracked in open PRs, not in this file

When this file is updated on an open feature branch, it reflects the intended post-merge develop state for that branch. Until merge, the open PR and its validation are the live integration truth.

Current Snapshot

Seraph is usable today as a real guardian workspace with a browser cockpit, memory, screen awareness, proactive behavior, and a real action layer.
The live planning surface is now docs/research/ plus docs/implementation/.
Trust Boundaries, Execution Plane, and Runtime Reliability have strong foundations on develop.
The target product shape is now a power-user guardian workspace, not a village-first shell.
The guardian workspace is the only supported browser shell; the village/editor line is removed from the active repo path and should not be revived.
The workspace now exposes capability discovery, starter packs, workflow history, step records, retry-from-step recovery, parameterized replay, reload continuity, a searchable capability palette, capability preflight/autorepair, a separate Activity Ledger window, a denser operator terminal, live operator feed, saved runbook macros, and explicit continue/open-thread controls instead of leaving those as implicit operator knowledge.
The workspace window system now uses flatter terminal-style chrome with close controls, a Windows visibility menu, and per-pane hide/show state instead of only static rounded dashboard cards.
No workstream is complete yet.
Seraph is not yet the finished guardian product described in the research docs.

Docs Contract

docs/research/00-synthesis.md defines what Seraph is trying to become.
docs/research/10-competitive-benchmark.md owns the comparative judgment.
docs/research/11-superiority-program.md owns the design-level superiority program.
this file owns the fastest shipped snapshot on develop.
docs/implementation/00-master-roadmap.md owns the live 10-PR queue.
docs/implementation/08-docs-contract.md, docs/implementation/09-benchmark-status.md, and docs/implementation/10-superiority-delivery.md are the implementation-side mirrors of the research evidence/benchmark/program docs.
docs/implementation/01 through 07 remain the workstream docs; 08 through 10 are meta mirrors, not extra workstreams.

Current Focus On `develop`

The latest delivery batch is now complete for the current roadmap horizon: capability bootstrap v3, extension studio v1, workflow branching/resume v1, cockpit density v4, provider explainability/budgets v3, execution hardening v9, native-channel expansion v5, world-model fusion v9, guardian-learning policy v9, and guardian behavioral evals v9 all landed together.
The roadmap has now refreshed to a new next-10 batch rather than leaving the just-shipped batch as future work.
Guardian Intelligence remains central inside the current batch, but it is no longer the only active workstream.
Runtime Reliability now has a strong baseline on develop, but it is not fully complete.
The repo-wide 10-PR horizon is tracked in docs/implementation/00-master-roadmap.md.
The next strategic focus is now the extension-platform transition beginning with extension-model-terminology-v1, extension-manifest-schema-v1, and extension-registry-and-loader-v1, because Seraph now needs one coherent extension architecture for skills, workflows, starter packs, runbooks, and MCP connectors before deeper marketplace or managed-connector work can land cleanly.
capability-pack-autoinstall-and-bootstrap-v3, extension-authoring-and-validation-studio-v1, workflow-step-branching-and-resume-v1, cockpit-density-and-live-operator-views-v4, provider-policy-explainability-and-budgets-v3, execution-safety-hardening-v9, native-channel-expansion-v5, world-model-memory-fusion-v9, guardian-learning-policy-v9, and guardian-behavioral-evals-v9 are now represented in the shipped state this branch is preparing to merge.
The published 10-PR horizon should be refreshed whenever landed PR count from that queue is divisible by 5.

Current Target Shape

Shipped On `develop`

Core guardian platform

browser-based guardian workspace as the only supported browser shell
FastAPI backend with chat, WebSocket, goals, tools, observer, settings, audit, approvals, vault, skills, and MCP APIs
native macOS observer daemon for screen/window ingest
persistent guardian record, vector memory, sessions, and goal storage

Trust and control

tool policy modes for safe, balanced, and full
MCP policy modes for disabled, approval, and full
approval-gated high-risk actions in chat and WebSocket flows
explicit execution-boundary metadata and approval behavior surfaced for tools and reusable workflows
structured audit logging for approval, tool, and runtime events
secret redaction and scoped secret-reference handling
secret-reference resolution now stays limited to explicit injection-safe surfaces instead of resolving into arbitrary tool calls

Execution and integrations

Runtime and observability

Guardian intelligence and proactive behavior

Current interface surface

Ecosystem foundations

SKILL.md support and runtime skill loading
MCP-powered extension surface
recursive delegation foundations behind a flag
reusable workflow runtime with tool, skill, specialist, and MCP-aware gating

Still To Do On `develop`

Runtime and execution

richer provider selection policy beyond the shipped weighted scoring, required capability safeguards, tier guardrails, path patterns, explicit overrides, ordered fallbacks, and cooldown rerouting
broader eval coverage beyond the shipped REST, WebSocket, observer refresh, delivery policy, salience/confidence delivery, strategist-learning continuity, consolidation, proactive, tool/MCP guardrail, delegated workflow, and workflow-composition behavioral contracts
stronger execution isolation and privileged-path hardening beyond the first workflow/tool boundary pass
richer capability installation, recommendation, and recovery beyond the new starter-pack repair guidance, catalog-install, runbook preflight, bounded bootstrap flow, and first cockpit-native extension studio

Guardian intelligence

stronger learning and feedback loops beyond the first multi-signal delivery/channel/timing/suppression/thread layer
deeper guardian world modeling, learning loops, and stronger intervention quality beyond the new project/routine/collaborator/obligation-aware world-model layer
stronger salience calibration and confidence quality beyond the first aligned-work/high-salience pass

Interface and presence

richer cockpit density and broader keyboard/operator control beyond the first dedicated workflow-run shell
richer cross-surface continuity and broader non-browser presence beyond the new continuity snapshot, action-card continuation model, and first actionable desktop-shell/browser-native control layer
stronger explicit threading between ambient observation, workflow runs, native notifications, approvals, and deliberate interaction beyond the new shared thread metadata and continue/open-thread layer

Workflow and leverage

deeper operator-facing workflow control and workflow history beyond the new workflow-runs API, replay guardrails, timeline events, and cockpit workflow timeline
stronger extension ergonomics around reusable capabilities and workflows beyond the new cockpit operator surface, starter packs, repair flows, and runbooks

Practical Summary

Seraph already has a serious guardian core: memory, observer loop, strategy, tools, approvals, runtime audit, and deterministic evals.
The strongest current moat is guardian-oriented state plus proactive scaffolding, not the UI.
The biggest gaps against the reference systems are versioned capability distribution, deeper extension-studio ergonomics, visual workflow branch debugging, deeper execution hardening, stronger intervention learning beyond the new world-model plus timing/suppression/thread layer, and broader native reach.
The next major step is to deepen the new cockpit shell into a denser, more legible, more stateful guardian workspace without losing the existing trust and memory foundations.

Workstream View

Workstream 01: Trust Boundaries is only partially complete
Workstream 02: Execution Plane is only partially complete
Workstream 03: Runtime Reliability is only partially complete
Workstream 04: Presence And Reach is only partially complete
Workstream 05: Guardian Intelligence is only partially complete
Workstream 06: Embodied Interface is only partially complete
Workstream 07: Ecosystem And Delegation is only partially complete

Legend​

Current Snapshot​

Docs Contract​

Current Focus On develop​

Current Target Shape​

Shipped On develop​

Core guardian platform​

Trust and control​

Execution and integrations​

Runtime and observability​

Guardian intelligence and proactive behavior​

Current interface surface​

Ecosystem foundations​

Still To Do On develop​

Runtime and execution​

Guardian intelligence​

Interface and presence​

Workflow and leverage​

Practical Summary​

Workstream View​