Workstream 03: Runtime Reliability

Status On `develop`

Make Seraph more resilient, observable, and predictable under real usage.

Runtime Reliability remains the current repo-wide hardening track
close the remaining runtime observability gaps outside the main agent, scheduler/helper flows, current integration lifecycle coverage, and observer surfaces already instrumented

deepen provider routing beyond the current explicit runtime-path primary and fallback overrides, ordered fallback, and cooldown rerouting with richer policy-aware selection
broaden local-model routing beyond the current helper, scheduled completion, core agent-model, delegation, and connected MCP specialist paths into any remaining runtime paths where it makes sense
add observability coverage across any remaining edge helpers and external integration paths beyond observer refresh, calendar/git/goal/time sources, daemon ingest, proactive delivery gating, current MCP lifecycle coverage, the embedding/vector-store/soul-file/filesystem boundaries, and the browser/sandbox/web-search tool boundaries
expand eval coverage beyond the current runtime seam checks, including broader provider-routing and remaining edge-path contracts beyond the current MCP-specialist, embedding-model, vector-store, soul-file, and filesystem coverage

provider failure with configured fallbacks does not collapse the entire chat path
a local or non-OpenRouter path is demonstrably possible across helper, scheduled completion, core agent, delegation, and connected MCP specialist flows
runtime paths can force distinct primary and fallback routing without changing the global runtime baseline
key flows are observable and easier to debug
the project has repeatable eval coverage for core behavior