S1-B1: Trust Boundaries

Intent

Add the minimum serious trust and control layer Seraph needs before broader autonomy work.

policy profiles for tool access
approval gates for sensitive actions
secret scoping and safer credential usage
audit log of meaningful actions and approvals
clearer isolation boundaries between planning, execution, and privileged operations

The trust-boundary foundation is now meaningfully underway.

Shipped in this batch so far:

tool policy modes for safe, balanced, and full
explicit MCP access modes for disabled, approval, and full
structured audit logging for tool calls, tool results, and approval decisions
high-risk approval gates in chat and WebSocket flows
secret egress redaction for outbound chat, step output, and surfaced errors
vault operation audit for secret store/get/list/delete actions
session-scoped secret references for safer downstream tool usage without re-exposing raw values to the model context
approval flow improvements so approved chat actions can resume automatically

Still open inside this batch:

reducing reliance on raw get_secret() retrieval in favor of narrower secret-injection paths
tighter isolation between planning, privileged execution, and future workflow/runtime layers
deeper policy distinctions inside the MCP/external execution layer beyond one global gate

poor policy design can create user friction without meaningful safety
approval UX can become annoying if the risk model is too coarse
partial trust boundaries may create a false sense of safety if messaging is sloppy