Observability Model

How SYRIS exposes audit, telemetry, health, alarms, and operator controls without relying on console logs.

If you can’t see what SYRIS is doing, you can’t safely run it 24/7. Observability is a product requirement.

Principles

Audit is the source of truth. Logs are optional; audit is required.
Everything is queryable. The dashboard should not scrape console output.
Traceability is end-to-end: inbound event → decision → tool calls → outcomes.
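
To make the traceability principle concrete, here is a minimal sketch of audit events linked by a shared trace_id. Only the names AuditEvent and trace_id come from this page. The other fields (stage, summary, event_id, at), the AuditLog class, and the in-memory storage are assumptions for illustration, not SYRIS's actual schema or API.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
import uuid


@dataclass(frozen=True)
class AuditEvent:
    # Field names other than trace_id are illustrative assumptions.
    trace_id: str   # shared by every event caused by one inbound event
    stage: str      # "inbound", "decision", "tool_call", or "outcome"
    summary: str
    event_id: str = field(default_factory=lambda: uuid.uuid4().hex)
    at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))


class AuditLog:
    """Hypothetical in-memory stand-in for a persistent, queryable audit store."""

    def __init__(self) -> None:
        self._events: list[AuditEvent] = []

    def emit(self, trace_id: str, stage: str, summary: str) -> AuditEvent:
        event = AuditEvent(trace_id=trace_id, stage=stage, summary=summary)
        self._events.append(event)
        return event

    def by_trace(self, trace_id: str) -> list[AuditEvent]:
        # Events are returned in emission order.
        return [e for e in self._events if e.trace_id == trace_id]


audit = AuditLog()
trace_id = uuid.uuid4().hex

# One inbound event and everything it caused share the same trace_id.
audit.emit(trace_id, "inbound", "message received")
audit.emit(trace_id, "decision", "routed to fast lane")
audit.emit(trace_id, "tool_call", "example_tool invoked")
audit.emit(trace_id, "outcome", "tool call succeeded")

# An unrelated event gets its own trace_id and stays out of the trace.
audit.emit(uuid.uuid4().hex, "inbound", "unrelated event")

stages = [e.stage for e in audit.by_trace(trace_id)]
```

Filtering on trace_id returns the full chain from inbound event to outcome, with no dependence on console output.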

Every meaningful operation emits an AuditEvent:

Audit must be searchable by:

Projections are query-optimized views derived from event/audit logs:

Tasks view: active tasks, step status, next wake time
Schedules view: next run, last run, missed runs
Watchers view: enabled, last tick, last outcome, suppression counters
Rules view: enabled, hit counts, suppression reasons
Integrations view: health, auth status, last success/error, rate-limit state
Approvals view: pending approvals and context
Queues/backlog view: runnable tasks, schedule backlog, tool queue depth
Autonomy view: current level and change history
Alarms view: open/acked/resolved incidents
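
As an illustration of how a projection might be derived, the sketch below folds a list of audit records into a Rules view with enabled state, hit counts, and suppression reasons. The record shape (the kind, rule, and reason keys), the event kind names, and the rule names are all hypothetical. The default of treating a rule as enabled until a disable event appears is also an assumption.

```python
from collections import Counter, defaultdict

# Hypothetical audit records; keys and kind names are assumptions.
events = [
    {"kind": "rule.hit", "rule": "night-mode"},
    {"kind": "rule.hit", "rule": "night-mode"},
    {"kind": "rule.suppressed", "rule": "night-mode", "reason": "cooldown"},
    {"kind": "rule.hit", "rule": "door-alert"},
    {"kind": "rule.disabled", "rule": "door-alert"},
]


def project_rules(events):
    """Fold audit events into one row per rule (a rebuildable read model)."""
    view = defaultdict(
        lambda: {"enabled": True, "hits": 0, "suppressions": Counter()}
    )
    for e in events:
        row = view[e["rule"]]
        if e["kind"] == "rule.hit":
            row["hits"] += 1
        elif e["kind"] == "rule.suppressed":
            row["suppressions"][e["reason"]] += 1
        elif e["kind"] == "rule.enabled":
            row["enabled"] = True
        elif e["kind"] == "rule.disabled":
            row["enabled"] = False
    return dict(view)


rules_view = project_rules(events)
```

Because the view is computed purely from the event list, it can be discarded and rebuilt at any time, which is what makes a projection a derived view and not a second source of truth.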

A persistent heartbeat record should include:

Alarms are persisted entities with:

Example alarm types:

Status:

Query:

Controls (all audited):

1. Find the trace_id for the originating event.
2. Query audit events by trace_id.
3. Observe:
   - routing decision
   - which lane was chosen (fast/task/sandbox/gated)
   - tool calls/outcomes and latency
   - any suppression or gates
4. If needed, replay the event through the pipeline in dry-run mode.

This workflow should work without reading logs.
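
The query step of this workflow can be sketched with an in-memory SQLite table. The table name audit_events, its columns, the event kind strings, and the sample rows are assumptions for illustration. This page does not specify SYRIS's storage engine or schema.

```python
import sqlite3

# Hypothetical schema: one row per audit event, indexed by trace_id.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE audit_events ("
    " id INTEGER PRIMARY KEY,"
    " trace_id TEXT NOT NULL,"
    " kind TEXT NOT NULL,"
    " detail TEXT,"
    " latency_ms INTEGER)"
)
conn.execute("CREATE INDEX idx_audit_trace ON audit_events (trace_id)")

# Invented sample data for two traces.
rows = [
    ("t-1", "routing.decision", "lane=task", None),
    ("t-1", "tool.call", "example_tool", None),
    ("t-1", "tool.outcome", "ok", 120),
    ("t-1", "gate.suppressed", "quiet hours", None),
    ("t-2", "routing.decision", "lane=fast", None),
]
conn.executemany(
    "INSERT INTO audit_events (trace_id, kind, detail, latency_ms)"
    " VALUES (?, ?, ?, ?)",
    rows,
)


def trace_timeline(conn, trace_id):
    """Return every audit event for one trace, in insertion order."""
    cur = conn.execute(
        "SELECT kind, detail, latency_ms FROM audit_events"
        " WHERE trace_id = ? ORDER BY id",
        (trace_id,),
    )
    return cur.fetchall()


timeline = trace_timeline(conn, "t-1")
for kind, detail, latency_ms in timeline:
    suffix = f" ({latency_ms} ms)" if latency_ms is not None else ""
    print(f"{kind}: {detail}{suffix}")
```

The resulting timeline shows the routing decision, the chosen lane, the tool call with its outcome and latency, and any suppression, all from a single query and without opening a log file.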
