Architecture¶

MoiraWeave is designed around a narrow runtime and explicit ownership boundaries. It operates AI workloads without embedding customer business logic or agent internals.

Layers¶

Layer	Responsibility	Owned by
Workspace layer	Workload manifests, deployment values, artifacts, and secrets	The team using MoiraWeave
Runtime layer	API gateway, worker, control-plane storage, queueing, deployment templates, and observability	`moiraweave`
Experience layer	CLI and integrated Ops dashboard	`moiraweave-cli`, `moiraweave-ui`

Runtime Services¶

API gateway: auth, workload templates, workload registration, preflight, deployment operations, run submission, sessions, messages, events, artifacts, and health.
Worker: consumes Redis dispatch messages and calls model, pipeline, or agent executors.
Postgres: source of truth for workloads, runs, sessions, messages, events, and artifact metadata.
Redis Streams: queue and short-lived coordination layer.
Qdrant: optional vector store for RAG/search workloads.
UI: browser console for workloads, runs, agent sessions, artifacts, and deployment health.

End-to-End Run Flow¶

A user submits a workload run through CLI, UI, or API.
The API stores the run in Postgres and dispatches a message to Redis Streams.
The worker consumes the message and marks the run starting then running.
The executor calls the workload according to its type.
The worker stores events, assistant messages, artifacts, result, and final state.
UI and CLI read from the API only.

Agent Flow¶

Agent workloads use an adapter. The adapter sends a short-lived dispatch call to the agent runtime, then MoiraWeave tracks the run through stored state and events. Hermes, OpenClaw, LangGraph, or custom agents keep their own internal reasoning loop.

Agent runtimes can be placed in two ways. Managed runtimes are deployed by MoiraWeave as Docker Compose services or Kubernetes Deployments in the same network/namespace as the worker. External runtimes are not deployed by MoiraWeave; the manifest records spec.endpoint, and the adapter uses that URL.

MoiraWeave supports multiple agents by treating each runtime/profile as its own workload. A Hermes service, an OpenClaw gateway, and a custom HTTP agent can be deployed together if their manifests declare distinct names, service names, ports, and secrets. Sessions, messages, runs, events, artifacts, health, and deployment records remain scoped to the selected workload. External agents are registered as target: external deployment records so health and UI state still show where the runtime lives even when MoiraWeave does not own the process.

Observability¶

The API gateway exposes Prometheus metrics at /metrics on its HTTP service. The worker exposes a Prometheus metrics port named metrics. On Kubernetes, make helm-monitoring-install installs the monitoring chart and applies the MoiraWeave ServiceMonitor, PodMonitor, PrometheusRule, and Grafana dashboard ConfigMaps from infra/k8s/monitoring/.

The monitoring stack is intentionally separate from workload placement. Managed agent/model workloads may expose their own metrics endpoints later, but the core control-plane metrics are deployed with the platform monitoring install.

UI And CLI Boundary¶

The Ops dashboard covers API-level operations: guided workload creation, advanced manifest registration, run submission, run cancellation, live events, artifact browsing, agent sessions, agent messages, channel simulation, preflight, deployment planning, deployment record sync, and health.

The API can return a deployment plan for each workload and target, including generated files, service endpoint, and the CLI/Helm commands needed to apply it. It can also run preflight checks and record deployment operations. The CLI is still required for workspace-local actions that need filesystem, Docker, Helm, or Kubernetes credentials: moira init, moira up, Compose/Helm generation, deploy local --up, deploy k8s --apply, logs, and undeploy-style operations. The UI deliberately talks only to the API gateway and does not get direct access to Redis, Docker, Kubernetes, or local files.

Design Decisions¶

Use one workload.yaml model for Compose, Kubernetes, API validation, and worker dispatch.
Use stable workload service names so local and Kubernetes deployments resolve the same way.
Keep Postgres as the durable control plane.
Keep Redis out of durable state.
Keep UI/API as the canonical interaction surface.
Model Telegram, Slack, Discord, and webhooks as connectors into MoiraWeave, not direct agent access.