ADR 0009: Health, Readiness, and Prometheus Observability

Ratification

Adopted before ADR 0018. There was no separate ratification process. Git history for this file on main is the record.

Context

Why this matters: Runtimes and load balancers must know whether to send traffic to an instance: liveness (“is the process alive?”) and readiness (“can this instance serve requests right now?”—including DB). Without separate probes, restarts and routing decisions are guesses.

Metrics (latency, status codes, DB time) turn “it feels slow” into graphs and alerts. We had a basic health path but needed explicit readiness and a Prometheus baseline aligned with staging/production.

Decision

Implementation

Operational Model

Consequences

Positive

Trade-offs

Related

Page history

Date Change Author
Added Page history section (repository baseline). Ivan Boyarkin