Trust Calibration

Trust in an agent can miss in either direction. A citizen might not trust using one that would have served them well, or lean on one in a high-stakes case it is not equipped to handle. Both sides need a basis for judging whether an agent’s output can be relied on for the decision.

A citizen needs something legible to judge an agent by, and an agency needs to supply a dependable basis for that judgment. The same miscalibration runs inside government, where an officer can over-rely on an agent’s output or distrust one that would have caught the error.

01 Policy challenge

Increased use of AI agents is happening faster than citizens have any settled basis for judging when to rely on one. The gap runs both ways: people withhold trust from an agent that would serve them well, and people lean on an agent in high-stakes interactions it cannot actually be trusted to handle. Automated systems run without calibrated trust or meaningful oversight have already produced serious public harm.

As deployment widens across services, the absence of a shared, legible basis for calibrating reliance becomes the condition citizens meet by default.

02 Design challenge

Let a citizen match their reliance on a government agent to the evidence for it.

Provide the means to read what an interaction involves and how much of it is automated.

For complex or high-stakes interactions, build a repeatable, transparent basis for trust.

Let delegation expand on demonstrated use and reverse the moment a citizen wants it gone.

Keep a path open for people who can't interpret the trust signals themselves.

Patterns in this territory

7 shown

4.1

Confidence and uncertainty surfacing

An agent output annotated with a confidence band and an action-tied caveat, so the citizen sees both how sure the agent is and what happens next.

Live surface

4.2

Trust marks and certification interfaces

A service-page trust mark whose every claim is a tap-through to the algorithmic transparency record, audit report, or bias-testing result that substantiates it.

4.3

Graduated delegation

An autonomy dial with a ladder of delegation levels (informational, guided, supervised, delegated) that exposes one level at a time with a conservative default, a per-level escape hatch, and capability boundaries the citizen cannot be silently moved past.

4.4

Transparency-by-default disclosure

An interaction that opens by stating it is AI, links to a two-tier transparency record (a public-summary tier and a technical-detail tier), and raises real-time moments ("I am about to submit this form on your behalf; this cannot be undone") at the points that matter.

4.5

Automation level communication

A consent-style screen that labels the interaction with one of three plain automation levels and, before any consequential action, shows what the agent will do and what data it will access.

4.6

Human-in-the-loop oversight signaling

An interaction that states, at each stage, which named human role will review the outcome and how to reach a person, with the override pathway no harder than accepting the agent's output.

4.7

Earned trust and reversible delegation

A delegation control where successful use can expand scope and a single action contracts it instantly, with the non-agent channel always available at equal service quality.