Specification

Trust Graduation Protocol

A permission protocol for agentic systems. Agents may observe, draft, and prepare work before they are trusted to create external effects. Consequential actions require action-class evidence, approval packets, receipts, and regression when behavior degrades.

Version 0.1 Status Draft Updated June 2026 License Public protocol

1.Contract¶

An implementation MUST classify a proposed action before execution.

An implementation SHOULD expose these primitive operations:

canExecute or equivalent decision function
prepareApprovalPacket
recordReceipt
recordEvidence
posterior

The trust unit is the action_class. Evidence for draft.compose MUST NOT imply permission for email.send.external.

2.Action Classes¶

Action classes use lowercase dot notation. The v0.1 canonical registry:

Action class	Type	Description
`read.context`	internal	Read, search, inspect, summarize, or rank context.
`draft.compose`	internal	Compose a new local draft.
`draft.response`	internal	Compose a reply or response draft.
`tool.call.local`	internal	Call a local tool without external effect.
`email.send.internal`	external controlled	Send to an established internal or known recipient.
`email.send.external`	external	Send to a non-established recipient or outside party.
`calendar.create`	external controlled	Create, update, or invite through a calendar system.
`social.post.public`	external	Publish to a public social channel.
`payment.initiate`	human-only	Initiate a payment, charge, purchase, or transfer.
`proposal.submit`	external	Submit a proposal, bid, application, or commitment for review.

Legacy aliases SHOULD canonicalize as follows:

Legacy identifier	Canonical identifier
`relationship_followup_drafting`	`draft.response`
`draft_response_drafting`	`draft.response`
`referral_ask_drafting`	`draft.compose`
`workspace_trust_boundary`	`draft.response`
`social.post.external`	`social.post.public`
`calendar.create.external`	`calendar.create`
`payment.spend`	`payment.initiate`

Implementations MAY add local classes, but MUST preserve the same decision, evidence, posterior, and receipt semantics.

3.Decision States¶

Decision states are:

allowed: execution may proceed.
allowed_with_constraints: execution may proceed only inside machine-evaluable constraints.
review_required: prepare an approval packet; do not execute externally.
deferred: queued for asynchronous approval; do not execute externally.
blocked: not permitted under current policy or evidence.
human_only: only the principal may perform this action.

allowed_with_constraints is a post-approval execution boundary, not a pre-approval autonomy class. The constraints scope the authority that an approval — explicit (principal sign-off) or rule-derived (graduated policy) — has already granted. Implementations MUST NOT use allowed_with_constraints to bypass the review_required or human_only gates.

3.1Constraint Vocabulary¶

allowed_with_constraints MUST use structured constraints, not prose-only instructions. The v0.1 constraint vocabulary:

internal_only
staging_only
dry_run_only
max_amount
rate_limit
recipient_allowlist
domain_allowlist
expires_at
requires_witness
redaction_rules

{
  "status": "allowed_with_constraints",
  "action_class": "email.send.internal",
  "constraints": {
    "domain_allowlist": ["company.com"],
    "rate_limit": { "count": 5, "window": "PT1H" },
    "expires_at": "2026-06-13T18:00:00Z"
  }
}

4.Evidence Model¶

Evidence has two axes:

decision_weight: the observed outcome quality.
provenance_weight: the reliability of the source that produced the evidence.

The intended evidence update is:

evidence_weight = decision_weight × provenance_weight

Mission's current runtime ships decision-quality weighting. Provenance weighting is part of the v0.1 normative evidence model and is scheduled for runtime convergence in v0.2.

4.1Decision Weights¶

Label	Weight	Quality
`sent`	1.00	strong
`approved`	0.85	strong
`minor_edit`	0.35	partial
`edited`	-0.15	ambiguous
`heavy_rewrite`	-0.55	partial
`held`	0.00	neutral
`rejected`	-1.00	strong
`dropped`	-1.00	strong

Trust violations, safety incidents, leaks, or exposed private data SHOULD carry weight -1.00 and force review until cleared by the principal.

4.2Provenance Weights¶

Source type	Weight	Description
`receipt`	1.00	A recorded approval, refusal, correction, or execution receipt.
`principal`	1.00	Direct instruction or correction from the principal.
`connector`	0.30	Data from an external connector or imported system.
`model_inferred`	0.10	Inference made by a model without direct receipt or connector grounding.

Implementations MAY learn provenance weights per workspace and action class, but MUST NOT silently upgrade model_inferred evidence to receipt-grade evidence.

5.Posterior¶

Trust Graduation computes a Beta posterior over approval/success probability per action class.

The default prior is:

alpha = 2
beta  = 2

For each non-zero evidence row:

positive evidence adds |evidence_weight| to alpha
negative evidence adds |evidence_weight| to beta

Implementations SHOULD compute mean, ci_low, ci_high, ci_width, samples, tier, and graduation_ready. The Mission reference computes the 95% credible interval with the regularized incomplete beta function and quantile bisection.

Default graduation thresholds:

Action class	`ci_low_min`	`samples_min`
default	0.80	10
`draft.compose`	0.80	10
`email.send.external`	0.92	30
`calendar.create`	0.88	20

External-effect classes MUST NOT auto-promote to ungated external execution solely from the posterior. The posterior may recommend readiness; the enforcement gate remains authoritative.

6.JSON Schemas¶

These schemas are intentionally minimal. Implementations MAY add fields. All schemas use JSON Schema draft 2020-12 and resolve at their $id URL.

action-class.schema.jsonRegistry row decision.schema.jsonGate verdict with constraint vocabulary approval-packet.schema.jsonPrepare-only review packet receipt.schema.jsonOutcome record, hash-chain ready evidence-row.schema.jsonOne observation feeding the posterior posterior.schema.jsonComputed Beta posterior

7.Hash-Chained Receipts¶

Receipt hash chaining is specified for auditability but not required for minimal v0.1 implementations.

Implementations SHOULD use RFC 8785 JSON Canonicalization Scheme before hashing receipt commitments. A receipt MAY delete or redact private content while preserving:

receipt_id
prev_hash
content_hash
redacted_for
non-sensitive metadata needed for audit continuity

Key rotation and external anchoring are deferred to v0.2. Mission Cloud may act as the first anchor store; future implementations may use a public transparency log.

8.Standards Mapping¶

Standards mapping is informational, not required for protocol conformance.

W3C PROV-O. Receipts, evidence rows, and approval packets can be represented as provenance entities, activities, and agents.
NIST AI RMF / NIST AI 600-1. Trust Graduation provides concrete governance, provenance, testing, and incident-record artifacts for agentic work.
ISO/IEC 42001. Trust Graduation receipts and risk reports can support traceability, transparency, and AI management-system evidence.
EU AI Act Article 14. Approval packets, override, stop/blocked states, and human-only classes operationalize human oversight for consequential AI-assisted actions.

9.Implementation Status¶

Mission currently ships:

canonical action-class registry
reference wrapper
approval packet pattern
receipt pattern
decision-quality weighting
Beta posterior with 95% credible intervals
prepare-before-execute public product boundary

Mission has not yet fully shipped:

unified provenance-weight × decision-weight runtime updates
MCP proxy enforcement around arbitrary third-party MCP servers
hash-chained receipt storage
external anchoring
standards appendices as public buyer collateral

← Back to homepage