← Kill Switches, Circuit Breakers, and Canary Tokens Checkpoints and Rollback →

Human-in-the-Loop: Propose-Then-Commit

> The 2026 consensus on HITL is specific. It is not "the agent asks, the user clicks Approve." It is propose-then-commit: the proposed action is persisted to a durable store with an idempotency key; surfaced to a reviewer with intent, data lineage, permissions touched, blast radius, and a rollback plan; committed only after positive acknowledgement; verified after execution to confirm the side effect actually happened. LangGraph's interrupt() plus PostgreSQL checkpointing, Microsoft Agent Framework's RequestInfoEvent, and Cloudflare's waitForApproval() all implement the same shape. The canonical failure mode is the rubber-stamp approval: "Approve?" is clicked without review. The documented mitigation is challenge-and-response with an explicit checklist.

Type: Learn

Languages: Python (stdlib, propose-then-commit state machine with idempotency)

Prerequisites: Phase 15 · 12 (Durable execution), Phase 15 · 14 (Tripwires)

Time: ~60 minutes

The Problem

An agent takes an action. The user has to decide: approve or not. If the decision is instant, it is probably not a review. If the decision is structured, it is slow but trustworthy. The engineering question is how to make a structured review the path of least resistance.

The 2023-era HITL pattern was a synchronous prompt: "Agent wants to send email to X with body Y — approve?" The user clicks Approve. Everyone feels the system is safe. In practice this surface is heavily rubber-stamped: users approve fast, approvals predict little, and when the agent goes wrong, the audit trail shows a long history of approvals the user cannot recall.

The 2026 pattern — propose-then-commit — moves HITL onto a durable substrate, attaches structured metadata, and requires positive commit. Every managed agent SDK ships a version: LangGraph interrupt(), Microsoft Agent Framework RequestInfoEvent, Cloudflare waitForApproval(). The API names differ; the shape does not.

The Concept

The propose-then-commit state machine

Propose. Agent produces a proposed action. Persisted to a durable store (PostgreSQL, Redis, Durable Object). Includes:

- intent (why is the agent doing this)

- data lineage (what source led to this proposal)

- permissions touched (which scopes / files / endpoints)

- blast radius (what is the worst case)

- rollback plan (if committed, how do we undo it)

- idempotency key (unique per proposal; resubmission returns the same record)

Surface. Reviewer sees the proposal with all metadata. The reviewer is a person (not the agent reviewing itself).
Commit. Positive acknowledgement. The action executes.
Verify. After execution, the side effect is read back and confirmed. If the verify step fails, the system is in a known bad state and alerting engages.

The idempotency key

Without an idempotency key, a retry after a transient failure can double-execute an approved action. Concrete example: user approves "transfer $100 from A to B." Network blips. Workflow retries. The user has approved once but the transfer executes twice. The idempotency key ties the approval to a single, unique side effect; the second execution is a no-op.

This is the same idempotency pattern Stripe and AWS APIs use. Reusing it for agent approvals is explicit in the Microsoft Agent Framework docs.

Durability: why approvals outlast processes

The approval waiting room is a piece of state the agent does not own. The workflow is paused (Lesson 12). When the approval arrives, the workflow resumes from exactly that point. This is why LangGraph pairs interrupt() with PostgreSQL checkpointing and not just in-memory state — an approval two days later still finds the workflow intact.

Rubber-stamp approvals and the challenge-and-response mitigation

The default UI for HITL ("Approve" / "Reject" buttons) produces fast approvals with no genuine review. Documented mitigation: a challenge-and-response checklist that requires positive answers to specific questions before the Approve button is enabled. Concrete shape:

"Do you understand what resource this touches? [ ]"
"Have you verified the blast radius is acceptable? [ ]"
"Do you have a rollback plan if this fails? [ ]"

Not bureaucracy for its own sake — a forcing function. The reviewer who cannot tick the boxes either asks for clarification (escalation) or declines (safe default). The Anthropic agent-safety research explicitly cites checklist-driven HITL as a mitigation for rubber-stamp approval patterns.

What counts as consequential

Not every action needs propose-then-commit. The 2026 guidance:

Consequential actions (always HITL): irreversible writes, financial transactions, outbound communication, production database changes, destructive file-system operations.
Reversible actions (sometimes HITL): edits to local files, staging-env changes, reversible writes with clear rollback.
Reads and inspections (never HITL): reading a file, listing resources, calling a read-only API.

Post-action verification

"The commit ran" is not the same as "the side effect happened." Network-partition and race conditions can produce a workflow that thinks it succeeded while the backend did not persist. The verify step re-reads the target resource after commit to confirm. This is the same pattern as database transactions with RETURNING clauses or AWS GetObject after PutObject.

EU AI Act Article 14

Article 14 mandates effective human oversight for high-risk AI systems in the EU. "Effective" is not decorative. Regulatory language specifically excludes rubber-stamp patterns. Propose-then-commit with challenge-and-response is the shape that survives Article 14 scrutiny in the Microsoft Agent Governance Toolkit compliance docs.

Use It

code/main.py implements a propose-then-commit state machine in stdlib Python. Durable store is a JSON file. Idempotency key is a hash of (thread_id, action_signature). The driver simulates three cases: a clean approval flow, a retry after transient failure (which must not double-execute), and a rubber-stamp default versus a challenge-and-response flow.

Ship It

outputs/skill-hitl-design.md reviews a proposed HITL workflow for propose-then-commit shape and flags missing metadata, idempotency, verification, or challenge-and-response layers.

Exercises

Run code/main.py. Confirm that a retry of an approved proposal uses the durable record and does not re-execute. Now change the idempotency key to include a timestamp and show the retry double-executes.

Extend the proposal record with a rollback field. Simulate an execution whose verify step fails. Show the rollback firing automatically.

Read Microsoft Agent Framework's RequestInfoEvent docs. Identify one metadata field the API includes that the toy engine is missing. Add it and explain what it protects against.

Design a challenge-and-response checklist for a specific action (e.g., "post to a public Twitter account"). What three questions must the reviewer answer? Why those three?

Pick one case where a synchronous "Approve?" prompt would be sufficient (no durable store needed). Explain why, and name the risk class you are accepting.

Key Terms

Term	What people say	What it actually means
Propose-then-commit	"Two-phase approval"	Persisted proposal + positive commit + verify
Idempotency key	"Retry-safe token"	Unique per proposal; second execution no-ops
Data lineage	"Where it came from"	The specific source content that led to the proposal
Blast radius	"Worst case"	Scope of effect if the action goes wrong
Rubber-stamp	"Fast approval"	"Approve" clicked without genuine review
Challenge-and-response	"Forcing checklist"	Reviewer must positively acknowledge specific questions
RequestInfoEvent	"MS Agent Framework primitive"	Durable HITL request with structured metadata
`interrupt()` / `waitForApproval()`	"Framework primitives"	LangGraph / Cloudflare equivalents of the same shape