Tier 2 · Design2.215 min

The human gate — and its hard limit

A fiery orange-and-pink sunset over a harbour ringed by dark hills

When an agent touches something that matters — a decision about a person, a payment, a promise — the standard advice is “keep a human in the loop.” It’s good advice. It’s also the most misunderstood safeguard in the whole field, because the comfortable version of it barely works. This lesson is about building the gate so it actually holds.

The gate, done properly

A human gate is a point in the agent’s work where it must stop and a person decides before anything happens in the world. The agent prepares — reads, drafts, ranks, flags — and then it waits. A person looks, decides, and it’s the person’s decision that goes out.

That’s the shape. The trap is in one word: decides. There’s a world of difference between a person who makes the decision and a person who signs off on the machine’s decision. They look identical on an org chart. They are not the same safeguard, and the evidence on the gap is stark.

Why “a human signs off” is weaker than it sounds

Put a person in front of an AI’s recommendation and, over and over, they adopt it — including when it’s wrong.

In a 2025 study of 528 people across more than 1,500 hiring scenarios, reviewers went with the AI’s pick up to 90% of the time, against a roughly 50% baseline when choosing for themselves. The pull persisted even when they rated the AI’s quality as low. They didn’t trust it, and they deferred to it anyway.
In another controlled study, an AI system quietly downgraded one national group of applicants by 10%. Around 60% of reviewers never noticed. The bias wasn’t hidden in code they couldn’t see — it was in the outputs in front of them, and it sailed through.
A survey of 41 real-world human-oversight policies found the same thing structurally: the policies assume people can catch the machine’s mistakes, and mostly people can’t perform the oversight the policy is counting on.

This has a name — automation bias — and it doesn’t go away because you’re clever or well-intentioned. A fast, fluent, confident recommendation is designed to be agreed with. A tired person at the end of a stack of forty will agree with it. The “human in the loop” becomes a rubber stamp that launders the machine’s call into a human one — while adding almost none of the protection everyone assumes it adds.

What separates a real gate from a rubber stamp

The law, as it happens, turns on exactly this distinction. Under Europe’s GDPR (which reaches you if you ever process an EU-based person’s data), a solely automated decision about someone is prohibited in principle — and what pulls a decision back out of that prohibition is a genuine, non-token human involvement: someone with the authority and the information to reach a different answer, not someone who clicks approve. New Zealand’s Privacy Commissioner makes the same practical point from the other side: a token human-in-the-loop may not cure the automation blindness at all. (General education, not legal advice — the specifics are unsettled and worth real advice.)

So a gate that actually protects you has three properties, and it’s Anchor 3 — you answer for it — made concrete:

The person can genuinely say no — and sometimes does. If your reviewer has never once overturned the agent, you don’t have a gate; you have a formality.
They decide from the evidence, not the verdict. The agent hands over what it found and why — not a score or a recommendation to agree with. (That’s the next lesson.) A person weighing evidence resists the pull; a person handed a verdict rubber-stamps it.
The decision is backed by testing, not a glance. You can’t eyeball your way to catching a 10% skew — 60% of people miss it. You find that kind of thing by measuring the agent’s outputs across groups, which is Tier 3. The gate and the test work together; neither carries the load alone.

The honest conclusion

For a high-stakes call about a person, “we anonymise it and a human signs off” — the defence almost everyone reaches for — is close to the exact thing the evidence says fails. That’s not a reason to despair; it’s a reason to build the gate as a real decision, put testing behind it, and accept the lesson the Recruiter will drive home: sometimes the right gate is to not let the agent make the call at all.

Think of an agent decision you’d want a person to check. Be honest: would that person have the time, the information, and the standing to actually overturn it — or would they, at the end of a busy day, click approve? What would have to change for it to be a real gate?

A real gate needs the agent to hand over evidence, not a verdict. That’s a design choice you make when you build it — criteria, not vibes.

Shared freely, in good faith. If it's been of value, a koha toward development and running costs is warmly welcomed.

Leave a koha →

← 2.1 2.3 →

The human gate — and its hard limit

The gate, done properly

Why “a human signs off” is weaker than it sounds

What separates a real gate from a rubber stamp

The honest conclusion

Next