The Five Gates Where AI Delivery Actually Stalls

Generative tooling made the build fast, but it did nothing to the conditions on your side of the table that decide whether that speed ever reaches production.

We see the same pattern in almost every AI delivery conversation we walk into. The client has invested in generative capability, the engineering team is producing more than it ever has, and the program is not shipping any faster than it did a year ago. The build accelerated, but the delivery did not. Leaders reach for a technology explanation, yet there is almost never one. The build was never the gate. The gates sit upstream and around it, on the organization's own side of the table, and generative tooling does nothing to open them.

The speed of an AI delivery engagement is not set by how fast the agents write code. It is set by how fast the organization can decide, grant access, clear governance, provide working environments, and hold to an operating rhythm. When those conditions are in place, the velocity the tools promise actually reaches production. When they are not, the team produces faster and ships at the same pace it always did, because the new output simply queues behind the same old constraints. We have learned to treat readiness not as a precondition for delivery but as the delivery itself, and we establish it deliberately, at five specific gates.

Gate one: decision authority

Agentic delivery produces work in hours that used to take days, and that only helps if someone can accept or redirect that work at the same speed. On most programs the single largest source of lost time is not the work, it is the wait for a decision. We name one accountable approver for each class of decision, and a deputy for the days that person is unavailable. We set explicit service levels on decisions just as our industry has always set them on systems; a specification earns a decision within a day and a completed increment earns an acceptance within two. Without that discipline, the agents fill the queue and the queue waits on someone's calendar.

Gate two: discovery access

You cannot specify what you cannot see, and an AI agent executes against the specification, not against your intent. Before a team can write a specification an agent can use, it needs access to the people who hold the process in their heads, the data the work depends on, and the systems it has to touch. In regulated environments such as healthcare, and in data-sensitive environments such as retail and quick-service restaurants, that access is exactly what is slowest to arrange and most often underestimated. We sequence discovery access early and explicitly, because every day a team spends guessing at what it cannot see is a day it spends generating rework at machine speed.

Gate three: governance clearance

Three questions decide whether AI-generated work can ship, and all three have to be answered before delivery starts rather than discovered in the middle of it. Is AI-generated code permitted in this codebase at all? Who owns the intellectual property it carries, and what open-source licensing obligations come attached to it? How is the data the system touches classified, and what may an autonomous process do with it? These are not engineering questions, they are legal, security, and compliance questions, and they carry the longest lead time of anything in the engagement. In a year when regulators across jurisdictions are moving from published policy to active enforcement, governance clearance is often the gate that decides whether a program is buildable at all. We treat it as a critical path, not paperwork.

Gate four: environment readiness

Accepted work that cannot be deployed is not delivered, it is inventory. Build, test, and staging environments have to be provisioned, accessible, and representative of production before the velocity of generation means anything downstream. This is the least glamorous of the five gates and one of the most common places we watch programs quietly stall, because environment provisioning usually sits across organizational boundaries that no single delivery team controls.

Gate five: the working agreement

The first four gates hold only if there is an operating rhythm that keeps them honest. The working agreement defines the cadence, the decision forums, the escalation paths, and the shared definition of done between our team and the client's. It is the connective tissue that turns five separate conditions into a system that can sustain pace. Without it, readiness decays the moment the first deadline applies pressure.

Start narrow, prove the rhythm

We do not try to stand all five gates up across an entire portfolio at once. That is the failure mode the industry has already named, heavy up-front specification and a big-bang release that collapses under its own weight. We start with a deliberately narrow slice of real work and use it to prove the rhythm of decide, build, accept, and repeat. A narrow slice exposes which gate is actually weakest in a given organization, and it does so in weeks rather than quarters, before the cost of being wrong has scaled. Readiness is a capability an organization earns on a small surface and then extends, not a document it approves on a large one.

The same gates across every practice

These gates are not specific to any one kind of AI work. Whether an engagement runs through our Orchestration, Design, or Engineering practice, the build is the part generative tooling has already made fast, and the readiness is the part that decides whether that speed survives contact with the organization. The technology is now the smallest variable in the equation, the operating model around it is the largest.

Where to start

The organizations that win with AI delivery over the next few years will not be the ones with the most advanced tooling. They will be the ones that can decide quickly, clear governance early, and hold an operating rhythm under pressure, because those are the conditions that convert generated output into shipped outcomes. That capability is learnable, and it is where we start. Before we build, we run a delivery readiness assessment against these five gates, so the speed you are paying for is the speed you actually get. If your build has gotten faster and your delivery has not, that gap is the conversation worth having, and it is the one we would start with.