Skip to main content

AI writes half your code and nothing ships faster

AI now writes nearly half the commits, but time to production hasn't moved. The bottleneck shifted downstream to tests, CI, and the delivery pipeline.

AI writes half your code and nothing ships faster

Ricardo Argüello

Ricardo Argüello
Ricardo Argüello

CEO & Founder

Software Development 5 min read

AI already writes a huge share of the code. On many teams, close to half the commits. And still, the time it takes a feature to reach production has barely moved.

That should be strange. If the most visible part of the work got ten times faster, why is the total clock the same?

Here is the thesis in one line. We solved code writing, but the chain that comes after it didn’t grow with it. AI amplifies whatever system it runs through. If your delivery chain is messy, all you get is shipping the mess faster. The bottleneck didn’t disappear. It moved downstream, and most teams are still optimizing the step that stopped being the constraint.

Code got cheap. Delivery didn’t.

Priyanka Vergadia framed it this week with a clarity worth repeating: AI writes nearly half the commits on most teams, and time to production hasn’t changed at all. The savings from writing evaporate downstream, in waiting, rework, and coordination.

The data backs the observation. Per a report covered by TechRadar, 69% of heavy AI tool users report frequent deployment problems with the generated code, nearly half say manual quality assurance, remediation, and validation work has increased, and only 21% of teams can stand up build and deployment pipelines quickly. Code goes in faster. Everything after it clogged up.

I have been shipping software for decades, and if there is one thing every productivity jump teaches you, it is that the limit doesn’t get removed. It relocates. Speeding up one part of the chain doesn’t speed up the whole: it just exposes the link you didn’t touch. This time that link is delivery.

A flaky test no longer fails once a week, it fails daily

The mechanism is concrete, and anyone with a CI will recognize it. More commits run the full test suite far more often. A flaky test, the kind that sometimes passes and sometimes fails with nobody touching anything, stops being a weekly annoyance and starts failing every day. And a single bad test is enough to block every merge waiting in line behind it.

On top of that comes the invisible cost: every pipeline rerun triggered by a failure that wasn’t real burns compute time and, worse, burns the team’s patience. PR review becomes the new funnel, because now there is twice the volume of changes to review and each one carries code the reviewer can’t approve at a glance.

None of these problems is new. What’s new is the volume. A crack in delivery that held fine at fifty commits a day becomes a dam at two hundred. AI didn’t create the problem. It amplified it until it could no longer be ignored.

AI amplifies the system, including the broken one

There is one statistic that sums up the whole thing. 93% of developers use AI, but measured productivity gains hover around 10%. The gap between those two numbers is exactly the mess in the delivery chain.

Because AI is not a selective accelerator that only improves the good parts. It is an amplifier. It turns up the volume on the entire system it passes through. If your testing process is solid, it makes it fly. If it’s fragile, it multiplies the failures. If your review was already saturated, it drowns it. Accelerating the input of a messy system doesn’t give you a fast system. It gives you a system that produces mess faster.

I’ve written before that the bottleneck moved and that building got cheap while deciding what to build didn’t. This post is a different face of the same shift, a more concrete one that’s easier to ignore: it’s not only about deciding what to build, it’s about what happens to everything built when it hits the delivery chain at ten times the old speed. You can decide perfectly well and still choke on a fragile CI. I see it all the time.

What we do about it at IQ Source

When a company asks us to accelerate its development with AI, we don’t look only at the code-writing step. We look at the whole chain: write, test, review, deliver. Because multiplying the input of a chain without knowing where its weakest link sits is the most expensive way to find that link.

AI Maestro is the discovery where that full value stream gets mapped before anyone touches the accelerator. We identify where delivery will break when commit volume spikes: the flaky tests that hold today, the suite that already runs too long, the review that depends on two people. And we reinforce that link before turning up the speed, not after the first jam in production. It’s the same principle I wrote about when a failure looked like the agent’s and was actually the architecture’s: the problem is almost never where everyone is looking.

The next time someone celebrates that AI now writes half the team’s code, ask one question before the party: does that half reach production faster, or does it just pile up in a review-and-test queue nobody made bigger? If it’s the second, you didn’t speed up delivery. You just put more pressure on the same narrow door.

Find your delivery’s weak link before you accelerate

Frequently Asked Questions

CI/CD software delivery AI in development flaky tests DevOps AI Maestro engineering productivity

Related Articles

The agentic moat is not the model. It's seven files.
Software Development
· 7 min read

The agentic moat is not the model. It's seven files.

An April 30 paper from Fudan + Peking measures seven harness components. The system prompt is the only one that regresses below baseline when isolated.

Anthropic AI Maestro Tech Partner