The hidden cost of AI coding is review, not writing
Ricardo Argüello — June 29, 2026
CEO & Founder
General summary
Business Insider reported that the hidden cost of the AI coding boom is workplace paralysis. The problem isn't that AI writes bad code. It's that it writes faster than any human can review, so the bottleneck moved from typing to reviewing. Without a system built to absorb that shift, more speed produces less progress.
- Typing speed was never the bottleneck. Now that AI generates code faster than anyone can read it, the work that piles up is review, and review doesn't speed up with a bigger model.
- The data backs it up: GitClear measured copy-pasted code rising while refactored code fell; Google's DORA report tied AI use to lower delivery throughput and stability.
- A controlled METR study found experienced developers were 19% slower with AI, while believing they were 20% faster. The paralysis starts in that perception gap.
- Cognitive debt is not technical debt. It doesn't live in the code; it lives in the team that stopped understanding what it ships. No linter catches that.
- The way out isn't another tool. It's a system of loops the team owns, with review and ownership built in, where the metric stops being how much code ships and becomes how much the approver actually understands.
Picture hiring ten writers who produce a thousand pages a day, but you still have one editor who reads at the same speed as always. The factory outputs more than ever, and the stack of unreviewed pages keeps growing. Nobody writes slowly; the problem is nobody can keep up with the reading. AI coding works the same way: the model generates code at a speed your review process was never designed to absorb, and the productivity you see on the slide jams in the review queue you don't see.
AI-generated summary
Business Insider ran a piece on Monday that should make a few engineering leaders uncomfortable: the hidden cost of the AI coding boom isn’t a technical one. It’s human, and it looks a lot like paralysis.
Tim Paradis and Thibault Spirlet tell it through real engineers. The first reaction to every new tool isn’t “this is exciting,” it’s “I’m behind.” Model releases went from 18 in 2023 to 69 in 2025. Nobody finishes evaluating last week’s release before this week’s lands.
But the paralysis isn’t the AI’s fault. And the fix isn’t another tool either. It’s a system the team owns, with review and ownership built into it. Here’s why.
The bottleneck moved, and almost nobody reorganized around it
Gergely Orosz, who writes the most-read engineering newsletter in the industry, put it plainly: typing speed was never the bottleneck. Writing the code was always the fast part. The slow part was always understanding it, reviewing it, and deciding whether it belongs in production.
So AI shows up and solves the part that was already fast. It generates code at twice the rate it did six months ago. And the slow part? Still just as slow, because review requires a human to load the full context into their head, and that doesn’t accelerate with a larger model.
The outcome is predictable. CIO ran a headline that says it all: “AI killed the code review.” PRs pile up or get rubber-stamped. The review queue is the new bottleneck, and most teams just dumped more output on top of it without touching the queue itself.
This is the same point I make from another angle in the four loops that replaced prompt engineering: the value isn’t in generating faster, it’s in closing the review-and-correct loop with judgment. Generation is the easy step. It always was.
What the speed hides
Here’s where the “AI made us 10x more productive” story collides with the data.
METR ran a controlled study with experienced developers working in real repos over a million lines long. The result stings: they were 19% slower using AI. And the most revealing part isn’t the slowdown. It’s that they believed they had been 20% faster. A gap of nearly 40 points between what they felt and what happened.
That gap is paralysis in its purest form. It’s not that the team stops working. It’s that it works with the feeling of flying while moving less, and nobody notices until the calendar doesn’t add up.
GitClear analyzed 211 million lines of code and measured the fingerprint: copy-pasted code went up, duplicated blocks multiplied, and the code that gets moved or refactored (the signal that someone is cleaning up, not just adding) dropped by half. More volume, less care.
Google’s DORA 2024 report confirms it at the delivery level: for every increase in AI use, they measured a drop in delivery throughput and a larger drop in stability. AI pushes the size of each change up, and big batches are harder to review and easier to break.
None of these three numbers says “AI is bad.” They say something more uncomfortable: AI is excellent at accelerating the half of the work that was already fast, and that throws the whole system off balance if you don’t redesign the other half.
Cognitive debt is not technical debt
In the comments under that same Business Insider piece, Jean-François Bernier dropped the best line in the whole thread. He said engineers aren’t overwhelmed by AI itself, but by the expectation that they absorb every new tool, agent, and benchmark the moment it ships. And he named it: that isn’t learning anymore. It’s cognitive debt.
It’s worth separating the two kinds of debt cleanly, because people conflate them and they’re nothing alike.
Technical debt lives in the code. It’s ugly, duplicated, untested. A linter catches it, a refactor pays it down.
Cognitive debt lives in the team. It’s the understanding you lose when you delegate so much that you stop grasping what you ship. No linter catches it. It shows up the day something breaks in production and the person who approved the PR can’t explain why the code did what it did, because they never loaded it into their head. They just let it through.
The MIT Media Lab published a preliminary study (not yet peer-reviewed, so treat it as a signal, not proof) that measured exactly this with EEG: the group that leaned hardest on the model showed lower brain connectivity and worse recall of what they had just produced. They called it, fittingly, cognitive debt.
This connects directly to the line I drew between cognitive delegation and cognitive surrender: delegating execution is fine; surrendering judgment is where the trouble starts. A team approving code it doesn’t understand isn’t delegating, it’s surrendering, and it does so very efficiently.
What we do differently at IQ Source
The lesson isn’t “slow the AI down” or “review more carefully.” It’s to build a system instead of buying another tool, which is the same thesis I made when Block built Builderbot instead of buying a chatbot.
A system, here, means concrete things. It means work loops where review isn’t an optional final step but part of the loop. It means everything the agent ships has a human owner who understands it, not just someone who approved it. It means explicit quality criteria defined before anything gets generated, so that reviewing isn’t reading eight pages blind but comparing against a reference that already exists.
And it means changing the metric. As long as you measure how many lines or how many PRs your team produces, you’re measuring the easy thing. The metric that matters is how much of the code reaching production has someone who understands it and answers for it. That measures coherence, not volume. And coherence is the first thing to break when speed goes up without the system changing.
That’s what we design in the first phase of AI Maestro: we don’t plug in an agent and wish you luck. We map your real processes, define the criteria you’ll use to evaluate what the agent produces, and build the review system that turns AI’s speed into real progress instead of a queue of unopened PRs.
If your team feels like it’s producing more than ever but getting less far, the AI isn’t failing. You bolted a race engine onto a review process that still walks. Fix the queue, not the engine.
Let’s fix the review queue, not the engineFrequently Asked Questions
AI generates code faster than any human can review it, so the bottleneck shifts from writing to reviewing, and review doesn't speed up with a bigger model. Teams without a system to absorb that volume pile up unreviewed code, lose context on what they ship, and make less progress despite producing more.
A controlled METR study found experienced developers were 19% slower using AI, while believing they were 20% faster. GitClear documented more copied code and less refactoring, and Google's DORA 2024 report tied AI use to lower delivery throughput and stability. Perceived speed does not match measured speed.
Cognitive debt is the loss of understanding a team accumulates when it delegates so much to AI that it stops understanding what it ships. Unlike technical debt, it doesn't live in the code; it lives in people. A linter catches technical debt; nothing catches cognitive debt until something breaks and nobody can say why.
A company avoids it by building a system instead of buying another tool: work loops the team owns, with code review and ownership built in, and explicit quality criteria for evaluating what the agent ships. The metric stops being how much code is generated and becomes how much the reviewer actually understands and approves with judgment.
Related Articles
An AI Skill Isn't a Prompt. It's Code With Your Keys.
26% of public skills have vulnerabilities and over 5% are malicious. NVIDIA shipped a scanner. Why vetting a skill is now part of what you deliver to a client.
You built a Foxconn factory to babysit your AI
Garry Tan admitted he wrote 540,000 lines of code he didn't need. For 36 years, capability meant lines of code. That equation just inverted, and most of us missed it.