What is Theory of Constraints and how does it apply to AI adoption in companies?

Theory of Constraints, formalized by Eliyahu Goldratt in his 1984 novel The Goal, holds that every productive system has exactly one constraint binding total throughput and that improving any other part only lengthens queues. Applied to AI, it means generating more code upstream does not accelerate delivery when the real constraint sits in review, integration testing, or production approval.

Why do AI productivity metrics like token usage mislead executives about real delivery speed?

Token usage metrics and internal usage leaderboards measure upstream generation, not downstream delivery. Goldratt demonstrated in 1984 that rewarding production at non-constraint stations only lengthens queues. A team can consume twice the tokens and ship half the features when the constraint sits in review capacity rather than in generation speed.

How does a small or mid-sized company identify its real workflow constraint when rolling out AI tools?

Attend three standups in a row with a notebook and write down every time someone says the phrase 'blocked on.' The term that appears most often points to the real constraint, not the one your intuition suggests. Goldratt's Five Focusing Steps begin there: identify the constraint before buying any AI tool or reorganizing the team.

What is the difference between optimizing AI code generation and elevating the review constraint?

Optimizing AI code generation speeds up the creation of pull requests but not the capacity to review them. Raising capacity at the review constraint, by contrast, lifts downstream throughput: more senior reviewers, higher PR quality, automation of validations that do not require human judgment. It is step four of the process Goldratt prescribed in 1984.

www.iqsource.ai

Goldratt diagnosed AI workflow gridlock in 1984

Ricardo Argüello

Goldratt diagnosed AI workflow gridlock in 1984

Ricardo Argüello — May 19, 2026

Ricardo Argüello

CEO & Founder

May 19, 2026 Business Strategy 8 min read

General summary

Alex Wang named the symptom on LinkedIn this week: AI accelerates generation, but review and approval capacity does not move at the same speed, so the bottleneck shifts instead of disappearing. The cure already exists. Eliyahu Goldratt diagnosed this mechanism in a 1984 industrial novel, and the Five Focusing Steps he prescribed are still, 42 years later, the only operating recipe with real outcomes behind them. This article applies those five steps directly to the AI investment your team is about to approve this quarter.

AI-generated summary

Explore other styles:

Every AI adoption dashboard I look at this week is optimizing the wrong thing. Eli Goldratt explained why in a 1984 novel about an industrial machinery plant: every productive system has exactly one constraint binding total throughput, and local improvements anywhere else just lengthen the queues. AI is the new faster machine. The novel still reads. Until your AI investment moves the real constraint, adding more generation upstream just grows the queue forming in front of whoever approves the work today.

The token KPI, the internal Claude usage ranking, the Cursor adoption panel: each one measures somewhere your team already has spare capacity. None of them measures where the flow actually jams.

The constraint moves; it does not vanish

Alex Wang published an article on May 15 that went viral this week with over 550 reactions. Wang authors the “Learn AI Together” newsletter with 548K subscribers. The core observation, verbatim: “AI increases output, but workflow capacity decides whether that output becomes progress.” When review, integration, and approval capacity does not move at the same speed, the bottleneck does not disappear; it shifts toward whoever now has to validate five times the material in the same hours. Wang names the symptom. The cure exists and is 42 years old.

Eliyahu Goldratt published The Goal in 1984 as an operating novel about Alex Rogo, a plant manager fighting to save a factory from imminent closure. The book sold ten million copies and is still assigned in MBA programs because it explained a simple mechanism most companies ignore when they buy new technology. Every productive system has a single constraint that binds total throughput. Everything else has spare capacity. If you optimize stations that already had spare capacity, you do not lift throughput; you only lengthen the queues forming in front of the station that is genuinely limited. Goldratt called that station the system’s Herbie, after the slowest scout in a Boy Scout troop on a hike: the entire troop walks at Herbie’s pace no matter how much faster the others could go.

In an enterprise AI rollout, the constraint is almost never code generation. It is one of four specific places. Pull request review. Integration testing. Decision authority to merge to production. Or rework, when AI-generated code passes shallow review but breaks something two sprints later. A mid-sized company where one senior engineer reviews everything on Friday afternoon has a Herbie with a name. Cursor or Claude Code adoption is not going to relieve that Herbie; it is going to bury him.

The Five Focusing Steps applied to AI

Goldratt prescribed a five-step process that remains, to this day, the only operating recipe inside Theory of Constraints with measured results behind it. Each step applies directly to an AI investment decision your team is about to make this quarter.

One. Identify the constraint. Do not trust the dashboard. Watch where the queue piles up in the standup. The phrase you have to hear is “blocked on review” or “blocked on approval.” The constraint is the person, process, or system that keeps showing up in that phrase week after week. If five different standups name the same person as the blocker, you do not need more analysis. You found Herbie.

Two. Exploit it. Before you add capacity, before you subcontract, before you buy more tooling, make sure the constraint is working only on what only it can do. The rule on a factory floor: one hour lost at the constraint is one hour lost across the whole system. AI behaves identically. Reduce the noise the senior reviewer has to process. Raise the quality of AI output before raising the volume. The tokens-per-shipped-feature KPI we covered two days ago is a step-two metric, not a step-one metric: it only matters once you already know the constraint sits in review, not in generation.

Three. Subordinate everything else. This is the step most companies skip. It means upstream generation paces itself to the constraint’s capacity, not the other way around. When you see an internal usage ranking like Meta’s reported Claudeonomics dashboard measuring tokens consumed per team and putting them in a public order, you are looking at the exact anti-pattern Goldratt warned about in 1984. Rewarding token consumption is rewarding upstream generation without verifying that the downstream constraint can process the queue. The number goes up, queues lengthen, throughput stays flat. Wang reported the Meta case this week alongside several parallels at other large companies; the observation matters more as a category than as an anecdote.

Four. Raise capacity at the constraint. Only after steps one through three are exhausted. Here you have two options, both expensive. Hire more senior capacity, a six-to-nine-month cycle in the current talent market. Or contract specialized review capacity by the hour. IQ Source’s Technology Partner service exists specifically for this step: we absorb the review and architecture load that does not need to live inside your senior team, while the senior team keeps the product and direction calls.

Five. Go back to step one. When you add capacity, the constraint does not disappear, it moves. Today it is the senior reviewer. In six months it may be the QA cycle, the production approver, the security team signing off every deploy. AI adoption is not a project; it is an identify-raise-identify cycle that repeats every quarter. The company that understands this pays for the next AI phase with margin. The one that does not pays for architecture it never ships.

Why this hits small and mid-sized companies harder

Most of the clients we work with at IQ Source are small or mid-sized companies with one to three senior engineers. The constraint is structurally identifiable because there is only one candidate: the single reviewer. At a large company there is always the option to hire another principal engineer; at a smaller one that move takes six to nine months in the current talent market and is the first commitment to drop when a soft quarter arrives. That is why step two, exploit the constraint, matters more at a mid-sized company than at a Big Tech company. A Big Tech company can absorb hidden waste. A mid-sized firm cannot.

Three concrete moves a mid-sized-company CTO can run this week, with no extra budget and no new hires.

One. Sit in on a standup with a notebook. Write down every time someone says “blocked on.” The phrase that appears most often points to your real constraint, not the one your intuition suggests.

Two. Take one queue in front of the identified constraint. Reduce its input before optimizing anything else. If the senior reviewer is jamming on poorly prepared pull requests, raise the self-review bar before raising the generation volume. One hour spent on a new PR template buys more than ten hours of Cursor configuration.

Three. Audit one AI adoption dashboard. If the metric counts upstream generation (tokens consumed, prompts sent, lines suggested), pull it from the executive report. Replace it with a constraint metric: review wait time, pull requests queued at end of Friday, rework rate from past defects.

What we do at IQ Source

AI Maestro is the two-month discovery program that operates steps one and two for you. The first deliverable is a Process Reality Map that names the constraint by name, not as a concept. The second is an AI Opportunity Score that orders the available bets by whether they raise capacity at the real constraint or merely optimize noise somewhere else. The Go/No-Go gate at the end of the program decides whether the AI implementation actually makes sense or whether the investment is better spent fixing another part of the flow first.

Technology Partner is step four when the constraint sits in senior review capacity and hiring is not a near-term option. We absorb the review and architecture load that does not need to live in your senior team, while the senior team keeps what only they can do: product decisions, technical direction, customer relationship.

The two services cover both sides of the problem Wang named this week. AI Maestro identifies where the jam sits before you buy one more tool. Technology Partner gives you real capacity to move the jam when the problem is structural.

If you want an hour of your team with me, no attached proposal, to identify your constraint in a mapping session: info@iqsource.ai. You bring the “blocked on” list from your last three standups. I bring The Goal and a notebook.

Goldratt closed the five-step process with an explicit warning: step five is not “finish and celebrate”; it is “go back to step one before inertia drags the previous constraint back in.” 42 years later that is still the hardest discipline. The right AI bet is the one that earns you the right to move the bottleneck somewhere harder. The wrong one just buys you a faster Herbie running the same jam in less time.

Map your bottleneck in 60 minutes

Frequently Asked Questions

Theory of Constraints Eliyahu Goldratt AI workflow AI Maestro Technology Partner AI governance review bottleneck

Anthropic's J-Space Can Now Audit Claude's Intent

Business Strategy

July 7, 2026 · 7 min read

Anthropic's J-Space Can Now Audit Claude's Intent

Anthropic published the J-space: an internal layer that audits what Claude is thinking before it answers, catching deception, fabrication, and hidden goals.

J-space Anthropic Claude

Bridgewater Fine-Tuned a Model and Beat GPT, Claude, Gemini

Business Strategy

July 6, 2026 · 5 min read

Bridgewater Fine-Tuned a Model and Beat GPT, Claude, Gemini

Bridgewater trained Qwen3-235B with Tinker and hit 84.7% accuracy, 13.8x cheaper than GPT, Claude, and Gemini with the best prompts money could write.

Bridgewater Thinking Machines Tinker

Goldratt diagnosed AI workflow gridlock in 1984

Goldratt diagnosed AI workflow gridlock in 1984

General summary

The constraint moves; it does not vanish

The Five Focusing Steps applied to AI

Why this hits small and mid-sized companies harder

What we do at IQ Source

Frequently Asked Questions

Related Articles

Anthropic's J-Space Can Now Audit Claude's Intent

Bridgewater Fine-Tuned a Model and Beat GPT, Claude, Gemini

IQ Source Assistant