When AI Saves Time but Costs Time: The Hidden Organizational Drag in Developer Efficiency

In a randomized controlled trial conducted in 2025, experienced open-source developers took 19% longer to complete assigned tasks when they were allowed to use AI coding tools than when they were not. (METR Research, 2025)

It’s a provocative result — on its face, counterintuitive. But dig deeper, and you begin to see a pattern: AI tools may deliver gross gains (faster snippets, scaffolded code, boilerplate), yet hidden overhead emerges in coordination, context switching, review, and alignment.

That overhead is what we call organizational drag — the invisible friction that often turns “time saved” into net stagnation. In this post, we’ll walk through the evidence, diagnose where drag accumulates, and explore how product and engineering leaders can reclaim the promise of AI by focusing on net efficiency — not just optimistic speedups.

The Promise vs. the Reality

The dominant narrative in tech today is simple: bring AI into development workflows, and watch productivity soar. AI copilots, code assistants, and automated test generators have flooded the market with promises of 10× efficiency. And in isolated, narrowly scoped tasks (generating boilerplate, scaffolding code, drafting tests), the promise can hold.

But those are gross gains, measured narrowly — not in context of the entire system of work. Real-world studies, like METR’s, reveal a different story: when AI enters messy, collaborative environments, coordination costs, quality reviews, and misalignment begin to compound. The outcome? Net velocity remains flat — or even declines.

Why the Disconnect? Understanding Organizational Drag

Organizational drag is the “friction tax” teams pay when complexity grows faster than clarity. AI tools add new layers to already-complex systems — new artifacts, new workflows, and new human behaviors that must adapt. Let’s unpack the major sources of drag.

  1. Coordination & Handoff Friction. Every AI-generated suggestion creates something that must be reviewed, tested, or reworked. Teams spend additional time clarifying ownership, managing handoffs, and reconciling mismatched assumptions.

  2. Context Switching & Tool Overhead. Developers juggle IDEs, AI prompts, documentation, and design systems — often waiting for responses, validating outputs, or correcting context loss. Each switch interrupts flow, creating micro-delays that compound across a sprint.

  3. Review & Validation Overhead. AI-generated code still requires human verification. Bugs, style mismatches, and security flaws multiply review cycles. The risk is subtle: the more teams trust AI, the less they scrutinize — until late-stage QA or production reveals the gaps.

  4. Information Retrieval & Knowledge Drift. AI may generate snippets faster, but it can’t (yet) unify your knowledge base. Teams still spend hours reconciling outdated specs, missing documentation, or conflicting architectural decisions.

  5. Psychological Bias. Perhaps the trickiest: developers feel faster using AI. In the METR study, participants believed AI saved them ~20% of their time — even as measured performance showed a 19% slowdown. This overconfidence makes it harder for organizations to perceive drag until it’s systemic.

Measuring the Net Impact

To manage drag, you must measure beyond speed. Gross output metrics like “lines of code” or “PRs merged” don’t capture systemic friction. Instead, track net productivity — the true throughput after accounting for drag.
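To make the distinction concrete, here is a minimal back-of-the-envelope sketch. The minute values are illustrative assumptions, not measurements from METR or any other study:

```python
# Illustrative, made-up numbers for a single task; not data from any study.
gross_saving = 30        # minutes of typing/authoring the AI assistant appears to save
prompting_overhead = 8   # prompting, waiting on responses, re-prompting after misses
review_overhead = 15     # extra human review and validation of the generated code
context_overhead = 12    # reconciling the output with specs, docs, and architecture

net_saving = gross_saving - (prompting_overhead + review_overhead + context_overhead)
print(f"Net time saved on this task: {net_saving} minutes")  # -5: a small net loss
```

The point is not these particular numbers; it is that every overhead term has to be measured before any "time saved" can be claimed. The metrics below are one way to surface them.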

Recommended metrics:

  • Cycle time: from ticket start to production.

  • Review latency: time between PR open and merge.

  • Defect density: defects and rework traced back to AI-assisted code.

  • Context switching time: proportion of time lost to non-flow activities.

  • Developer sentiment: qualitative indicators of friction and burnout.

Use before/after studies or pilot groups to compare AI-enabled vs. control teams. Instrumentation tools (like Faros.ai) can now automate these analyses. The goal isn’t to disprove AI’s benefits — it’s to separate gross speed from net velocity.
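As one way to operationalize the first two metrics, here is a minimal sketch that derives cycle time and review latency from work-item timestamps. The record shape and field names (started_at, pr_opened_at, merged_at, deployed_at) are assumptions for illustration; in practice the data would come from your issue tracker, git host, or an instrumentation tool:

```python
from datetime import datetime
from statistics import median

# Hypothetical records; in practice these come from your tracker or git host API.
work_items = [
    {"started_at": "2025-06-02T09:00", "pr_opened_at": "2025-06-03T14:00",
     "merged_at": "2025-06-04T10:30", "deployed_at": "2025-06-04T16:00"},
    {"started_at": "2025-06-05T11:00", "pr_opened_at": "2025-06-09T09:15",
     "merged_at": "2025-06-10T17:45", "deployed_at": "2025-06-11T08:00"},
]

def hours_between(start: str, end: str) -> float:
    """Elapsed hours between two ISO-8601 timestamps."""
    return (datetime.fromisoformat(end) - datetime.fromisoformat(start)).total_seconds() / 3600

cycle_times  = [hours_between(w["started_at"], w["deployed_at"]) for w in work_items]
review_times = [hours_between(w["pr_opened_at"], w["merged_at"]) for w in work_items]

print(f"Median cycle time:     {median(cycle_times):.1f} h")
print(f"Median review latency: {median(review_times):.1f} h")
```

Run the same calculation separately for the AI-enabled pilot and the control group over the same window; the gap (or lack of one) between the medians is your first read on net velocity.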

Designing for Net Productivity

AI success isn’t automatic. It’s engineered.

  1. Governance & Boundaries. Define where AI adds value (boilerplate, tests, documentation) and where human expertise must lead (architecture, logic). Codify review policies and establish accountability.

  2. Toolchain Integration. Integrate AI into your existing workflows, not alongside them. Connect your IDE, documentation, and design sources so developers don’t lose context or duplicate work.

  3. Frictionless Context Delivery. Use living documentation — systems where design and code share the same source of truth. Provide AI assistants with relevant metadata and architectural context to reduce “hallucination by omission.”

  4. Review & QA Orchestration. Keep humans in the loop but automate the guardrails: static analysis, security scans, and style checks (a minimal sketch follows this list). Think of AI as a pair programmer, not an unmonitored contractor.

  5. Incremental Adoption. Start with internal tools, small modules, or test automation. Measure, iterate, and expand only when velocity and quality both rise.

  6. Cultural Alignment. The human factor matters most. Train teams to challenge AI outputs, not defer to them. Reinforce learning and humility: AI speeds up execution, not understanding.
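Returning to point 4 above, here is a rough sketch of what automated guardrails in front of human review might look like. The specific commands (ruff, bandit, pytest) are stand-in assumptions for whatever linter, security scanner, and test suite your team already runs:

```python
import subprocess
import sys

# Hypothetical guardrail pipeline: each command is a stand-in for your own tooling.
CHECKS = [
    ("style/lint",    ["ruff", "check", "."]),
    ("security scan", ["bandit", "-q", "-r", "src"]),
    ("test suite",    ["pytest", "-q"]),
]

def run_guardrails() -> int:
    """Run automated checks so humans only review changes that pass them."""
    for name, cmd in CHECKS:
        print(f"Running {name}: {' '.join(cmd)}")
        result = subprocess.run(cmd)
        if result.returncode != 0:
            print(f"{name} failed; send the change back before human review.")
            return result.returncode
    print("All guardrails passed; ready for human code review.")
    return 0

if __name__ == "__main__":
    sys.exit(run_guardrails())
```

Wiring something like this into CI for every AI-assisted pull request keeps human attention on the judgment calls (architecture, logic, intent) rather than on mechanical checks.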

Looking Ahead: Toward True AI-Augmented Flow

The next wave, agentic AI, promises to take on larger slices of the workflow: self-repairing pipelines, autonomous code reviews, proactive test generation. But even then, friction won’t vanish; it will simply shift. True AI-augmented flow requires alignment across:

  • Architecture (systems that are modular and observable)

  • Process (governed yet flexible workflows)

  • Culture (teams that embrace curiosity and measurement)

When all three align, organizational drag becomes the exception — not the rule.

Conclusion & Call to Action

AI is not just a coding tool; it’s an organizational mirror. If it’s slowing you down, the problem isn’t the model — it’s the system it’s dropped into.

To recap:

  • Gross gains ≠ Net gains.

  • Measure drag through system metrics, not anecdotes.

  • Integrate intentionally; pilot before scaling.

  • Align teams, architecture, and expectations.

If you lead an engineering or product organization, start small: run a two-team experiment, measure both throughput and friction, and begin documenting where drag accumulates. That’s where real transformation begins.
