Enterprise Impact of the AI Fever: What Almost Four Years of GenAI Have Actually Bought
Almost four years since ChatGPT, the enterprise picture is real but uneven. McKinsey's State of AI 2025: 88% of organizations now use AI regularly, but only one-third have begun to scale and only ~6% qualify as AI high performers attributing more than 5% of EBIT to AI. MIT NANDA's GenAI Divide estimates 95% of generative-AI pilots return zero P&L impact. IBM 2026: 76% have a Chief AI Officer, but only 25% of the workforce uses AI regularly. The firms compounding value treat AI as an operating-model programme: they redesign workflows, route cheap models to cheap work, measure baseline and uplift, and govern AI as a production system.
Key takeaways
- The hype-to-impact gap is structural and now well measured. Sponsorship is high, workflow redesign is rare, and only ~6% of organizations attribute more than 5% of EBIT to AI in McKinsey's November 2025 survey.
- MIT NANDA, July 2025: roughly 95% of generative-AI pilots are still failing to deliver measurable returns. The gap is concentrated in custom internal builds, not in purchased copilots.
- Agentic AI is real but narrow. 23% of organizations are scaling at least one agent somewhere, 39% are experimenting, but no individual function shows more than 10% scaling agents.
- What enterprises actually build is narrow: copilots, enterprise search, customer-service bots, code assistants, and bounded back-office agents. RAG and tool use have replaced 'one giant model'.
- Token metering is rarely the dominant cost line for modest internal workloads. Seats, search, evaluation, security and human review usually outweigh the model bill until volumes get very large.
Canonical URL: https://juanbeltran.ch/blog/enterprise-impact-ai-fever