The Enterprise AI Playbook · Sintesi interattiva

01 · IL PARADOSSO 01 · THE PARADOX

Il 95% dei pilot fallisce.
Loro hanno studiato il 5% che funziona. 95% of pilots fail.
They studied the 5% that works.

L'MIT NANDA ha mostrato che il 95% dei progetti GenAI enterprise non produce alcun impatto economico misurabile. Stanford Digital Economy Lab ha guardato dall'altra parte: cosa hanno fatto, in concreto, le 41 organizzazioni che hanno avuto successo. MIT NANDA showed that 95% of enterprise GenAI projects produce no measurable financial impact. Stanford Digital Economy Lab looked at the other side: what, concretely, did the 41 organizations that succeeded actually do.

95%

delle iniziative AI enterprise non produce impatto misurabile sul P&L of enterprise AI initiatives produce no measurable P&L impact

MIT NANDA · 2025

case study di deployment AI riusciti, analizzati con metrica rigorosa: stabilità, adozione, valore quantificato, scalabilità cases of successful AI deployments, analyzed with rigorous metrics: stability, adoption, quantified value, scalability

STANFORD DIGITAL ECONOMY LAB · 2026

Stessa tecnologia. Stessi modelli. Stessi use case. Risultati radicalmente diversi. Dove si gioca davvero la partita? Same technology. Same models. Same use cases. Radically different outcomes. Where is the real game played?

03 · LA VARIANZA TEMPORALE 03 · TIMELINE VARIANCE

Stesso use case.
Settimane oppure anni. Same use case.
Weeks or years.

Una fintech ha migrato milioni di righe di codice legacy in poche settimane con un agente AI. Una banca con lo stesso obiettivo "ci mette anni solo per partire". La differenza non è l'AI: è il contesto organizzativo. Sponsorship esecutiva, fondamenta esistenti e disponibilità degli utenti finali sono i tre acceleratori; data quality e compliance i tre rallentatori principali. A fintech migrated millions of lines of legacy code in weeks with an AI agent. A bank with the same goal reports "it takes us multiple years just to even stand one of these things up". The difference is not AI: it is organizational context. Executive sponsorship, existing foundations, and end-user willingness are the three accelerators; data quality and compliance the main brakes.

Migrazione codice legacy Legacy code migration FINTECH

SETTIMANE WEEKS

Recruiting AI AI recruiting PROFESSIONAL SERVICES

~1 MESE

Customer support tech Customer support tech TECHNOLOGY COMPANY

6 MESI 6 MONTHS

Customer support banca Bank customer support FINANCIAL SERVICES

+2 ANNI +2 YEARS

04 · SUPERVISIONE UMANA 04 · HUMAN OVERSIGHT

Più autonomia all'AI.
Più produttività. More AI autonomy.
More productivity.

Il modello a escalation — l'AI gestisce l'80%+ dei casi in autonomia, l'umano interviene solo sulle eccezioni — produce il 71% di guadagno mediano di produttività, contro il 30% del modello ad approvazione (umano valida ogni output). Non significa "meno controllo è meglio": dipende dal contesto regolatorio e dal costo dell'errore. Ma dove si può, l'autonomia paga. The escalation model — AI handles 80%+ of cases autonomously, humans only review exceptions — delivers 71% median productivity gain, versus 30% for the approval model (human validates every output). This doesn't mean "less oversight is universally better": it depends on regulatory context and error cost. But where it's possible, autonomy pays.

Escalation Escalation

+71% PRODUTTIVITÀ MEDIANA +71% MEDIAN PRODUCTIVITY

L'AI gestisce 80%+ in autonomia. L'umano interviene solo su eccezioni o sample ≤20%. AI handles 80%+ autonomously. Human reviews only exceptions or sample ≤20%.

Approvazione Approval

+30% PRODUTTIVITÀ MEDIANA +30% MEDIAN PRODUCTIVITY

L'AI fa il lavoro. L'umano rivede e approva ogni output prima dell'azione. AI does the work. Human reviews and approves every output before action.

Collaborazione Collaboration

+22% PRODUTTIVITÀ MEDIANA +22% MEDIAN PRODUCTIVITY

Umano e AI lavorano insieme su ogni task in continuo. Tipico per coding, clinica, contenuti regolati. Human and AI work together continuously on each task. Typical for coding, clinical, regulated content.

06 · LA RESISTENZA 06 · RESISTANCE

Non sono gli utenti finali.
Sono Legal, HR, Risk. It's not the end users.
It's Legal, HR, Risk.

Il senso comune è che siano gli utenti finali a frenare l'adozione AI. I dati Stanford raccontano un'altra storia: le funzioni di staff (Legale, HR, Risk, Compliance) sono la prima fonte di resistenza al 35%, davanti agli utenti finali al 23%. Hanno l'autorità organizzativa per rallentare o bloccare un progetto, indipendentemente dal supporto esecutivo. Conventional wisdom says end users resist AI adoption. The Stanford data tells a different story: staff functions (Legal, HR, Risk, Compliance) are the most frequent source of resistance at 35%, ahead of end users at 23%. They have the organizational authority to slow or stop projects regardless of executive support.

35%

Funzioni di staff Staff functions

Legal · HR · Risk · Compliance. Temono responsabilità e rischio regolatorio. Legal · HR · Risk · Compliance. Worry about liability and regulatory exposure.
23%

Utenti finali interni Internal end users

Diffidano dell'inconsistenza dell'AI rispetto a sistemi deterministici. Distrust AI inconsistency vs deterministic systems they know.
19%

C-Level / CFO C-Level / CFO

Pretendono ROI dimostrato prima di investimenti su scala. Demand proven ROI before broader investment.
15%

Middle management Middle management

Spesso più resistente di senior e junior — protegge il proprio ruolo. Often more resistant than seniors and juniors — protects its role.
8%

Lavoratori in prima linea Frontline workers

Paura di sostituzione. Apparsa solo in 2 casi su 51. Fear of replacement. Appeared in only 2 of 51 cases.

07 · HEADCOUNT 07 · HEADCOUNT

Riduzione comune.
Ma non maggioritaria. Reduction is common.
But not the majority.

Quando l'AI produce guadagni di produttività alti, cosa succede al personale? La risposta è meno univoca di quanto si racconti. La riduzione è la singola voce più frequente al 45%, ma le tre alternative — riassegnazione a lavoro a maggior valore, blocco delle assunzioni, nessuna riduzione — sommano il 55%. La tecnologia non determina l'esito: lo determina la strategia dell'azionista. When AI delivers high productivity gains, what happens to staff? The answer is less univocal than the noise suggests. Reduction is the single largest category at 45%, but the three alternatives — redeployment to higher-value work, hiring avoidance, no reduction — sum to 55%. Technology doesn't dictate the outcome: ownership strategy does.

45%

Riduzione Reduction

22%

Riassegnazione Redeployment

18%

Hiring evitato Hiring avoided

15%

Nessun cambio No change

Riduzione diretta Direct reduction

Tipica in PE, turnaround. Esito di applicazioni cost-driven. Typical in PE, turnaround. Cost-driven applications.

Verso lavoro a maggior valore To higher-value work

L'AI assorbe il lavoro meccanico, le persone passano al successivo collo di bottiglia. AI absorbs mechanical work; people move to the next bottleneck.

"Sostituisce assunzioni" "Replaces hires"

"L'AI non sostituisce chi hai. Sostituisce chi non assumi più." "AI is not replacing the person you have. AI is replacing the person you don't need to hire."

Accelerazione invece che cuts Acceleration over cuts

"I risparmi sono andati ad accelerare la roadmap, non a ridurre lo staff." "Savings were used to accelerate the roadmap, not reduce staff."

08 · AGENTIC AI 08 · AGENTIC AI

Il 20% dei casi.
Il 71% di guadagno. 20% of cases.
The 71% productivity gain.

L'AI agentica — sistemi che prendono azioni autonome end-to-end — rappresenta solo il 20% dei casi del campione. Ma dove è applicata bene, produce il 71% di guadagno mediano contro il 40% dell'high-automation. METR misura il "task horizon" dei modelli frontier: raddoppia ogni 7 mesi. A inizio 2026 i modelli completano in autonomia task da 15 ore di lavoro umano. La traiettoria è chiara. Agentic AI — systems that take autonomous end-to-end actions — represents only 20% of the sample. But where applied well, it delivers 71% median productivity gain versus 40% for high-automation. METR measures the frontier-model "task horizon": doubling every 7 months. By early 2026, models autonomously complete tasks worth 15 hours of human work. The trajectory is clear.

71% vs 40%

guadagno mediano di produttività delle implementazioni agentiche, contro il 40% delle high-automation tradizionali. median productivity gain of agentic implementations, vs 40% for traditional high-automation.

SOLO IL 20% DEI CASI · OGGI ONLY 20% OF CASES · TODAY

METR · AI TASK HORIZON · 2019–2026

L'AI agentica non è una nuova UI. È una ridefinizione del ruolo di umani e macchine nel workflow. Agentic AI is not a new UI. It is a redefinition of the role of humans and machines in the workflow. — THE ENTERPRISE AI PLAYBOOK · CAPITOLO 8 — THE ENTERPRISE AI PLAYBOOK · CHAPTER 8

09 · IL DATO 09 · DATA

Il 6% ha dati puliti.
Eppure funziona lo stesso. 6% have clean data.
And it works anyway.

La narrativa dominante dice che l'AI ha bisogno di dati puliti per funzionare. I dati Stanford raccontano l'opposto: solo il 6% delle implementazioni aveva dati pronti per l'AI. Ma negli altri casi, gli LLM sono stati parte della soluzione — non solo consumatori di dati puliti, ma lo strumento che ha reso utilizzabili dati confusi. Nel 88% dei casi gli LLM hanno sbloccato dati prima inaccessibili. The dominant narrative says AI needs clean data to work. The Stanford data tells the opposite story: only 6% of implementations had data ready for AI. But in the others, LLMs were part of the solution — not just consumers of clean data, but the tool that made messy data usable. In 88% of cases, LLMs unlocked previously inaccessible data.

PRIMA · L'ASSUNTO BEFORE · THE ASSUMPTION

delle imprese ha dati pronti per l'AI. of enterprises have data ready for AI.

Aspettare che i dati siano "puliti" prima di partire vuol dire non partire mai. Solo 1 caso su 16 ha trovato dati allineati allo stato dell'arte richiesto dai progetti AI. Waiting for "clean" data before starting means never starting. Only 1 in 16 cases found data aligned with the state-of-the-art needed for AI projects.

DOPO · LA SCOPERTA AFTER · THE DISCOVERY

88%

degli LLM ha sbloccato dati prima inaccessibili. of LLMs unlocked previously inaccessible data.

Voce, documenti scansionati, log, codice legacy: dati che fino a due anni fa non erano utilizzabili sono ora la fonte primaria. Salvate tutto. Anche imperfetto. Vi servirà. Voice, scanned documents, logs, legacy code: data that was unusable two years ago is now the primary source. Save everything. Even imperfect. You will need it.

10 · LA SCELTA DEL MODELLO 10 · MODEL CHOICE

Il modello è quasi sempre
una commodity. The model is almost always
a commodity.

Per il 42% delle implementazioni il modello fondazionale è completamente intercambiabile. Per il 39% conta moderatamente. Solo nel 19% dei casi è un differenziale critico — e si tratta di task ad alta complessità: coding, compliance, agentic. Tra i task di routine, il 71% li tratta come puramente intercambiabili. Il vantaggio durevole non sta nel modello: sta nello strato di orchestrazione. For 42% of implementations the foundation model is fully interchangeable. For 39% it matters moderately. Only in 19% of cases is it a critical differentiator — and only for high-complexity tasks: coding, compliance, agentic. Among routine tasks, 71% treat the model as fully commodity. The durable advantage isn't in the model: it's in the orchestration layer.

Commodity Commodity INTERCAMBIABILE INTERCHANGEABLE

42%

Importanza moderata Moderate importance CONTA, MA NON DECISIVO MATTERS BUT NOT DECISIVE

39%

Differenziale critico Critical differentiator TASK COMPLESSI · AGENTIC COMPLEX · AGENTIC TASKS

19%

"Il vantaggio durevole è nello strato di orchestrazione, non nel foundation model." "The durable advantage is in the orchestration layer, not the foundation model."

11 · I SEI MODI DI FALLIRE 11 · THE SIX FAILURE MODES

Sei cause profonde.
Sei contromisure. Six root causes.
Six countermeasures.

Il 61% delle implementazioni ha avuto almeno un fallimento significativo prima di raggiungere il valore in produzione. Stanford ha consolidato queste esperienze in sei cause profonde, non in sintomi. Tocca le righe per espanderle. 61% of implementations had at least one significant failure before reaching production value. Stanford consolidated these into six root causes, not symptoms. Tap rows to expand.

SINTOMI SYMPTOMS

I pilot si bloccano e non scalano maiPilots stall and never scale
Bassa adozione nonostante il deploymentLow usage despite deployment
Nessun champion internoNo internal champions

CONTROMISURE HOW TO OVERCOME

Mandato CEO visibile, legato agli OKRVisible CEO mandate tied to OKRs
Inquadrare l'AI come rimozione del lavoro ripetitivo, non sostituzione delle personeFrame AI as removing repetitive tasks, not replacing people
Dare voce a junior ambassador per bypassare il middle management resistenteEmpower junior ambassadors to bypass resistant middle management
Training strutturato su use case specifici, non solo accesso al toolStructured training on specific use cases, not just tool access

SINTOMI SYMPTOMS

Il modello dà risposte generiche o sbagliateModel gives generic or incorrect answers
Output di qualità inferiore a quanto produrrebbe un dipendente espertoOutput quality below what an experienced employee would deliver
Gli utenti perdono fiducia e smettono di usarloUsers lose trust and stop using it

CONTROMISURE HOW TO OVERCOME

Costruire un'architettura dei dati accessibile prima di iniziareBuild accessible data architecture before starting any AI project
Fare della documentazione un prerequisito, non un ripensamentoMake knowledge documentation a prerequisite, not an afterthought
Usare l'AI stessa per estrarre e strutturare la conoscenza tacitaUse AI itself to extract and structure tacit knowledge from employees

SINTOMI SYMPTOMS

Mesi di attesa per approvazioniMonths waiting for approvals
Use case ristretti a zone "sicure" a basso valoreUse cases restricted to low-value safe zones

CONTROMISURE HOW TO OVERCOME

Coinvolgere legal come partner dall'inizio, non come gatekeeper finaliEngage legal early as partners, not last-minute gatekeepers
Implementare PII scrubbing, redaction e audit trail dal giorno 1Implement PII scrubbing, redaction, and audit trails from day one
Costruire processi di risk e controlli prima che vengano richiestiBuild risk and controls processes before they are demanded

SINTOMI SYMPTOMS

Il sistema fallisce alla scala di produzioneSystem fails at production scale
Cicli di rework costosiCostly rework cycles
Gli utenti perdono fiducia dopo aver visto erroriUsers lose trust after seeing errors

CONTROMISURE HOW TO OVERCOME

Costruire framework modulari che assorbano l'evoluzione tecnologicaBuild modular frameworks that absorb rapid tech evolution
Approcci ibridi: 80% tecnologia, 20% rifinitura umanaUse hybrid approaches: 80% technology, 20% human refinement
Validazione doppio modello prima di affidarsi a un solo outputStart with dual model validation before trusting single outputs

SINTOMI SYMPTOMS

Soluzione in cerca di un problemaSolution looking for a problem
La leadership uccide il progetto prematuramenteLeadership kills project prematurely

CONTROMISURE HOW TO OVERCOME

Mappare i processi end-to-end e trovare prima i veri colli di bottigliaMap processes end to end and find real bottlenecks first
Validare con utenti finali, non solo con sponsor esecutiviValidate with end users, not just executive sponsors
Aspettarsi che la maggior parte dei progetti AI fallisca al primo tentativoSet expectations that most AI projects fail on the first attempt
Definire il successo come miglioramento iterativo, non perfezione al day-1Frame success as iterative improvement, not perfection on day one

SINTOMI SYMPTOMS

Iterazione lenta, dipendenza dal vendorSlow iteration, vendor dependency
Il progetto perde priorità quando il champion se ne vaProject loses priority when champion leaves

CONTROMISURE HOW TO OVERCOME

Creare ruoli dedicati di data science — non riconvertire solo lo staff esistenteCreate dedicated data science roles — do not just retrain existing staff
Garantire sponsorship su più livelli di leadershipSecure sponsorship across multiple leadership levels
Costruire capability interna per non dipendere da una sola personaBuild internal capability so progress doesn't depend on one person
Documentare le vittorie continuamente per mantenere il commitmentDocument wins continuously to maintain organizational commitment

12 · IL PLAYBOOK 12 · THE PLAYBOOK

Cinque mosse,
distillate dai dati. Five moves,
distilled from the data.

Le organizzazioni che hanno avuto successo non avevano AI migliore. Avevano processi e esecuzione migliori. Da 51 deployment riusciti emerge un playbook breve. The organizations that succeeded did not have better AI. They had better process and execution. From 51 successful deployments, a short playbook emerges.

Inizia dal lavoro invisibile Start with invisible work

Documentazione di processo, layer di accesso ai dati, change management. Non sono overhead: sono il lavoro vero. Le organizzazioni che li hanno trattati come prerequisiti, non come ripensamenti, sono arrivate prima in produzione. Process documentation, data access layers, change management. They are not overhead: they are the real work. Organizations that treated them as prerequisites, not afterthoughts, reached production faster.

Investi nella misurazione Invest in measurement

KPI chiari prima del deployment. Non solo headcount o cost saving: includi metriche di qualità, customer value e crescita ricavi. You get what you measure. Clear KPIs before deployment. Not only headcount or cost savings: include quality, customer value, and revenue growth metrics. You get what you measure.

Salva tutto Save everything

Anche dati confusi, incompleti, apparentemente inutili. Gli LLM oggi puliscono, strutturano ed estraggono significato da fonti che fino a due anni fa erano scarto. Il costo dello storage è trascurabile rispetto al costo di non avere il dato quando arriva l'use case giusto. Even messy, incomplete, seemingly useless data. LLMs today clean, structure, and extract meaning from sources that were noise two years ago. Storage cost is negligible compared to the cost of not having data when the right use case arrives.

Architettura multi-modello dal day-1 Multi-model architecture from day-1

I modelli sono componenti intercambiabili dentro un layer di orchestrazione che controlli tu. Routing per task: modelli piccoli per la classificazione, grandi per il ragionamento, open per funzioni specializzate. Niente vendor lock-in. Models are interchangeable components inside an orchestration layer you own. Task routing: small models for classification, large for reasoning, open for specialized functions. No vendor lock-in.

Pianifica per l'agentic AI Plan for agentic AI

Il gap di produttività tra agentic e non-agentic (71% vs 40%) si allargherà. Confini decisionali chiari, escalation strutturata, accesso multi-sistema ai dati. Chi costruisce questa infrastruttura ora intercetterà la prossima ondata. The productivity gap between agentic and non-agentic (71% vs 40%) will widen. Clear decision boundaries, structured escalation, multi-system data access. Those who build this infrastructure now will capture the next wave.

Il playbook dell'AI
nelle imprese. The Enterprise
AI Playbook.

Il 95% dei pilot fallisce.
Loro hanno studiato il 5% che funziona. 95% of pilots fail.
They studied the 5% that works.

Stesso use case.
Settimane oppure anni. Same use case.
Weeks or years.

Più autonomia all'AI.
Più produttività. More AI autonomy.
More productivity.

Non sono gli utenti finali.
Sono Legal, HR, Risk. It's not the end users.
It's Legal, HR, Risk.

Riduzione comune.
Ma non maggioritaria. Reduction is common.
But not the majority.

Il 20% dei casi.
Il 71% di guadagno. 20% of cases.
The 71% productivity gain.

Il 6% ha dati puliti.
Eppure funziona lo stesso. 6% have clean data.
And it works anyway.

Il modello è quasi sempre
una commodity. The model is almost always
a commodity.

Sei cause profonde.
Sei contromisure. Six root causes.
Six countermeasures.

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

Cinque mosse,
distillate dai dati. Five moves,
distilled from the data.

Inizia dal lavoro invisibile Start with invisible work

Investi nella misurazione Invest in measurement

Salva tutto Save everything

Architettura multi-modello dal day-1 Multi-model architecture from day-1

Pianifica per l'agentic AI Plan for agentic AI

La finestra si sta chiudendo. The window is closing.

Il playbook dell'AInelle imprese. The EnterpriseAI Playbook.

Il 95% dei pilot fallisce.Loro hanno studiato il 5% che funziona. 95% of pilots fail.They studied the 5% that works.

Stesso use case.Settimane oppure anni. Same use case.Weeks or years.

Più autonomia all'AI.Più produttività. More AI autonomy.More productivity.

Lo sponsor non approva.Lo sponsor guida. The sponsor doesn't approve.The sponsor steers.

Non sono gli utenti finali.Sono Legal, HR, Risk. It's not the end users.It's Legal, HR, Risk.

Riduzione comune.Ma non maggioritaria. Reduction is common.But not the majority.

Il 20% dei casi.Il 71% di guadagno. 20% of cases.The 71% productivity gain.

Il 6% ha dati puliti.Eppure funziona lo stesso. 6% have clean data.And it works anyway.

Il modello è quasi sempreuna commodity. The model is almost alwaysa commodity.

Sei cause profonde.Sei contromisure. Six root causes.Six countermeasures.

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

SINTOMI SYMPTOMS

CONTROMISURE HOW TO OVERCOME

Cinque mosse,distillate dai dati. Five moves,distilled from the data.

Inizia dal lavoro invisibile Start with invisible work

Investi nella misurazione Invest in measurement

Salva tutto Save everything

Architettura multi-modello dal day-1 Multi-model architecture from day-1

Pianifica per l'agentic AI Plan for agentic AI

La finestra si sta chiudendo. The window is closing.

Il playbook dell'AI
nelle imprese. The Enterprise
AI Playbook.

Il 95% dei pilot fallisce.
Loro hanno studiato il 5% che funziona. 95% of pilots fail.
They studied the 5% that works.

Stesso use case.
Settimane oppure anni. Same use case.
Weeks or years.

Più autonomia all'AI.
Più produttività. More AI autonomy.
More productivity.

Lo sponsor non approva.
Lo sponsor guida. The sponsor doesn't approve.
The sponsor steers.

Non sono gli utenti finali.
Sono Legal, HR, Risk. It's not the end users.
It's Legal, HR, Risk.

Riduzione comune.
Ma non maggioritaria. Reduction is common.
But not the majority.

Il 20% dei casi.
Il 71% di guadagno. 20% of cases.
The 71% productivity gain.

Il 6% ha dati puliti.
Eppure funziona lo stesso. 6% have clean data.
And it works anyway.

Il modello è quasi sempre
una commodity. The model is almost always
a commodity.

Sei cause profonde.
Sei contromisure. Six root causes.
Six countermeasures.

Cinque mosse,
distillate dai dati. Five moves,
distilled from the data.