Why data driven processes are the real bottleneck
The statement is well known: data is the oxygen for AI. And yet many organizations still treat data as a by‑product. They invest in Copilots, agents and intelligent models, but allow them to run on fragmented spreadsheets, outdated records and poorly maintained systems.
The strategic question is therefore uncomfortably simple:
Are we willing to design our processes in a way that truly feeds AI,
or are we unleashing AI on data that is fundamentally unfit for the task?
That difference determines whether AI delivers structural value, or remains stuck in demos and pilots.
“AI does not fail because it is not intelligent, but because we do not give it anything solid to reason with.”
Data driven processes: AI starts before the AI
A data driven organization does not see data as output, but as the foundation of the process.
Every process step: creates data
uses data
and does so in a consistent way
A familiar example makes this tangible. In a data driven order process, every order is fully registered in Business Central. Fields are completed. Definitions are consistent. Statuses are reliable.
For AI, this is gold.
Patterns become visible. Decisions are traceable. Automation becomes scalable.
Organizations that already design their processes this way hold a major advantage today. For them, AI is often simply the next layer.
AI on immature data: the same problem, packaged faster
The reality in many organizations looks very different.
Orders arrive via PDF. Customer information is scattered across CRM, email and Excel. Each department maintains its own version of the truth.
In such an environment, AI must first perform the work that data engineers often spend eighty percent of their time on: cleaning, interpreting and repairing data.
This leads to two familiar problems.
Garbage in, garbage out.
Promising AI initiatives that stall in proof of concept.
The AI is not unintelligent. It simply receives poor input.
The result is Copilots that produce generic answers. Agents that are not trusted. Frustration instead of acceleration.
Why this choice can no longer be postponed
Because AI acts faster and at greater scale. Every error or inconsistency in data multiplies rapidly. What used to be a manual correction becomes an automated mistake.
Many organizations have already experienced this. They enthusiastically unleashed generative AI on documents, only to discover later that:
sensitive data leaked
decisions were based on outdated information
no one could explain how outputs were derived
On top of that, regulation no longer allows room for casualness.
Governance and compliance: data is no longer a technical side issue
The EU AI Act and the GDPR make one thing clear.
Without demonstrable data quality and data lineage, AI is not compliant.
Organizations must be able to explain: which data is used
why that data is relevant
how errors are controlled
who is accountable
Without proper data, documentation and governance, this is simply not possible.
This often means: anonymizing datasets
centralizing data in governed environments
explicitly defining which data AI may and may not use
AI without data management is not just risky. It can become legally untenable.
Implications for process design
Choosing data driven operation means that processes must change.
Concretely:
Customer interactions are always recorded in CRM.
Labels, categories and status fields become mandatory.
Employees are trained on data quality, not only on speed.
Sometimes this even means collecting new data that did not exist before. Think of sensors, logging or additional data points.
If this does not happen and AI is still rolled out broadly, another pattern emerges:
continuous data cleaning
shadow spreadsheets
AI solutions that work in theory but are ignored in practice
And with that, momentum disappears.
Data ownership: from an IT problem to an organizational question
Perhaps the biggest shift lies here.
Data does not belong to IT.
Data belongs to the process.
That requires new responsibilities:
Who owns customer data?
Who safeguards product data?
Who is accountable for data quality?
More and more organizations therefore introduce: data owners in the business
data stewards per domain
KPIs on data quality, not just output
This may feel bureaucratic, but without this foundation AI will eventually collapse.
Executive implications: investing in invisible infrastructure
For executives, this is often a difficult narrative.
Data initiatives rarely deliver immediate customer impact. They are expensive, time consuming and not particularly glamorous. But they are foundational.
Organizations that manage data with the same seriousness as finances, complete with audits, quality scores and dashboards, build a structural advantage.
That requires hard choices:
budget for data cleanup
incentives for correct data entry
clear policies on external AI usage
And sometimes strategic investments, such as acquiring external datasets when internal data is insufficient.
These are not IT decisions. They are board decisions.
In closing
AI is not a magical solution for poor processes. It amplifies what already exists.
Good data makes AI exponentially smarter.
Bad data makes AI exponentially more dangerous.
Organizations that recognize this now and design their processes to be truly data driven ensure that their AI improves every single day.
The rest remain stuck in pilots, wondering why it never quite works for them.
Data is the oxygen for AI.
And without oxygen, even the smartest AI cannot survive for long.


