Data Domains & AI Adoption: A Match Made in Heaven
Introduction:
Data is the lifeblood of any modern enterprise organisation. It flows through each and every business across the world and has by its very existence, become a modern day commodity. Indeed, in 2022 the World created close to 92 zettabytes of data. That suffix certainly wasn’t taught when I was saving homework to a 100 kilobyte disk back in the… ahem, 90’s. ??
However, as organisations amass increasing amounts of data, they also face new challenges around data ownership and management. These ownership challenges are exacerbated by the rise of digital transformation, cloud computing, and the growth of data-driven business models. In years gone by, I often commented about “born in the cloud businesses” or “cloud native organisations”. However, with the advent of artificial intelligence (AI), there are businesses and disruptors who are by all intents and purposes “AI-Ready” organisations.
As a result, enterprise organisations are looking for ways to utilise AI to make money, save money and reduce risk. However, with data sprawling across private and public cloud infrastructures, business leaders must get a better handle of their data to ensure that it is of a sufficient quality and integrity to support AI driven use cases.?
In this context, data domains have emerged as a foundational solution to the problem of data ownership in the enterprise, to support the adoption of AI in a safe and secure manner. In this blog, I will cover what data domains are, their key characteristics, the roles of data owners when defining domains and some simple steps to follow when defining them in your business.?
What are data domains??
Don’t worry, we won’t be going to another galaxy with this definition! Though, the very word domain does invoke memories of Star Trek…
In simple terms, data domains are logical groupings of data. Data domains are all about ownership, governance and responsibility. In respect of good governance, data domains refer to a specific area of responsibility within an organisation for managing data assets.?
Typically, domains are defined by a specific subject matter, business function or service/product for which data is collected, stored, and used. By defining data domains, organisations can establish clear ownership, accountability, and control over the management of data assets and ensure that they align with the organisation's data strategy, business objectives and regulatory commitments. This is crucial when considering ethical obligations around the use of AI. Being able to trust where data has come from and prove its origin is crucial, particularly for regulated businesses.?
Upon defining their respective data into domains, organisations can provide a clear and structured approach to managing data quality, security, privacy, and compliance. Ultimately, this aids to remove any question marks and ambiguity about who actually owns the data and what they are responsible for; in terms of maintaining data quality and accessibility to other domains across the organisation.
The role of Data Owners in a domain driven model
Within any given domain, data owners play a crucial role in controlling the use, storage, and maintenance of their data-sets and data products. They are responsible for ensuring the accuracy, quality and security of their data and should be made responsible for deciding who and how to access the data under their control. In simpler terms, a data owner is the person or entity that holds the rights to a set of data and is responsible for managing and protecting it.
Indeed, a single data domain can consist of many different data sources, data products and have many data owners. That aside, there are 7 key responsibilities that data owners should be accountable for when managing and protecting the data in their domain. Namely:
What are the key characteristics of a well defined data domain?
A well-defined, understood and governed data domain typically exhibits the following characteristics:
领英推荐
Establishing these fundamental characteristics and principles can certainly help organisations to better manage their data, which in turn unlocks opportunities to safely explore AI use cases. This is especially so in businesses that are experiencing data sprawl across hybrid cloud estates. Based on personal experience, once your domains are defined the most crucial step is assigning ownership to them and perhaps most importantly making sure that your data owners understand what is expected of them in their roles.?
Simple steps to follow when defining data domains
Needless to say, if you ask 5 people to define their views of data domains within your business, you will likely get 10 different perspectives!?
As such, I’ve outlined some simple steps to follow when trying to determine the data domains within your business:?
In Conclusion:?
Over the course of this blog, I’ve tried to demonstrate that data domains can vary greatly in scope and complexity, but the underlying principles remain the same: to manage and protect data in a way that supports the organisation's goals and values. Indeed, getting the boundaries and ownership of your organisation's data domains clearly defined and understood can be an enabler to increase data quality and accessibility.?
This in turn can support the future adoption of AI enabled capabilities in an enterprise organisation, by ensuring that solid foundations are in place to support the traceability of data’s origin. Which in turn prevents “garbage in, garbage out” syndrome when trying to introduce AI in an enterprise context. Arguably, data domains are a match made in heaven when it comes to AI adoption.
I hope you’ve enjoyed the read and thanks for taking the time to listen to some of my perspectives in this regard.?