What is a Data Domain?
Nicola Askham
DataIQ 100 2022 | Award Winning Data Governance Training | Consultant | Coaching | Data Governance Expert | D.A.T.A Founding Committee
This is a very short and succinct question, and I thought the answer was likely to be the same. However, when I was sent this question, I was very surprised to see that it was a former colleague from my very first consultancy who submitted it!
I was taken aback at first because I thought to myself ‘surely, they would know the answer to this?!’ but on reflection I realised this is yet another example of how we data professionals are actually very bad at defining things and as data governance professionals,
that's even worse because we spend our time helping others and asking others to write definitions for their data and yet we so often don't define the terms we use well enough for others to understand.?
So, I decided that this was actually an excellent question and was definitely one that I should answer.
The first thing I did was look at one of my most commonly used reference books, the DAMA Dictionary of Data Management. This is an excellent reference book for anyone in data governance and as a DAMA member I would highly recommend it. However, on this occasion, it did take me aback…
I opened the page at Data Domain and was quite surprised at the definition it gave. It states that ‘a data domain is a set of allowable values for a data attribute’. However, that is not how I use the term and I think that is the perfect example of what happens as data professionals. We start using a term and people we work with start using it, it proliferates, but we're not necessarily using it for its original intention.
While a data domain is perhaps terminology more commonly used in data modelling and in databases, we use it a lot in the data governance world, but in my view, with a slightly different meaning. We clearly don't mean ‘a set of allowable values for a data attribute’ -that's very techy and data geeky and not at all the type of langue we would want to use when we're trying to talk to business users. So, what do we mean?
Well, when I use the term, I mean a logical grouping of data - something where we can tell where it starts, and it ends. From my point of view, I'm normally trying to find identify data domains so that I can identify data owners.
领英推荐
For example, you might call customer data a data domain. Or finance data, HR data, product data, supplier data. These are all ideas of logical groupings of data that all relate together.
It's then the work of data governance to work out the details of what is actually included in those domains… but that is what I, and many professionals that I work with, mean when we say, ‘data domain’.
Sometimes I don't use the word data domain. I talk in terms of ‘data set’, which is any logical grouping of data and I think you can use the words ‘data set’ and ‘data domain’ interchangeably. Just make sure that you actually understand what you mean when you use the term and explain to the business users that you're talking to what it means to avoid all confusion.
So, there you have it. That’s my definition of a data domain. I hope you find it useful. If you do, please help me on my mission to help as many people as possible be successful with data governance by sharing it on your choice of social media.
I really appreciate your help in getting the message out.
Originally published on?www.nicolaaskham.com
Data Governance Lead
2 年So topical in my world right now Nicola! Thanks for sharing :)
Data Architect
2 年Hi Nicola Askham, thank you for this elaboration. Christopher Bradley, I can only agree :-) I'll make a post on Data Product Domains and Information Domains to trigger this discussion further :-)
CDMP Fellow, Information Strategist, Independent Advisor & Trainer. Vice President Professional Development at DAMA International
2 年1/3 This term first appeared in Data Modelling in the 1970’s with the term “Attribute Domains” & has been widely used. AKA Value domains, it's a part of Information Engineering, Barker, IDEF etc etc and is incorporated into pretty much every leading Data Modelling tool. Also, Attribute domains are the basis upon which the Data Quality Dimension “Validity” is based.? ? There are two types of attribute domains: range and coded value.
Head of Data Governance at Saint-Gobain Group
2 年Thanks Nicola for this article. "We start using a term and people we work with start using it, it proliferates, but we're not necessarily using it for its original intention" : it is so true! And so different definition of Data Steward, Data Owners, Data Catalog, Data Lineage and so on. Is there really something standard, in details, in Data Governance? Or have we only high level commonly agreed definitions?
Data Governance & Management Expert
2 年You're right, Nicola. As I made a lot of database design when I started my career, the fisrt time I heard this term in data governance, I though of the values that can be set to a particular data. As this term was used first in data modeleling, maybe we can think on using a clearer term in data governance: data business domain, data subject area, or something alike.