Much Like Society, Data is better with Democracy.
Dennis Balada
Senior Business Relationship Lead @ Origin Energy | SME, Engineering, Charging, Fleet| I make the transition to EV easy and commercially viable.
Becoming a data-driven organisation remains one of the top strategic goals of many companies I work with.
Most are well aware of the benefits of becoming intelligently empowered: providing the best customer experience based on data and hyper-personalisation; reducing operational costs and time through data-driven optimisations; and giving employees super powers with trend analysis and business intelligence.
They have been investing heavily in building enablers such as data and intelligence platforms. Despite increasing effort and investment in these platforms, organisations find the results middling. Why is that so?
In this article I'll touch on why I think this is the case and how you, as a business leader, can solve it.
Monkey see and Monkey do
Here in Australia, there's a trend of organisations working diligently to stamp out lines of business having the control and freedom to make decisions and spend accordingly, mostly because at one point it was like the Wild West: hundreds of investments with poor integrations, shadow IT popping up, and siloed data and security concerns being normal.
This led domain architects to build or buy solutions that would eventually lead to centralised, monolithic and domain-agnostic data platforms to remedy this.
Essentially, we have been moving away from data ownership that is specific to certain domains towards centralised data ownership that is not domain-specific, and we have been very proud of creating the biggest monolith of them all: the big data platform.
This worked in the past, prior to the explosion of data and cloud adoption, but in today's world it has led to significant problems.
Centralised and monolithic "Big government" style ownership.
This centralised model can work for organisations with a small number of customer and consumer types, but it fails for companies with many different types of customers and many sources of data. The more data that is available everywhere, the harder it becomes to control all of it in one place. This is especially true for data about customers: there are more and more sources of customer information, both inside and outside of organisations. Trying to store all this data in one place will limit our ability to use all the different sources of information.
The Titanic Effect - Inability to move quickly
Organisations also need to experiment quickly, fail fast and learn from previous mistakes, which means that there are more ways that data from the platform can be used. This in turn means that more transformations of the data (aggregates, projections and slices) are needed to satisfy organisations' needs for data. However, the long response time to satisfy data consumers' needs has been a point of friction for organisations in the past and remains so in well-established data platform architectures such as data lakes and data warehouses.
Ironically siloed ownership and Frustrated users.
Siloing data engineers from the operational units is not sustainable. The platform's hyper-specialised teams have little understanding of their source domains and need to serve a diverse set of needs, whether analytical or business-intelligence related. Without clear guidance on where to find the domain experts within the organisation who can provide access for consuming applications that use big data tooling such as Spark, this separation only leads to suboptimal outcomes, due to a lack of alignment across functions both internally and externally.
Data engineering centralisation creates disconnected source teams, frustrated consumers fighting for a spot at the top of the data platform team's backlog, and an overstretched data platform team.
How do we Evolve Past this?
As I have explained above, centralised, monolithic and domain-agnostic data platforms have produced a lot of learnings over the last decade or so. From those learnings, businesses are now realising that decentralising and democratising data, making it available everywhere and interconnected, is incredibly important.
This is called Data Mesh
Data Mesh emphasises data governance and data sharing across organisational silos. The data mesh approach encourages organisations to build data products that are relevant, meaningful, shareable, and governed by data policy.
A data mesh architecture includes a data hub, data proxies, data services, and a discovery layer. The data hub is the central repository for all data products. Data proxies are used to access data from disparate data sources. Data services provide APIs for data access and management. The discovery layer aids in the discovery of data products and their underlying data sets. A data mesh provides a flexible, scalable way to manage data across an organization. It enables organisations to better utilize their data assets and build better data products.
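To make those four pieces a little more concrete, here is a minimal Python sketch. It is only an illustration under my own assumptions: the class names (`DataProduct`, `DataHub`), the method names and the sample products are all hypothetical and do not come from any particular vendor or tool. The `fetch` callable stands in for a data proxy, `read` for a data service, and `discover` for the discovery layer.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class DataProduct:
    """A domain-owned dataset published for others to use (hypothetical shape)."""
    name: str
    domain: str
    tags: List[str]
    # Stands in for a data proxy: a function that reads from the source system.
    fetch: Callable[[], List[dict]]


@dataclass
class DataHub:
    """Stands in for the data hub: a catalogue of registered data products."""
    products: Dict[str, DataProduct] = field(default_factory=dict)

    def register(self, product: DataProduct) -> None:
        """A domain team publishes its product to the hub."""
        self.products[product.name] = product

    def discover(self, tag: str) -> List[str]:
        """Discovery layer: list product names that carry the given tag."""
        return sorted(p.name for p in self.products.values() if tag in p.tags)

    def read(self, name: str) -> List[dict]:
        """Data service: one access path, whatever the underlying source is."""
        return self.products[name].fetch()


# Two domains publish their own products; the sample rows are made up.
hub = DataHub()
hub.register(DataProduct(
    name="fleet_charging_sessions",
    domain="fleet",
    tags=["ev", "usage"],
    fetch=lambda: [{"site": "A", "kwh": 42.0}],
))
hub.register(DataProduct(
    name="customer_accounts",
    domain="billing",
    tags=["customer"],
    fetch=lambda: [{"customer_id": 1}],
))

print(hub.discover("ev"))
print(hub.read("customer_accounts"))
```

The point of the sketch is the split of responsibilities: each domain keeps ownership of how its data is fetched, while consumers only need the hub's discovery and read paths.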
Wait! Silos... Isn't this full circle?
Much like some first-love couples who break up and end up together later in life, grown up and matured (hopefully happily ever after), building a data mesh has taken the learnings of time and applied principles, which are:
I don't know if I'm coining the acronym "SADIST" here; feel free to use it if you have a sense of humor.
What's important is that businesses invest in cross-functional skills and teams, implement policies and governance, and adhere to the guiding principles, so as to avoid going backwards.
Wrapping up, I'll show you what this paradigm shift looks like in the real world in this diagram. What's key to note is that each domain has its own preferences and toolboxes, but all are interoperable and share data in one big cohesive web.
Cloudera, where I work, specialises in bringing this together and enabling our customers to implement modern data architecture principles like data mesh.
I suggest you check out our site or send me a message if you're interested to find out more.
Thanks for reading. See you on the next one.