3 Steps for Developing a Successful Analytics Architecture
Many businesses have undertaken Big Data and/or Analytics projects, but for every successful project, there are dozens that have failed or stagnated. Customers are realising they need to understand both the business outcome and the technology in order to target their goals more accurately.
Whether you realise it or not, every organisation is on a journey that will require it to process thousands of times more data than before, to process that data in real time or near real time, and to use the insights from that data to deliver personalised, enhanced services.
Doing that successfully requires you to think about the journey from two points of view: one focused on the business and one focused on the technology. What is the business problem you are trying to solve? What data do you have available? And how can you simplify the technology to accelerate those business outcomes?
The reality, of course, is that most IT infrastructures have not been designed with Big Data and Analytics in mind. The journey to digital breaks traditional IT infrastructure, and a new architecture and new infrastructure are needed to support a data-driven business:
- You need to be able to ingest from a growing number of internal and external data sources
- You need to store more data, for longer, cost-effectively. The more data you can collect, the better the analysis and insights you can gain
- You need to be able to surface that data quickly and easily across the business, or to consumers through applications (see the sketch after this list)
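To make those three needs concrete, here is a minimal sketch in Python. The feed names and record shapes are invented for illustration; the point is simply pulling from several sources, landing everything in one common structure, and exposing it to downstream consumers:

```python
# A minimal, illustrative sketch: ingest from several feeds, land the records
# in one place, and surface them for applications. Feed names are invented.
import json
from typing import Dict, Iterable, List


def ingest(feeds: Dict[str, Iterable[dict]]) -> List[dict]:
    """Normalise records from many internal/external feeds into one landing list."""
    landed = []
    for feed_name, records in feeds.items():
        for record in records:
            landed.append({"source": feed_name, **record})
    return landed


# Invented example feeds standing in for CRM exports, sensor streams, etc.
feeds = {
    "crm": [{"customer": "alice", "value": 120}],
    "sensors": [{"device": "d-17", "reading": 0.93}],
}

store = ingest(feeds)               # store more data, for longer
print(json.dumps(store, indent=2))  # surface it quickly and easily
```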
Customers with traditional OLTP databases are struggling to leverage those technologies for analytics, and for good reason. In the same way that I would not recommend an OLTP database as the engine for advanced analytics, I would not recommend an OLAP database for transaction processing. Yet many organisations continue to try to support and leverage their existing investments by buying additional bolt-ons from their incumbent supplier to try to resolve their challenges. The net result is significantly more cost, vendor lock-in and increasing complexity.
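To make the OLTP/OLAP distinction concrete, here is a small, runnable sketch. SQLite and the table below are stand-ins I have invented purely for illustration, not a recommendation for either workload:

```python
# Illustrative only: the same schema serves two very different access patterns.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, region TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders (customer, region, amount) VALUES (?, ?, ?)",
    [("alice", "EMEA", 120.0), ("bob", "APAC", 80.0), ("carol", "EMEA", 200.0)],
)

# OLTP shape: fetch or update one row by key -- row stores and indexes excel here.
print(conn.execute("SELECT * FROM orders WHERE id = ?", (2,)).fetchone())

# OLAP shape: scan and aggregate whole columns -- columnar analytic engines excel here.
for row in conn.execute("SELECT region, SUM(amount) FROM orders GROUP BY region"):
    print(row)
```

An engine tuned for the first shape rarely shines at the second, which is why bolting analytics onto a transactional database tends to disappoint.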
Some organisations chose to invest in Hadoop and the associated ecosystem of open-source solutions. Fast forward 3-5 years and many have reached the point where traditional Hadoop architectures are causing challenges of their own. They have a (perhaps) cost-effective platform for storing large amounts of data, but they are quickly realising that getting insight from that data can be extremely expensive in terms of hardware and, again, complexity.
Some organisations are just getting started, learning to use visual analytics simply to replace their historical Excel reporting. Even here, many of the companies I talk to are experiencing challenges. Like mobile applications, visual analytics solutions need to be engaging and they need to be fast. In today's world, if users have to wait more than a few seconds for an answer, the solution you have deployed will probably be considered a failure.
So how can you resolve these challenges whilst leveraging the investments you have already made? Firstly, despite what you might have been told, there is no single solution. Big Data and Analytics is an ecosystem play: certain products are better suited to certain use cases, and in many cases certain technologies are complementary. The overarching question is how you can leverage multiple back-end data stores with multiple front-end tools whilst reducing complexity and increasing performance.
Step 1 - Create a high-speed, scalable, cost-effective Enterprise Analytics Hub
In today's world, speed is everything. Whether it is surfacing the latest information to the business in a near-real-time dashboard or running the latest risk-prediction algorithm, performance is key. Even if your current performance is acceptable, I have yet to meet a customer who would not like to do more, or add more data points to their calculations to get even finer insights. The solution is an analytic database designed from scratch for speed, one that aggregates your various data sources and front-end tools whilst accelerating your analytic workloads.
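As a toy illustration of why purpose-built analytic engines are fast, the sketch below (my own example, not any particular product) compares aggregating values scattered across row objects with aggregating the same values held as one contiguous column, which is essentially what a columnar analytic database does:

```python
# Toy illustration: aggregating one attribute over many rows touches far less
# memory when the values are contiguous. Row count and values are arbitrary.
import time

import numpy as np

n = 500_000
rows = [{"id": i, "region": "EMEA", "amount": float(i % 100)} for i in range(n)]  # row store
amounts = np.fromiter((r["amount"] for r in rows), dtype=np.float64, count=n)     # column store

t0 = time.perf_counter()
total_rowwise = sum(r["amount"] for r in rows)  # walk every row object
t1 = time.perf_counter()
total_columnar = amounts.sum()                  # one contiguous scan
t2 = time.perf_counter()

print(f"row-wise: {total_rowwise:.0f} in {t1 - t0:.4f}s")
print(f"columnar: {total_columnar:.0f} in {t2 - t1:.4f}s")
```

Real analytic databases add compression, parallelism and clever indexing on top, but the contiguous-scan principle is the heart of the speed-up.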
Step 2 - Simplify your environment
Inevitably, most organisations have multiple tools to ingest, store, wrangle, surface and act upon data. Multiple data sources mean creating a mesh of interconnectivity between tools and data sets, which adds complexity and can impact performance. A better idea is to connect them all through an intelligent, centralised hub. This enterprise analytics hub should be able to accommodate the needs of all the data consumers and their tools. For the BI analyst it should support SQL. For the line of business it should integrate with visual analytics and dashboard tools. For the data scientist it should support advanced analytic functionality and integrate languages such as R, Python and Lua so that code can be executed natively within the database.
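To illustrate what "executed natively within the database" means, here is a minimal sketch. The UDF syntax of a real analytics hub is vendor-specific, so SQLite's create_function is used here purely as a runnable stand-in for the idea of shipping code to the data rather than exporting the data to the code:

```python
# The point of in-database execution is moving code to the data. Vendor UDF
# frameworks differ; SQLite's create_function is only a minimal stand-in.
import math
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (amount REAL)")
conn.executemany("INSERT INTO sales VALUES (?)", [(10.0,), (100.0,), (1000.0,)])

# Register a Python function so analysts can call it from plain SQL.
conn.create_function("log_amount", 1, lambda x: math.log10(x))

for row in conn.execute("SELECT amount, log_amount(amount) FROM sales"):
    print(row)
```

The same hub then serves the BI analyst's SQL, the dashboard tool's queries and the data scientist's code from one place, instead of three disconnected stacks.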
Step 3 - Connect your data silos and use them for what they were designed for
Technically, consolidating data silos is pretty simple to do; politically, it can be a nightmare. Data, like knowledge, is power, and consolidating data across different business units can be incredibly difficult. Couple this with the fact that many organisations have data in the cloud as well as on-premise, and the challenges are exacerbated. The solution is to connect, rather than consolidate, using a data virtualisation framework: the ability to aggregate sources of data, whether they are on-premise or off-premise, structured or unstructured, whilst at the same time pushing relevant analytic queries down to the underlying technologies best suited to store and process the data they hold, e.g. Hadoop for unstructured data and Oracle for structured OLTP data.
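Here is a minimal sketch of that connect-rather-than-consolidate idea, with invented class and connector names: a virtualisation layer keeps a catalogue of where each dataset lives and pushes each query fragment down to the engine that owns the data.

```python
# A minimal sketch of data virtualisation with query pushdown.
# Class names, source names and the stub connectors are all invented.
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Source:
    name: str
    kind: str                   # e.g. "oltp", "hadoop"
    run: Callable[[str], list]  # executes a query fragment on that engine


class VirtualHub:
    def __init__(self) -> None:
        self.catalogue: Dict[str, Source] = {}

    def register(self, dataset: str, source: Source) -> None:
        self.catalogue[dataset] = source

    def query(self, dataset: str, fragment: str) -> list:
        # Push the fragment down to whichever engine owns the data,
        # instead of copying the data into a central store first.
        return self.catalogue[dataset].run(fragment)


# Invented stand-ins for real connectors (JDBC, Hive, etc.).
oracle = Source("oracle", "oltp", lambda q: [f"oracle ran: {q}"])
hadoop = Source("hadoop", "unstructured", lambda q: [f"hadoop ran: {q}"])

hub = VirtualHub()
hub.register("customers", oracle)    # structured OLTP data stays in Oracle
hub.register("clickstream", hadoop)  # unstructured data stays in Hadoop

print(hub.query("customers", "SELECT * FROM customers WHERE id = 42"))
print(hub.query("clickstream", "SELECT count(*) FROM clicks"))
```

The data never has to leave the system that owns it, which sidesteps both the political battles and the cloud/on-premise split.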
The result of these steps is an architecture that empowers analytics within a business of any size, whilst leveraging previously purchased technology to maximise your return on investment.
What are your thoughts?
Great article Seb, thanks for sharing. Based on what I have seen, the challenge around data silos and the ability to drive value from relevant data sources can also come from two things: 1. Either the business isn't aware of the data or its potential value to the organisation, or 2. Internal rules/regulation/permission can sometimes be a barrier or bottleneck to better execution. I saw it recently where a pricing team within a large organisation weren't able to get access to a data source, as the group company controlled it and didn't permit the data to be passed to individual departments. Thoughts?
Solid principles. The data silos are a challenge. However, just dumping everything indiscriminately into a data lake also has its challenges.
Great article Seb!
Nice piece. It's getting at those pesky data silos though...
Great piece!