6 steps for business people to lead data analytics initiatives
Cameron O'Rourke
Technical Product Marketing and Product Management for Data, Analytics, AI/ML and Web3 Projects | Ex-Oracle | Six SaaS Startups | Developer | Writer | Video Expert
Originally published on the Incorta blog, April 7, 2021
Faced with new challenges and pressures, more and more non-technical business people are trying analytics for the first time. Analytics is about looking for patterns and correlations in your data in order to gain insights about your business, or even help predict the future. This article is a tour for the data curious of the steps involved in an analytics project, along with some ideas and practices to keep in mind.
But first, a bit of philosophy. I've been in the data space for roughly 30 years, and have seen pretty much every way of shuffling data. But only in the last few years have we had the technology to do at-scale data analysis in a no-compromise, speedy, and continuous manner. And it's a bonus that in the cloud, we can do it in a way that is also cost-effective and accessible to everyone. This could not come at a better time.
The challenges of the last 18 months—economic uncertainty, shutdowns, and social distancing—have forced business teams to embrace new levels of creativity to survive the disruption. In particular, there is a great need for better data and insights to help guide business adaptation or even transformation.
Whereas data analytics has traditionally been the province of IT departments, more and more non-technical business leaders are not waiting to act. They are undertaking new data analysis initiatives to chart a course through this fast-changing landscape.
At the same time, the new technologies enables a more agile approach to analytics. With the right technical underpinnings, analytical discovery can be iterative, and deliver value to the business much sooner than traditional methods.
What follows is a data-analytics project roadmap that is informed by this philosophy and approach. Since it is meant for business people, I've kept it very high-level. I'll apologize now to my more technical colleagues for any oversimplifications. ;-)
1. Access Your Raw Data
The first step towards agile analytics is to acquire data from the systems you are using every day.
What is critical here is that you: 1) gain access to the ‘raw’ or ‘unprocessed’ data, and, 2) that the data can be continuously refreshed on a schedule or on demand. To understand why, we need to get slightly technical.?
Traditional databases store transactional data in tables with relationships in what is called 3rd Normal Form. This is a way of designing applications to reduce the duplication of data, maintain data integrity, and simplify data management. However, this is not the data you will get from a traditional data warehouse. The data from your data warehouse will have been “reshaped” in some way.
In the figure below, see how four related transactional data tables provide a complete picture of sales orders. (Note: This example is dramatically simplified for clarity.) Most any question about sales can be answered by combining the data in the tables in different ways.
The data warehouse table at the bottom of this figure is how the data might be reshaped in a typical BI/Analytics project. The business person probably told a data engineer, “I’d like to see monthly revenue, broken out by country and product category.” The data engineer dutifully wrote code to pull together the data from the four detail tables to answer the question at hand.
What’s the problem? While we certainly get the answer we were looking for, what if we want to ask a different question, such as “What is our best selling product?” That is going to need a different transformation of the data. So, our business person sets up another meeting with the data engineer—and waits. Having to keep going back to an intermediary really impedes your ability to be agile.
Another problem with this approach is that there is no way for the business user to dig into those monthly revenue numbers to see the composition or to check for errors. By having the data tables in their original form, it is easy to drill into the details to find errors in the data, problems with the business, or just to explore new questions.?
The second critical thing I mentioned was that your data should be continuously refreshed on a schedule or on demand. The converse of this would be a spreadsheet, or a snapshot of the data at a moment in time. The problems are numerous.
What we want is the raw, unprocessed data from your application in an online system, that is continually refreshed, with adequate security controls.
2. Explore the Data
With access to raw, detailed data (as opposed to an IT engineers' best guess as to what you need) you can take a more free-form iterative approach that prioritizes exploration and discovery. Answers to one question naturally lead to other questions. You don’t know what you don’t know, and discovering something new and unexpected is what leads to business breakthroughs.
Business users are usually pretty good at working out how the different data tables work together and how to interpret data values. However, if the source is too complex or cumbersome to work with 'as-is' there is always an option to create 'business views.' Views aren't copies of the data, they are just a different organization of the data that is easier to understand. IT people can help you create these views, but remember, you know your own systems best and what you don't want is a distillation, or subset of your data because then you'll lose the ability to freely traverse the data and drill into the details.
As you explore, you will encounter two major categories of data, each of which serves a different analytical purpose:?
Measures —? Sometimes called facts, measures are numerical values such as currency, quantities, and percentages that are combined with dimensions to arrive at reportable business metrics.?
Dimensions — Sometimes called attributes, dimensions are used to segment the data into reporting groups, and represent various facets of the business such as product categories, sales territories, fiscal quarters, organizations, or transaction types. For example, you could group revenue by the month and region dimensions.?
At this point, you are ready to try a few explorations, combining different measures and dimensions together to gain familiarity with the data. As you work, review your results against the source system to be sure the numbers check out. Don’t wait for perfection. The important thing is to start now, iterate, discover, and grow from there as you become increasingly confident.
领英推荐
3. Decide What to Measure
Many times you don’t have a choice about what to measure. You are given KPIs (Key Performance Indicators) or OKRs (Objectives and Key Results) that your department is held accountable to. There may also be measurements that are dictated by your industry.?
Domain experts may have more/better ideas of what to measure to ensure business success, based on their experience. Once you explore the data, you may find some measurement that everyone else has missed, but you think could generate important insights. Analyzing data from outside your firm can also open up new avenues for insights. For example, many companies with interdependent business operations share data. You can evaluate not only your performance, but your performance in relation to that of your suppliers, vendors, and downstream customers, if appropriate.
Industry associations are another good source of metrics. For example, if your business involves a supply chain, the Association for Supply Chain Management publishes the SCOR model which includes over 250 standard metrics calculations. Consulting firms such as eCapital Advisors and analyst firms such as Gartner publish industry-specific metrics and benchmarks that you can analyze your performance against.
4. Analyze Your Data
There are many different tools and techniques for analyzing data that can lead to new discoveries. Here's some of the tools that should be in your analysis toolbox.
Filtering and Sorting — This is a basic technique that you will turn to time and again. For example, suppose you are a salesperson looking through sales data, wondering whether your top customers represent your best upsell opportunities. You can apply filters to show only the data from those customers who spent more than $10,000 per month, in your territory, in the current year. Then you can sort this data by revenue amount and drill into their purchasing activity.??
Aggregations — This involves computing a sum, count, or average for a particular measure sliced by one or more dimensions that are meaningful to you. For example, it might be useful for a retailer to know which product categories are the best sellers by region. Do men’s and women’s coats sell equally well in all regions? Are mountain bikes popular in only some states? A pivot table that puts dimensions along the sides and top of a table, and values in the intersecting cells, is very handy for viewing aggregated measures cut by two or more dimensions. A variation of the pivot table is the heatmap, which shows the hotspots in the table visually.
Trends — Most all transactions have dates, and different parts of the date (the year, month, quarter, day of the week, week of the year, etc.) can be used as a dimension for analysis. For example, what if we want to know the average sales order amount (a measure) by each month (a dimension)? Is it going up or down? Can we detect seasonality using a week-by-week analysis???
Correlations — A correlation analysis can help you quantify the relationship between two or more related variables, so you can understand how one measure might change as the other measures change. The basic way to create this kind of analysis is to plot one variable on the Y-axis, and the other on the X-axis.?
Above all, don’t be afraid to experiment, as performing different types of analysis on new combinations of measures and dimensions, and dropping them into different graphs and charts is how you generate new insights.
5. Visualizing the Data
You might be familiar with bar charts and pie charts. But there are so many other kinds of visualizations that can help you tell a story with your data. Choosing the right data visualization style can both make it easier for you to interpret data, and then cause people to interpret your data in different ways.
Showing the composition of many parts related to a whole is the classic use of a pie chart. But did you know that you can also have multi-level pie charts (sometimes referred to as pie-donut or sunburst charts) that can represent, for example, both categories and sub-category relationships in a single chart?
The bubble chart combines three different measures along with one dimension. The scatter chart is similar, but only plots two measures instead of three, and is also great at revealing correlations.
Data that trends over time is often best shown using line charts, while comparisons (for example, sales by category at a fixed point in time) are often best shown using bar or column charts.
The possibilities are almost endless, but the important thing to remember is that different kinds of charts can be used to support the story you want your data to convey. Again, you’ll want to experiment with visually presenting the data in different ways.
6. Publishing your Results
After gathering, analyzing, and visualizing your data, the final step is to publish your work, either to gain support for your decisions or to allow others to contribute to the analysis and reach their own conclusions.
Dashboards typically contain one or more charts, tables, maps, plots, gauges or key metrics. They group different data visualizations together into a form that people can easily browse. Dashboards are dynamic, interactive tools designed to maximize comprehension and facilitate agile decision-making.
The ordering and organization of the visuals are important. The layout can either be all on a single page, or across multiple tabbed pages. Tabs can break a larger data analysis apart into smaller pieces that can be sequenced to reveal a data story (sometimes this is called data storytelling.)?
You want the flexibility to give users the ability to do their own filtering, and drilling down into the data. For example, you might give the dashboard to a group of regional managers — each manager is going to be interested in viewing the metrics for their region, and may want to drill into the details by salesperson, by customer, or by product category.?
New tools for new times
Businesses today are looking for breakthroughs that will allow them to thrive during challenging times. That has pushed business users to take a bigger role in data analysis projects. Incorta provides the tools for line-of-business teams who want agility today, and room to grow tomorrow.?
The value-driven, agile analytics approach I've described here is fully supported by Incorta, the company I work for. Incorta provides an end-to-end data analytics platform that is both powerful, and easy to use. It is cloud-based, and you can quickly check out the free trial and take it for a spin.
If you are interested in a slightly longer and more portable version of this article, please download the eBook version, The Business Person’s Guide to Analytics.
Strategic Product Management Leader Driving Market-Centric Growth | GTM Strategy | Data Management | Analytics | AI
3 年Nice blog Cameron. Product looks interesting! Happy new year mate