What Do You Do with Data?
Shelly Palmer
Professor of Advanced Media in Residence at S.I. Newhouse School of Public Communications at Syracuse University
I gave a speech about marketing and data science last week. Afterwards, one of the CMOs came up to me with panic in his eyes. He said, and I quote, “Help!”
I get it. Big Data and Data Science are overused catch phrases that can mean anything anyone wants them to mean. But the hype doesn’t change the facts. We are being overwhelmed with data, and I can assure you that if you don’t know what to do with it, your competition will.
Data Science Can Help
The fundamental goal of data science, a mash-up of three foundational skills – domain expertise, mathematics and computer science – is to turn information into action. You can learn about how they work together in my article, Are You Ready for Data Science?
The truism “If you ask the wrong question, you’re guaranteed to get the wrong answer” is often tossed about in data science meetings as profound insight. It is not. It is an axiom.
Importantly, the obverse is almost never true. Asking the right question might be a path to enlightenment, but it certainly does not guarantee finding the right answer. To attempt that, you need the appropriate analytic tools and techniques. This is where mathematics and computer science add value.
An Overview of Analytic Techniques
What are the technical members of your data science team going to do with your data? Transform, learn and predict. So, in the interest of searching for the right answers, let’s review a few common classes of analytic techniques and see how you might put them to good use.
Transformational Analytics
Aggregation – a class of techniques used to summarize data including basic statistics such as mean and weighted averages, median, Gaussian distribution and standard deviation. Other aggregation techniques include probability distribution fitting (the repeated measurement of variable phenomena – remember “method of moments” and “maximum likelihood” from Stats class?) and good, old-fashioned plotting points on a graph.
Enrichment – a set of techniques employed to add information to, or fill gaps in, a data set – for example, adding zip + 4 to five-digit zip codes, appending purchase data or credit scores or even simply standardizing prefixes or suffixes.
Processing – everything from data munging or data wrangling (the cleaning up of data) to entity extraction (identifying key terms in unstructured data that have value) to true feature extraction (building derived values from existing data).
Learning Analytics
Regression – a common way to predict the future based on the past by exploring spatial relationships. There are many types of regression techniques, but all share the common goal of predicting the value of a dependent variable where partial related variables are available, or estimating effects of an explanatory variable on the dependent variable.
Clustering – is just what it sounds like. The goal is to group a set of data points so that the ones with the most in common are closest together. Importantly, clustering is not a specific formula; it is accomplished by using a series of algorithms. And it is almost always an iterative process.
Classification – algorithms and other techniques used to identify to what category or subpopulation a data point belongs. When speaking about classifications, you must be careful to also identify the discipline you are speaking about. Statisticians use the term differently than practitioners of machine learning do.
Predictive Analytics
Simulation – a set of techniques used to create a simulated environment for testing predictive models.
Optimization – a wide-ranging tool set for making optimal selections from a set of alternatives. Commonly used for pricing and maximizing yield.
Want Help?
We have a team ready to help you prepare to work with your data, understand the opportunities afforded by machine learning and pattern matching and even do a data science readiness assessment. Just shoot me an email and I’ll be happy to work with you to help you achieve your business goals.
About Shelly Palmer
Named one of LinkedIn’s Top 10 Voices in Technology, Shelly Palmer is CEO of The Palmer Group, a strategic advisory, technology solutions and business development practice focused at the nexus of media and marketing with a special emphasis on machine learning and data-driven decision-making. He is Fox 5 New York's on-air tech and digital media expert, writes a weekly column for AdAge, and is a regular commentator on CNBC and CNN. Follow @shellypalmer or visit shellypalmer.com or subscribe to our daily email https://ow.ly/WsHcb
I find insights in data - Data Analysis/Business Intelligence/Data Science
9 年Yes, some companies have vast amounts of data. These same companies should have a plan for purging data they no longer need. If for no other reason, it reduces the possibility of losing data in case of a breach. There are also laws restricting the retention of some personal data. Like Mr. Adams states in his comment, not all data has value.
Shelly, How about syndicating a 2 minute piece on technology...we would be interested at our Fox station in Springfield for our newscast. I am sure we could help syndicate it to other stations as well.
National Procurement - PulteGroup
9 年Love the quick and fast methods for analyzing data, but the difficulty most organizations face as it relates to big data is how to action the information you're receiving. Be it retail, SaaS, or any other industry - information overload is real.