The biggest misconception in data visualization

Nick Desbarats

Instructor and best-selling author, data visualization and dashboard design | Taught in 15+ countries | Lecturer @ Yale, Columbia | LinkedIn Top Data Visualization Voice

Published: November 16, 2021

This article is reposted from the Practical Reporting blog. Subscribe to the Practical Reporting email list to be notified of future articles like this (one to three per month).

tl;dr: When designing a chart, most people try to come up with the ‘best way to visualize the data’. This often results in charts that are unobvious or useless to readers, though. Instead, we should try to design charts that best answer a specific question or that best communicate a specific insight about the data, even though such charts don’t answer all questions that readers might have about the data.

Like any field, data visualization has some common misconceptions floating around in it. There’s one, though, that I think has done more damage than any other, which is the assumption that…

“When designing a chart, the goal is to find the overall best way to visualize the data.”

“WTF are you talking about?”

How can that be a misconception? Am I suggesting that your goal should be to find a bad way to visualize the data? Obviously not. What am I saying, then?

Well, have a look at the data in the table below and three potential ways of visualizing it for our company’s CEO. Which of the three graphs do you think is the best way to visualize this data, graph A, B, or C?

The answer, of course, is that any one of these graphs could be ‘the best way to visualize this data’, depending on what, specifically, we need to say about the data:

If the CEO needs to know which regions have the highest expenses, then Graph A is ‘the best way to visualize this data’.
If the CEO needs to know which regions are doing a better or worse job of sticking to their budget, then Graph B is ‘the best way to visualize this data’.
If the CEO needs to know which regions are contributing most to the company’s overall budget overage, then Graph C is ‘the best way to visualize this data’.
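To make the three questions above concrete, here is a minimal Python sketch that derives three different measures from one table. The region names and figures are hypothetical placeholders (the table from the article is not reproduced here), and pairing each measure with Graph A, B, or C is an assumption about what those graphs plot.

```python
# Hypothetical expenses and budgets per region (illustrative figures only).
data = {
    "North": {"expenses": 120_000, "budget": 95_000},
    "South": {"expenses": 80_000, "budget": 90_000},
    "East": {"expenses": 50_000, "budget": 35_000},
    "West": {"expenses": 150_000, "budget": 140_000},
}


def measures(rows):
    """Return three per-region measures, each suited to a different question."""
    overage = {r: v["expenses"] - v["budget"] for r, v in rows.items()}
    return {
        # Which regions have the highest expenses? (assumed to match Graph A)
        "expenses": {r: v["expenses"] for r, v in rows.items()},
        # Which regions stick to their budget? (assumed to match Graph B)
        "pct_over_budget": {
            r: 100 * overage[r] / v["budget"] for r, v in rows.items()
        },
        # Which regions drive the overall overage? (assumed to match Graph C)
        "overage": overage,
    }


def top(measure):
    """Return the region with the largest value for a measure."""
    return max(measure, key=measure.get)


m = measures(data)
for name, values in m.items():
    print(f"{name}: top region is {top(values)}")
```

With these made-up figures, each measure puts a different region on top, which is why no single chart of this table can be the best one for all three questions.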

Is any one of these graphs the ‘overall best way to visualize this data’, or the ‘truest representation of this data’? How would we even go about determining that? All three—and many other possible variations—are potentially ‘the best way to visualize this data’, depending on what, specifically, we need to say about the data. None of them is the ‘overall best way to visualize this data’, or ‘the best representation of this data’. In fact, there’s never a single, ‘overall best way’ to visualize any dataset; there are only ‘best ways to say different things about the data’, such as which regions have the highest or lowest expenses, or which regions are doing a better or worse job of sticking to their budgets.

That’s the harsh reality of data visualization that few people seem to realize: Charts never ‘show the data’, they always just say a few specific things about the data. Different ways of visualizing the same dataset make different insights about that data more obvious, less obvious, and not visible at all. Yes, it would be awesome if we could make charts that ‘just show the data’, i.e., that make all possible insights obvious or that answer all possible questions that readers might have about the data, but those charts don’t exist.

“Why not?”

Well, if we try to create a chart that makes all possible insights obvious or that answers all possible questions that readers might have about the data, we’ll always end up with a ‘spaghetti chart’:

Even this doesn’t answer every question that the CEO might have about this data, though. For example, if the CEO wanted to quickly see what fraction of total expenses each region represents, or how these expenses compare to those of the previous year, we’d need to add even more clutter. Indeed, we’d never stop adding clutter to our chart in a quest to ‘just show the data’ because there’s always a virtually unlimited number of things that we could say about any dataset.

“Why don’t we just use a table, then?”

Well, tables do ‘just show the data’ without saying anything about the data. Indeed, tables don’t make any insights obvious at all. For example, based on the table alone in the scenario above, is it obvious which regions are doing a better or worse job of sticking to their budget? Or what fraction of total expenses each region represents? Sure, the reader can get those insights, but they’re going to have to work for them and possibly do some calculations, and they’re far less likely to notice interesting or unexpected patterns or relationships in a table of numbers than in a graph.

Tables are also many times slower to consume than graphs and require a lot more cognitive effort to process, which substantially increases the risk that readers won’t get the insights they need from a table—or will just skip over it altogether—because it requires too much cognitive effort to consume. In most situations, then, saying a few things about the data (i.e., showing a graph) is far more useful than saying nothing about the data (i.e., showing a table).
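As a rough illustration of the work a table leaves to the reader, the sketch below computes each region's share of total expenses and prints it next to a crude text bar. The figures are hypothetical, and the text bars are only a stand-in for a real graph.

```python
# Hypothetical expenses per region (illustrative figures only).
expenses = {"North": 120_000, "South": 80_000, "East": 50_000, "West": 150_000}

# The calculation a reader would have to do in their head from a raw table.
total = sum(expenses.values())
share = {region: amount / total for region, amount in expenses.items()}

# Sort by share and draw one '#' per 2.5% of total expenses.
for region, frac in sorted(share.items(), key=lambda kv: kv[1], reverse=True):
    bar = "#" * round(frac * 40)
    print(f"{region:<6}{frac:>7.1%}  {bar}")
```

The raw numbers contain the same information, but the sorted bars make the ranking and the relative sizes visible at a glance.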

“So, what does all this mean when it comes to actually designing charts?”

The next time you sit down to create a new chart, instead of asking yourself, “What’s the best way to visualize this data?”, ask yourself, “Do I know why I’m creating this chart?”, i.e., do you know what specific insight or answer you need the chart to communicate about the data? If the answer to that question is “no” (which it will be surprisingly often), you need to step away from the charting software and go find out. Perhaps you’ll need to do some exploratory analysis, or speak more with the target audience but, one way or another, you need to figure out what, specifically, your chart needs to say about the data. If you don’t, many of your design choices (chart type, color palette, etc.) will be quasi-random guesses, and the chances that the audience will get what they need from your chart will be low.

Once you’ve figured out what, specifically, your chart needs to say about the data, the next step is to accept that whatever design you come up with is going to communicate that specific insight or answer that specific question clearly (hopefully, anyway…), but there will be many other potentially interesting questions and insights that won’t be obvious in your chart, or possibly not visible at all. Not only is that O.K., it’s the only way it can work (unless you give your audience a spaghetti chart).

What happens if, try as you might, you can’t find out specifically why the audience needs to see a particular dataset or needs to see a chart? For example, perhaps the CEO has simply asked for “expenses for each department” and you don’t have the opportunity to ask them why they need that information because they’re too busy to meet with you. These are unpleasant situations to be in, but they do happen. In my Practical Charts course, we discuss strategies for increasing the odds that we end up giving the audience something that will be at least somewhat useful to them, but these strategies will have to be a topic for a future article since this one’s already longer than I’d like it to be. The bottom line, though, is that our chart probably won’t be as useful to the audience as it could be if we design it without knowing specifically what it needs to communicate about the data.

“So, are you also saying that…”

No. I want to be clear about a few things that I’m not saying:

I’m not saying that all the ways to visualize a given dataset are ‘potentially best’ ways. For any dataset, there are plenty of ways to visualize it that aren’t useful in any plausible scenario, that are fundamentally confusing, or that are just plain misleading:

Outside of obviously bad ways such as these, though, there are always many ‘best ways’ to visualize any dataset.

I’m not saying that, because there’s never a single ‘overall best way to visualize this data’, whether one chart is better than another comes down to personal opinion or preference. For any given scenario (the nature of the data + what we need to say about that data + knowledge of the audience), different chart designs will be objectively better or worse ways to visualize that data for that scenario. How could we know if one chart design is objectively better than another for a given scenario? We could recruit representative members of our target audience and run an experiment to test the different chart designs to determine which one most effectively answers the question at hand or communicates the insight we need to communicate, and that ultimately best achieves whatever effect we want to have on the target audience. Of course, we usually don’t have the time or resources to run such experiments, so part of learning data visualization involves getting good at making educated guesses about which chart designs would perform best, were we to test them experimentally with members of our target audience. Having some knowledge of major findings from data visualization research studies is helpful and can make those guesses more educated, but research findings generally aren’t specific enough to point to the best chart in a specific scenario. Whether we have the resources to determine which chart design is objectively better or not, though, the fact remains that one of the designs is always objectively better than the others. It’s not an inherently subjective assessment.
I’m not saying that, as long as you know specifically what you need to say about the data, you’ll automatically be able to design an effective chart. It takes a fair amount of skill to take some data, a specific reason why the audience needs to see that data, and knowledge of the target audience (level of dataviz sophistication, current concerns, etc.), and turn all that information into an effective chart. The chart creator has to know how to choose chart types, chart arrangements, color palettes, scale formatting, and how to make many other types of design decisions. These are the skills that I teach in my Practical Charts course, and it’s 14 hours long…
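The kind of experiment mentioned in the second point above could be summarized along these lines. This is only a sketch: the trial records are invented, the design labels "A" and "B" are placeholders, and a real study would need far more participants and a proper statistical test before drawing any conclusion.

```python
from statistics import median

# Hypothetical trial records: (design, answered_correctly, seconds_to_answer).
trials = [
    ("A", True, 6.1), ("A", True, 7.4), ("A", False, 12.0), ("A", True, 5.8),
    ("B", True, 9.9), ("B", False, 14.2), ("B", False, 11.5), ("B", True, 10.3),
]


def summarize(records):
    """Group trials by chart design and report accuracy and median answer time."""
    by_design = {}
    for design, correct, seconds in records:
        by_design.setdefault(design, []).append((correct, seconds))
    return {
        design: {
            "accuracy": sum(c for c, _ in rows) / len(rows),
            "median_seconds": median(s for _, s in rows),
        }
        for design, rows in by_design.items()
    }


summary = summarize(trials)
for design, stats in summary.items():
    print(design, stats)
```

Accuracy and answer time are just two plausible measures of how well a design answers the question at hand; which measures matter depends on the effect the chart is meant to have on the audience.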

“Umm, this seems kind of obvious…”

The fact that there isn’t a single ‘overall best’ way to visualize a given dataset may seem obvious to some when it’s spelled out like this, but getting out of the mindset of ‘trying to find the best way to visualize this data’ and into the mindset of ‘designing the chart that best communicates a specific insight or best answers a specific question’ requires a fundamental shift in thinking that relatively few people seem to have made. I regularly hear even well-known experts discussing which chart design ‘best represents the data’ without even mentioning what, exactly, the chart is supposed to do. As I see it, though, that’s like arguing about whether a hammer or a screwdriver is ‘the best tool’ without ever mentioning if we need to pound in a nail or tighten a screw.

“But is this really the biggest misconception in data visualization?”

I think so, yes…

It’s very widespread. While some people have fully internalized the idea of trying to find the best way to answer a specific question or communicate a specific insight, most still try to find ‘the best way to visualize this data’, without considering the specific reason why the audience needs to see that data in the first place.
It’s caused innumerable arguments regarding which of two (or more) chart designs is ‘better’, which could have been instantly resolved if everyone involved had realized that one chart design would be ‘the best chart’ in one scenario, and the other chart design would be ‘the best chart’ in a different scenario.
If we design a chart by trying to find ‘the best way to visualize this data’, there’s a dramatically higher risk that the target audience will find the resulting chart to be too unobvious—or possibly even useless—because many of our design choices (chart type, color palette, highlighting, etc.) will be guesses since they won’t be geared around communicating a specific insight or answer.
Trying to find ‘the best way to visualize this data’ makes designing effective charts a lot harder than it needs to be. Once we realize that all charts just say a few things about the data, it becomes a lot easier to choose chart types, color palettes, scale formats, etc. in light of the specific insight or answer that we need to communicate. We’re no longer trying futilely to design charts that anticipate every possible question that the audience might have about the data, or trying to find some ‘overall best’ representation of the data that doesn’t actually exist.

Let me know your thoughts in the comments, though. Do you have a different take on this idea?

By the way...

If you’re interested in attending my Practical Charts or Practical Dashboards course, here’s a list of my upcoming open-registration workshops.

Joshua Pine

Director of Policy | City of Cincinnati Councilmember Anna Albi

3 years ago

Very insightful, thanks for sharing!

carlos barboza

compliance reporting @ Natixis | spilledgraphics.com

3 years ago

Nick Desbarats, first of all, kudos! I enjoyed this line (and I think it should have been bolded) "....we should try to design charts that best answer a specific question or that best communicate a specific insight about the data, even though such charts don’t answer all questions that readers might have about the data." Regarding this line "Yes, it would be awesome if we could make charts that ‘just show the data’".... wouldn't it be too boring though? you'll probably agree. With regard to this one: "In most situations, then, saying a few things about the data (i.e., showing a graph) is far more useful than saying nothing about the data (i.e., showing a table)." Amen!, could this fall into the category of "Less is More", sort of ? I say this because, for example, with dashboards we can easily fall into the trap of producing flashy, eye-candy, poppy visuals, thinking they are showing the data, and at some points, they are but very, very ineffectively. From my readings on visual perception, our minds trick us in thinking that these visual are relevant but they are not. They are attention-grabbing but once you start analyzing them or decomposing them, you realize they tell you very little or at times, nothing. I can continue adding more parts that I liked, reading from this article, but I will leave these two for now. Hopefully lots of business leaders, CFO, CEOs, hungry for analytics take a look at this article of yours. p.s. I will find an example of a box plot that beats a strip plot

Artur Nawrocki

Supply Chain and Financial Analytics | Microsoft Certified PL-300

3 years ago

I would see Waterfall or simple Table (with conditional formatting) much more readable
