“All models are wrong, some are useful” ≠ Modeling is a futile exercise
Source : unsplash

“All models are wrong, some are useful” ≠ Modeling is a futile exercise

George Box's quote about "All models are wrong, some are useful"? is taken literally but ideally it should not be.

The phrase “All models are wrong, some are useful” is quite loosely used. Some take it in a very literal sense to imply that “Modeling is a futile exercise”.

This is a terrible misunderstanding and we shall see why shortly.

“All models are wrong, some are useful” is an aphorism (meaning it is a concise expression of general truth). But the aphorism in this case leads to misinterpretation.

Firstly, it is important to understand what modeling is.

The purpose of modeling is to provide an abstraction of real process. Basically, a good approximation of reality.

Anybody who mistakes the?abstraction for the real, commits the Fallacy of Reification (yup, one more fallacy to add to the list of all fallacies which we data scientists/statisticians commit).

Anybody who mistakes the abstraction for the real, commits the Fallacy of Reification

Coming to why the phrase should not be taken literally, a good analogy would be that of an example of a Map.

In an exact sense, a map is also wrong because it does not provide 1:1 mapping of the real world.

So, George Box’s phrase should be construed the same way as a map is considered wrong because it does not represent the real world.

Does a map provide 1:1 mapping to the real world?

No.

But is it mighty useful?

Hell yeah.

Also, talking about utility of the models, John Tukey’s words conveys the essence clearly.

Far better an approximate answer to the right question, which is often vague, than an exact answer to the wrong question”.

So, to summarize

“All models are wrong, some are useful” ≠ “Modeling is a futile exercise”.

Reference (map analogy) : On Exactitude in science — Jorge Luis Borges

Your comments are welcome.

Gareth Ellis

Head Of Data Analytics & Research at PKF Thailand

3 年

Some models are well designed and some less well. Their proximity to ‘the truth’ varies accordingly. Similarly in any market research project some are based on sound sampling methodologies and others are not; their ability to inform decision making therefore varies. I was always struck by the number of companies who didn’t believe modeling of historical data was a good place to start when considering future actions. If you don’t make use of available data what exactly are you going to base your decision on? I guess the modeler has to be able to adequately explain the method and potential limitations in order to gain buy-in and for the client to confidently apply any recommendations.

要查看或添加评论,请登录

社区洞察

其他会员也浏览了