»Me, myself and BI«

Bissantz ponders


|

Above all else, show the data

When you compress values, you are covering both yourself and your data. Information is rarely so valid and representative that it can maintain its meaning in a summarized form.

Welt am Sonntag (WAMS), my beloved Sunday newspaper, regularly chats with a panel of top managers. Twelve CEOs assessed what results the current and future economic conditions would have on their companies. Their answers were summarized in a so-called “entrepreneur business cycle index”.

Welt am Sonntag, 2008–04–06 page 29

These types of charts are commonplace in articles, presentations, expert reports and memos. The underlying desire to pack information into a single statement is apparent. It’s either black or white – and nothing but the facts.

The desire is understandable but extremely difficult to fulfill. Facts, for example, are rare in today’s information age. The reality is something in a shade of gray – never 100 % white or black. Averages signalize precision but our data is usually anything but exact. We need to use averages with caution. Why? Take temperature, for example. The average of zero and 40 degrees Celsius (32 F and 104 F) is a comfortable 20 degrees (68 F). But this moderate temperature never existed. The reality of the uncompressed data was extreme heat and cold. In the case of the twelve lonely CEOs in the WAMS panel, my forecast for my industry would shift the average…upwards. The panel is much too small. As you can see from my fictitious schematic chart, many different distributions (with very different implications) can lead to the average as in the WAMS chart.

Variations of raw data with the same average

In my opinion, the representation above is the only serious one, because it is transparent. It reveals – not conceals – the information so that each reader can decide if the average is an accurate depiction of the distribution. In addition to viewing the range of the answers, the reader can observe if the distribution shows trends, for example, ten responses are very similar while two are outliers. It illustrates that the data is weak, but it makes the most out of the information available. In other words, it’s beautiful evidence.

If you present your data like this, you not only show that you like the reader but that you like your data as well.

2 comments for “Above all else, show the data”

  1. Jon Peltier said:

    “Facts, for example, are rare in today’s information age.”

    Unfortunately, so true.

    I like your chart, and the philosophy behind it (Show The Data).

  2. Stephen Hampshire said:

    Excellent post, and very nicely illustrated too.

    Have you heard the joke about the statistician who drowned in a pool with an average depth of 2cm?

Leave a response

Thursday, August 5th, 2010

When CI rules

Friday, July 16th, 2010

Writing with sparklines

Friday, June 25th, 2010

Sportlines: The first sparklines in a German newspaper

Friday, June 4th, 2010

Computers from Pandora

Friday, May 14th, 2010

Helmsman, leave your watch - Helmsman, help us adapt.

Friday, April 23rd, 2010

The first sparklines in "Die Welt" – well, almost…

Friday, April 2nd, 2010

You can’t wrap a fish in an iPad

Friday, March 12th, 2010

New ‘See’land, Part II

Friday, February 19th, 2010

From Pixelland to Panoramaland

Saturday, January 30th, 2010

New ‘See’land I


»Me, myself and BI« Bissantz ponders
DE Deutsch