Data Democratisation and the Trouble with Rolling Averages
21/09/2017

I was watching a webinar the other day with various luminaries adding weight to a subtle sales pitch. Two of them were talking about future trends – one claiming data democratisation (Google it!) was spreading like wildfire and was to be encouraged, while the other called it a myth. I’m with the latter – and much more in favour of information democratisation. Letting every person and their dog have access to any data so they can interpret it any which way they like – in my view that’s a recipe for much activity, confusion and wasted effort…
Let’s take just one of many examples – the application of moving average charts to data. They are often used to look for trends by cutting down the “noise” (uppy-downy movements over time) in the data. Investors, for example, use them to cut out the ups and downs in trading to see where a certain stock is generally heading – and good luck to them! Those trained in statistics may use these kinds of charts to smooth out seasonality so they can spot signals where the data is breaking out of that seasonal pattern.
So what is the effect when we look at some of these charts?
Here’s a 12-month moving average chart of some data (out of context to demonstrate the point):
So we’ve got a roughly constant set of results from July 2013 to June 2014 – nothing much going on.
But putting this in context we see a different picture – one that we might want to take action on:
So, in context we see there is a strong seasonal pattern, with high volumes in July and August and low volumes in February! The Moving Average Chart above would look exactly the same if the underlying data were flat rather than seasonal. So the Moving Average loses information about what is actually going on, and it does not help in taking corrective action. The extended-SPC chart (showing the seasonal pattern) gives officers the opportunity to focus their efforts on July and August, when Incident volume is high, in order to have maximum impact.
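To make the point concrete, here is a minimal sketch in Python. The monthly figures are invented purely for illustration (they are not the data behind the charts), and `trailing_mean` is just a name picked for this example. It shows that a 12-month moving average cannot tell a strongly seasonal series from a completely flat one with the same yearly total.

```python
def trailing_mean(values, window=12):
    """Mean of each run of `window` consecutive values."""
    return [sum(values[i - window:i]) / window
            for i in range(window, len(values) + 1)]

# Hypothetical monthly counts, January to December (made up for this sketch):
# high in July and August, low in February, averaging 100 a month.
seasonal_year = [90, 70, 85, 95, 100, 110, 140, 140, 105, 95, 90, 80]
flat_year = [100] * 12

# Two years of each pattern.
seasonal = seasonal_year * 2
flat = flat_year * 2

seasonal_avg = trailing_mean(seasonal)
flat_avg = trailing_mean(flat)

print(seasonal_avg == flat_avg)        # True: the two charts are identical
print(max(seasonal) - min(seasonal))   # 70: the swing the average has hidden
```

Every 12-month window covers each calendar month exactly once, so the seasonal swing cancels out no matter where the window starts.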
Here’s another example:
So now we have a signal in February 2014 – something must be special about February 2014. “Find out what happened in February 2014” rattles down from the corridors of power.
But wait up! When we look at the data in context, we see a very different picture:
So actually, there’s nothing special about February 2014 at all! February is just part of a run of low results since June 2013 (when corrective action was taken by the force to reduce shoplifting). The February 2014 signal in the Moving Average Chart above is simply the result of the earlier high monthly values dropping out of the 12-month moving average calculation! So Moving Average Charts can be misleading when taken out of context – which they usually are!
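The same effect can be reproduced with a small sketch. Everything here is assumed for illustration: the two levels (200 and 140), the alert threshold of 160 (chosen so the crossing lands in February 2014, as in the chart) and the names used. The raw series drops once, in June 2013, and stays flat afterwards, yet the 12-month average only crosses the threshold eight months later.

```python
def trailing_mean(values, window=12):
    """Mean of each run of `window` consecutive values."""
    return [sum(values[i - window:i]) / window
            for i in range(window, len(values) + 1)]

# Month labels from January 2012 to June 2014 (30 months).
labels = [f"{year}-{month:02d}"
          for year in (2012, 2013, 2014)
          for month in range(1, 13)][:30]

# Hypothetical counts: 200 a month until May 2013, then 140 a month
# from June 2013 onwards (a single step change, flat on either side).
raw = [200] * 17 + [140] * 13

smoothed = trailing_mean(raw)
smoothed_labels = labels[11:]  # each average is labelled by its last month

ALERT_BELOW = 160  # arbitrary threshold, assumed for this sketch

first_raw_drop = next(m for m, v in zip(labels, raw) if v < ALERT_BELOW)
first_avg_alert = next(m for m, v in zip(smoothed_labels, smoothed)
                       if v < ALERT_BELOW)

print(first_raw_drop, first_avg_alert)  # prints: 2013-06 2014-02
```

Nothing changed in the raw numbers in February 2014. The average crossed the line only because another high month from early 2013 had fallen out of the window.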
Lessons Learned:
- Don’t use Moving Average Charts out of context
- Don’t use Moving Average Charts to look for signals of the impact of corrective action, or for opportunities to apply corrective action
- Recognise that processing any set of raw data potentially removes information from that data.
- Apply Moving Average Charts only when appropriate.
Or alternatively, you could standardise on the one Dilbert chart above – it might be just as useful!