Data Democratisation and the Trouble with Rolling Averages
21/09/2017


I was watching a webinar the other day with various luminaries adding weight to a subtle sales pitch. Two of them were talking about future trends – one claiming data democratisation (Google it!) was spreading like wildfire and was to be encouraged, while the other called it a myth. I’m with the latter – and much more in favour of information democratisation. Letting every person and their dog have access to any data so they can interpret it any which way they can – in my view that’s a recipe for much activity, confusion and wasted effort…
Let’s take just one of many examples – the application of moving average charts to data. A moving average is often used to look for trends by cutting down the amount of “noise” (uppy-downy movements over time) in the data. Investors use it, for example, to cut out the ups and downs in trading to see where a certain stock is generally heading – and good luck to them! Those trained in statistics may use these kinds of charts to smooth out seasonality so they can see any signals where data is breaking out of that seasonal pattern.
So what is the effect when we look at some of these charts?
Here’s a 12-month moving average chart of some data (out of context to demonstrate the point):

So we’ve got a roughly constant set of results from July 2013 to June 2014 – nothing much going on.
But putting this in context we see a different picture – one that we might want to take action on:

So, in context we see there is a strong seasonal pattern, with high volumes in July and August and low volumes in February! The Moving Average Chart above would look the same if the underlying data were flat rather than seasonal, because every 12-month window contains exactly one of each calendar month, so the highs and lows cancel out. So the Moving Average loses information about what is actually going on! It does not help in taking corrective action. The extended-SPC chart (showing the seasonal pattern) gives officers the opportunity to focus their efforts when incident volume is high in July and August in order to have maximum impact.
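To make the point concrete, here is a minimal sketch in Python. The numbers are invented for illustration (a sine-shaped seasonal pattern around 100, not the incident data charted above), and `moving_average` is a hypothetical helper rather than anything from a particular library. It suggests that a 12-month moving average of strongly seasonal data is indistinguishable from that of flat data:

```python
import math

def moving_average(values, window=12):
    """Trailing moving average: one result per complete window."""
    return [sum(values[i - window:i]) / window
            for i in range(window, len(values) + 1)]

# Hypothetical monthly volumes over 36 months (assumed, not real data):
# a seasonal series swinging between 70 and 130, and a flat series at 100.
seasonal = [100 + 30 * math.sin(2 * math.pi * m / 12) for m in range(36)]
flat = [100.0] * 36

ma_seasonal = moving_average(seasonal)
ma_flat = moving_average(flat)

# Range of the raw seasonal data versus range of its moving average.
raw_range = max(seasonal) - min(seasonal)
ma_range = max(ma_seasonal) - min(ma_seasonal)
print(f"raw range: {raw_range:.1f}, moving-average range: {ma_range:.6f}")
```

The raw seasonal series swings by 60 units, yet its moving average barely moves at all, just like the flat series. Anyone shown only the smoothed chart cannot tell the two situations apart.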
Here’s another example:

So now we have a signal in February 2014 – something must be special about February 2014. “Find out what happened in February 2014” rattles down from the corridors of power.
But wait up! When we look at the data in context, we see a very different picture:

So actually, there’s nothing special about February 2014 at all! February is just part of a run of low results since June 2013 (when earlier corrective action was taken by the force to reduce shoplifting). The February 2014 signal in the Moving Average Chart above is actually the result of the previous high values for months up to February 2014 dropping out of the 12-month moving average calculation! So Moving Average Charts can be misleading when taken out of context – which they usually are!
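The same effect can be sketched with made-up numbers. The series below is hypothetical (18 months at 100 followed by a sustained drop to 70), not the shoplifting data, and `moving_average` is again an illustrative helper. Each step of a 12-month moving average changes by (newest value minus the value dropping out) divided by 12, so the average keeps moving for months after the raw data has settled:

```python
def moving_average(values, window=12):
    """Trailing moving average: one result per complete window."""
    return [sum(values[i - window:i]) / window
            for i in range(window, len(values) + 1)]

# Hypothetical data (assumed): a one-off, sustained drop at month index 18.
raw = [100.0] * 18 + [70.0] * 18
ma = moving_average(raw)

# ma[k] covers raw[k] .. raw[k + 11]. Find the steps where the moving
# average changes even though the newest raw value equals the previous one,
# i.e. the movement comes purely from old high values dropping out.
late_moves = [
    k for k in range(1, len(ma))
    if raw[k + 11] == raw[k + 10] and abs(ma[k] - ma[k - 1]) > 1e-9
]

for k in range(len(ma)):
    print(f"window ending at month {k + 11}: raw={raw[k + 11]:.0f}, ma={ma[k]:.1f}")
print("steps moved by old values dropping out:", len(late_moves))
```

In this sketch the raw data changes exactly once, but the moving average carries on falling for eleven further months. Any "signal" read off the smoothed chart during that period says something about what happened a year earlier, not about the month it appears in.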
Lessons Learned:
- Don’t use Moving Average Charts out of context
- Don’t use Moving Average Charts to look for signals of corrective action impact or for opportunities to apply corrective action
- Recognise that processing any set of raw data potentially removes information from that data.
- Apply Moving Average Charts only when appropriate.
Or alternatively, you could standardise on the one Dilbert chart above – it might be just as useful!