Doing Big Data – Intelligently
14/12/2016

Last week we blogged about a term used by the Nobel Prize-winning author Daniel Kahneman in his book Thinking, Fast and Slow – WYSIATI (What you see is all there is). It refers to people wanting to see a complex world simplistically, whereas we would advocate that you have to understand (at some level at any rate) the complexity before simplifying – in other words “Simplicity on the far side of complexity”. Totally the opposite of WYSIATI – or “Simplistic on the near side of complexity”.
Many promises of Big Data fall into the area of simplistic on the near side of complexity. We hear Big Data is going to revolutionise our world from retail, financial services etc. through asset management and on into national security. This may be the case with unstructured data (i.e. free text), and possibly with structured data in some circumstances. It is obvious how a retailer might want to analyse all shoppers who purchased a certain washing powder for sensitive shin and run a promotion to also purchase sensitive skin cream and/or clothes made from materials for sensitive skin. Of course, retailers have been doing this for years – just not calling it Big Data.
However, there is a Big Question around Big Data revolutionising the Internet of Things or the Asset Management Industry. With sensor data coming from a myriad of equipment and machinery at the rate of every (typically) 100ms, we are being persuaded that it is necessary to capture every data point as if it has meaning – giving rise to vast so-called “data lakes” of petabytes and zeta bytes or whatever in size, with little hope of analysing it. It may revolutionise sales of Big Data solutions, but does it add value to understanding and analysis?
We came across a world-leading rubber moulding/sealing company in our travels over the past four years or so, and here we found this simplistic on the near side of complexity approach of “just put all the data into one place and run an algorithm to tell us what levers to pull” to get a perfect product. There was little interest in understanding, just “give me an answer”. They were generating large amounts of scrap – and still are.
More recently, we are working with a large transport organisation with many lifts in their buildings. We recognised that every data point did not hold value and that each data point was infected with electrical / mechanical noise that needed removal – in fact, through some intelligent analysis, we reduced the volume of rows by a factor of 400:1. Further analysis allowed us to enter the data into a proprietary time-series analysis platform and we were able to visualise normal behaviour hour by hour over several weeks, spot anomalies, see trends and patterns – all of which have never been seen before.
The Big Question – how would Big Data have provided such insight?
Categories & Tags:
Leave a comment on this post:
You might also like…
Systematic literature review – Selecting your sources
In our previous systematic literature review post we identified some background reading for getting started, explained how to find other systematic literature reviews to help you understand what is required, and how reviews should be structured and written ...
Systematic literature review – Where do I begin?
Undertaking a systematic literature review can seem a bit daunting so breaking it down into separate stages is a good way to begin. In this blog post we’re going to identify some background reading and ...
Bank holiday hours for Library Services: Monday 4 May
Library Services staff will be taking a break on Monday 4 May for the early May bank holiday. You will still be able to access all the online resources and help you need via our ...
10 things to know about referencing at Cranfield (APA7) – a taster
Need to gen up on your referencing? Then read on. The following are some of the most important things to know about referencing in the APA7 style at Cranfield. 1. If you do not reference ...
Thinking about your literature review?
As part of your PhD or Master’s thesis, you will probably have to write a literature review. A successful literature review will offer an analysis of the existing research in your field, demonstrating your understanding ...
Introducing… Scopus
Scopus is one of the largest and most trusted academic databases, indexing millions of peer‑reviewed articles, conference papers, books, and preprints across every major discipline. Scopus contains more than 100 million records and over 30,000 ...
