In the age of ubiquitous technology, there is data all around us. Satellites up in space collect images of the planet we live on. Down on earth, we look at stats to optimise public transport or the healthcare system. And I bet you have a friend who counts their steps and tracks their sleeping pattern, right?
In all these cases, we curate vast amounts of data to reflect (and possibly act) upon. We’re looking for information to help us improve our well-being or to map the fastest route to our friend’s house. But the underlying data may be complex and difficult to make sense of. And most of us are not scientists or statisticians, and we are not trained in analysing data.
That’s why apps and websites pre-process the data for us and present us with a user-friendly interface. This is where Human-Computer Interaction and the people who study it come in. I am one of those people and I investigate how to create more impactful data visualisations, to help people consult complex data and make the most of it. I am currently working with the University of Cambridge and Vizzuality.
Literacy refers to our ability to process information. Imagine you work for a tech customer support hotline. You will need to pick out how tech-savvy the person on the phone is. Their computer literacy will influence the way you speak to them (and whether you ask if they’ve tried restarting). Data often is quantitative in nature, and thus requires numeracy skills. We first learn to count with an abacus, and soon enough we use a calculator. Hardly evidence of our inbuilt numerical genius! And let’s be honest, few people get excited about spreadsheets full of data.
Luckily, some do. Data experts sort through the numbers for us and then point us to the meaningful patterns in the data. A great solution to communicate these meaningful patterns is to visualise the data. That’s because visualisations support our cognitive processes, they help us think. Visuals are often easier and faster to process than numeric or semantic information.
For example, I could tell you that in 2017, most tourists visiting Spain were British. Most tourists visiting Britain were French. Most tourists visiting France were German. Most tourists visiting Germany were French. And so on. But I’m guessing I have already lost you because you are looking at the map instead. The illustration by ‘a map a day’ shows the same information in a much more interesting format.
In a nutshell, visualisations have benefits for information processing. They might even be fun to delve into. Sounds good, but there is a catch. Much like we don’t do a lot of mental maths (so we use calculators to help us), we don’t practice reading sophisticated visualisations. Only good visualisations are helpful. Poor visualisations can, in the worst case, mislead and obscure meaning.
Mind the Reader’s Mind and the Purpose of your Visualisation.
Experienced data scientists follow a user-centric approach to prevent this from happening. User-centric visualisations follow principles in line with visual perception and cognitive processing. A straightforward principle is providing all the information needed to interpret the graph. Keys should be easy-to-read, and one should avoid cryptic units or abbreviations.
How do you feel about pie charts? Some love a good pie, others despise them. Only, pie charts are neither good or bad; it depends on the purpose. Visualising a hand full of categories in a pie chart is legitimate, as in the left example in the picture below. But chances are you have seen charts like the pie on the right, with way too many slices to be able to tell any of them apart. Opponents of pie charts point out that we aren’t naturals at judging angles. Indeed, in this pie, the angles are very narrow and it’s almost impossible to compare its parts. To make things worse, the colour-coding is ambiguous and the key is full of cryptic labels.
Let’s look at another example of a modern data vis challenge. Utility companies are installing Smart Meters and giving In-Home Displays to customers. This is what current In-Home Displays look like: they tell the user whether usage is low (green), medium (amber), or high (red), and the total cost of energy spent so far.
What else could we do? Smart meter data consists of a timestamp and a power reading. The intuitive option is to draw a line graph with time on the horizontal axis and power on the vertical axis. Let’s imagine we have data for every device in the home. The example by Kelly and Knottenbelt (2015) shows usage data for different appliances. This is a very useful visualisation for researchers to see what’s what.
But would we be showing this to householders? The visual system is good at detecting deviations from the horizon. That explains why we’re good at reading bar charts. It also means that, in line graphs, untrained users tend to focus on the peaks, and neglect the baseline and trends. Did you realise in the time series above how much the fridge consumes over the course of the day? I’d argue the pictographs below put things into perspective. The size of every pictograph is proportional to the energy it’s consumed. For a householder to learn where they are using most energy, the pictographs are quicker to compare than colour-coded lines in a time series graph.
Click here to continue reading Melanie Herrmann’s article.