Tuesday 3 January 2017

The Electronic Barometer

This goes beyond data journalism. It is a project about the Internet of Things that studies the relationship between measurement or observation, data and data storage, and finally retrieving, analyzing and visualizing the data, in real time if possible. Here is the story.

Monday 2 January 2017


In part 1 we discussed publishing real-time data; in part 2 I covered storing data in a database and publishing it on a blog. In this third and last part of the series I turn to retrieving data from the database and visualizing the query results.
The simplest way to retrieve data from MySQL is to install phpMyAdmin, which gives you complete control over the database, tables and queries. phpMyAdmin makes it very easy to build a query and export the result to .csv. The exported .csv can be used for further analysis in, for example, Excel and for visualization with Google.
It is more interesting to connect to the database directly from R and to run the query, analyze and visualize the results there. Finally, I look at publishing these results online with the plot.ly REST API.
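
The query-and-export step can be sketched in a few lines of code. This is only an illustration, not the code used in the series: it is written in Python rather than R, it uses the standard-library sqlite3 module as a stand-in for MySQL so that it runs without a server, and the table name `readings`, its columns and the sample values are invented for the example.

```python
import csv
import io
import sqlite3


def export_query_to_csv(conn, query, out):
    """Run `query` on `conn` and write the result, with a header row, as CSV to `out`."""
    cursor = conn.execute(query)
    writer = csv.writer(out, lineterminator="\n")
    # cursor.description holds one tuple per result column; item 0 is the column name.
    writer.writerow([column[0] for column in cursor.description])
    rows = cursor.fetchall()
    writer.writerows(rows)
    return len(rows)


def demo():
    """Build a tiny in-memory table (hypothetical schema) and export it as CSV text."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE readings (measured_at TEXT, pressure_hpa REAL)")
    conn.executemany(
        "INSERT INTO readings VALUES (?, ?)",
        [("2017-01-02 10:00", 1012.5), ("2017-01-02 11:00", 1011.0)],
    )
    buffer = io.StringIO()
    export_query_to_csv(
        conn,
        "SELECT measured_at, pressure_hpa FROM readings ORDER BY measured_at",
        buffer,
    )
    conn.close()
    return buffer.getvalue()


if __name__ == "__main__":
    print(demo())
```

Against a real MySQL server the same pattern should carry over with a MySQL driver in place of sqlite3; only the connection call would change.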


It is nice to read the pressure changes in real time as an indication of changes in the weather. However, once published, the observations are lost. It would be much better to store the measurements as data in, for example, MySQL and to publish the data from the database in WordPress.
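
Storing each measurement as it arrives could look roughly like the sketch below. It is an assumption-laden illustration rather than the setup used in the post: Python's built-in sqlite3 stands in for MySQL, and the `readings` table, its columns and the helper names `open_store`, `store_reading` and `latest_readings` are hypothetical.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open the database and create the (hypothetical) readings table if needed."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS readings ("
        "measured_at TEXT PRIMARY KEY, "
        "pressure_hpa REAL NOT NULL)"
    )
    return conn


def store_reading(conn, measured_at, pressure_hpa):
    """Insert one measurement; the `with` block commits on success, rolls back on error."""
    with conn:
        conn.execute(
            "INSERT INTO readings (measured_at, pressure_hpa) VALUES (?, ?)",
            (measured_at, pressure_hpa),
        )


def latest_readings(conn, limit=10):
    """Return the most recent measurements, newest first, e.g. for a blog widget."""
    cursor = conn.execute(
        "SELECT measured_at, pressure_hpa FROM readings "
        "ORDER BY measured_at DESC LIMIT ?",
        (limit,),
    )
    return cursor.fetchall()
```

Using ISO-style timestamps as text keeps the rows sortable by time with a plain `ORDER BY`, which is why the sketch stores `measured_at` that way.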

Wednesday 16 November 2016

Advanced Tableau

Playing with Tableau and trying some advanced settings:

1.  Adding GPS points to a map: how-to.

2.  Adding a shape file to a map: how-to, and where to check for maps.

Monday 24 October 2016


The Economist published a very interesting data journalism piece about the emotions in the presidential debate between Trump and Clinton. "Using facial-analysis software developed by Microsoft, The Economist has sought to quantify Mr Trump and Mrs Clinton’s emotions during some of the most trenchant moments of the debates".
This analysis focuses on facial expressions in the debate; the content, however, is important as well. Using a transcript of the debate published by The Washington Post, I analyzed the debate's keywords, visualized them in word clouds, and examined the sentiments related to the words, all in R.
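
The counting step behind a word cloud can be sketched as follows. This is a Python illustration, not the R code used for the post; the stop-word list is a small hand-picked assumption, and any sample text you feed it is your own, not taken from the transcript.

```python
import re
from collections import Counter

# A deliberately small, hand-picked stop-word list (an assumption for this sketch).
STOPWORDS = {
    "the", "and", "for", "that", "this", "with", "have", "are", "was",
    "you", "our", "but", "not", "they", "will", "what", "who",
}


def keyword_counts(text, top_n=5):
    """Return the `top_n` most frequent words, ignoring stop words and very short words."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(
        word for word in words
        if word not in STOPWORDS and len(word) > 2
    )
    return counts.most_common(top_n)
```

The resulting (word, count) pairs are the kind of input a word-cloud tool typically needs: the count determines the size of each word.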

The word clouds for the keywords of Clinton and Trump follow after the 'more' tag:

Friday 21 October 2016


The Nobel Prize in Literature has been awarded to Bob Dylan. How to report on that? From a data journalism perspective there are interesting possibilities. NRC Handelsblad published an infographic. Interesting, but there are other possibilities using R and Tableau. Here are a few examples. If you are interested in the how-to, follow the 'more' tag.