zaterdag 10 februari 2018

EARTHQUAKES AT GRONINGEN

Earthquakes at the province of Groningen are induced by the mining of natural gas since the sixties. The KNMI has recorded and collected the data of the quakes. Inspired by Maarten Lambrechts I loaded the data into a template at Flourish gives the following time chart:



vrijdag 9 februari 2018

HOW TO ANALYZE: FROM FLOURISH INTO R



Flourish is an awesome tool to create charts. Its output is almost art; this could move data journalism away from its original goal: being a kind of 'sociology done on deadline', aiming at 'improving reporting by using the tools of science'. Although a chart can be made fast, easily and beautiful, the question still is what does it show and what is the meaning?
Below I show how to use R and R Studio to do an analysis of the same dataset.

setwd("/home/peter/Desktop/rdata")
loading the data set in a data frame h
h<-read.csv("health2.csv")
Showing the structure of the data set
str(h)
'data.frame':   165 obs. of  8 variables:
 $ year                : int  2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 ...
 $ country             : Factor w/ 11 levels "Angola","Botswana",..: 1 1 1 1 1 1 1 1 1 1 ...
 $ life.expec          : num  46.6 47.4 48.1 48.8 49.4 ...
 $ gdp.cap             : num  606 574 776 850 1136 ...
 $ code                : Factor w/ 11 levels "AGO","BWA","CMR",..: 1 1 1 1 1 1 1 1 1 1 ...
 $ Total.as.percGDP    : num  2.79 5.38 3.63 4.41 4.71 4.1 4.54 3.38 3.84 4.37 ...
 $ govperc.total.exp   : num  60.2 52.2 46.4 46.4 51.1 ...
 $ privat.perc.of.total: num  39.8 47.8 53.6 53.6 48.9 ...

donderdag 8 februari 2018

FLOURISH: DIY DATA JOURNALISM CHARTS

You don't have to be a highly skilled data journalist to create interesting graphs and charts. There are a large number of internet sites were you can drop your data  and retrieve in seconds awesome graphics to embed on your news blog or website. I have been working in my training with for example Datawrapper, Plotly, Tableau. Recently a tweet by Alberto Cairo draw my attention to Flourish. Amazing! Flourish, based on a cooperation with Google Newslab, easily beats the competition. And of course for free. Create an account, login, choose a template and you are in business.
I played around with it, using some data of the Worldbank.  I selected a number of Sub-Sahara countries and download life expectancy and gdp per cap from 2000 to 2014. Here is my creation, done in a few minutes.




woensdag 27 december 2017

THE WORLD AT LARGE

Wanted to start 2018 with a larger view at the connected world. Here is my new Samsung UHD monitor 28 inch. Here are the specs: http://www.samsung.com/nl/monitors/uhd-ue590/LU28E590DSEN/


woensdag 6 december 2017

donderdag 23 november 2017

RIP WINDOWS 10


This enough...after a year struggling with Windows10 fast boot, secure boot, registered keys, UEFI files and endless updating, I cleaned the whole hard drive. Back to Linux again: installed Mint 17.3. All the data journalism stuff: Excel(office online) or Calc(Open Office, Outwit-Hub or table capture Chrome, Refine, Tabula, Qgis, R and R studio; all working. Just in case: running Win10 in a virtual box.
One small problem: I love Tableau public for viz. Not available for Linux (yet)....starting with plot.ly. Better integration with R.