The art and science of data visualization

Sponsor: The Zuri Group

The John Snow map

Snow, with modern tools

Why visualize data?

  • Visualization can generate hypotheses, reveal insights, and communicate information
  • Visualizations can be exploratory or explanatory - and there is not always a clear difference between the two!

Static charts

Interactive charts

Data source: United Nations

Exploratory vs. explanatory visualization

The data analysis process (H. Wickham)

Anscombe's Quartet

Source: Alan Smith

Visualization: an art and science

  • Practitioners must strike a balance between the art and science of data visualization
  • This means that visualizations must both:
    • Be visually appealing
    • Maintain fidelity to the structure of the data
  • Accomplishing both can be a challenge!

Color

  • Sequential

  • Diverging

  • Qualitative

When in doubt, consult ColorBrewer (http://colorbrewer2.org/)

Good use of color

Source: Kirk Goldsberry/Grantland.com

Poor use of color

Source: Kenneth Field/Cartonerd

Inaccurate representation of data

Source: FlowingData.com

Where to start the Y-axis?

What is wrong with this chart?

Alternative representation

Correlation means...

Source: XKCD

Spurious correlations

Source: tylervigen.com

Visualization for development

Possible questions to address:

  • Where are our donors coming from - and do we have people in the right places?
  • Are there relationships between donor characteristics/activities and donor volume?
  • What relationships/networks exist between donors?

Data visualization tools

For (many) data visualization tools, pick two of the following three:

  • Free to use
  • Data & visualizations stored on your own machine or server
  • Gentle learning curve

However, there are many different tools available for different use cases

Tool demonstration: Tableau

Tool demonstration: RStudio