Visual Analysis of CO2 Emissions and Human Development Around the World

1. Project overview

What:
A Tableau Storyboard resulting from the exploration of global emissions, human development and world happiness data.

Why:
Analysis of countries’ CO2 emissions most commonly use production-based figures, which do not accurately reflect a population’s lifestyle. Furthermore, the success of a country in offering its population a high standard of living is often measured by its GDP or GNI per capita. In order to achieve a more accurate reflection of countries’ emissions and the quality of life of their populations, I decided to base this analysis on the consumption-based CO2 emissions and the human development index instead.

How:
1. Identify publicly available data:

2. Primarily used Python skills (pandas, seaborn, statsmodels):

  • Data wrangling
  • Data merging
  • Grouping and aggregating data
  • Supervised machine learning
  • Unsupervised machine learning
  • Time series analysis
  • Using visualization libraries

3. Data visualization with Tableau to create a storyboard

2. Analyzing and visualizing the data

Bringing the datasets together

The main focus of this step was to clean the three datasets so as to retain only data that is relevant for this project to eliminate any possible noise for the analysis before having a stab at it with machine learning, etc. I then made sure that the country names were identical and in accordance with United Nations nomenclature, before I reshaped all three to long format for easier visualization. Finally, I merged all three datasets into one dataframe as a basis for the analysis.

With the merged dataset at my disposal, I set out to perform some exploratory analysis and visualization of the data in Python.

Data collection for all three datasets is very complex, so each covers a different set of countries over varying timespans. This posed a challenge for the analysis, because the time span we can analyze depends on the variables examined. In order to not confuse the reader, I chose to limit my analysis to CO2 emissions and inequality-adjusted human development index.

Although I enjoyed using machine learning (k-means clustering) on my dataset, it generated no pertinent additional insights in this project.

In view of the time series and geographical nature of the data, visualizations proved a great way to share my insights with readers. But to gain a deeper understanding of the underlying factors at play, qualitative (political) analysis would be required.

Defining a research question

Based on the above, I set out to tackle the following research questions

Creating the Tableau storyboard

I chose the following approach to visualize the data with Tableau:

  • First, presenting data on a global scale using maps:

  • Then, looking more closely at the relationship between CO2 emissions and human development using scatter plots, both interactive and static:

  • Finally, zooming in on the development of individual countries’ over the last decade, primarily using line charts:

  • 3. Recommendations and next steps

    The projects led me to some interesting insights, which are formulated in more detail in the Tableau storyboard embedded at the top:
  • In recent decades, only very few countries have managed to keep their consumption-based CO2 footprint close to sustainable levels while ensuring high levels of human development and happiness for their populations. Only one country (Albania) fell within the desired thresholds over the last decade, albeit with a low happiness score.
  • Many countries, mostly in the 'Global North', have managed to reduce their carbon footprints in recent decades while still increasing human development scores. However, these countries must do much more to live up to their historical responsibility in mitigating the climate crisis.
  • Meanwhile, the majority of countries in the 'Global South' have increased their CO2 emissions to improve their human development scores. This underscores the urgency of finding sustainable development pathways for these countries.

  • Next steps:
  • Finally, I recommended a closer look at the identified countries to better understand the underlying reasons for their performance. As any further insights could serve as lessons learned to be applied elsewhere, I hope to find the time to do this myself soon!

  • For a closer look at the Python code please visit my GitHub and feel free to check out the full storyboard at the top of this page.