WHO Life Expectancy

Languages used: Python, R, Git

Duration: March - May 2024

The goal was to predict Life Expectancy based on the World Health Organization (WHO) data from 2006 - 2012. This was a project for STAT 432: Basics of Statistical Learning at UIUC.

For this project, I used a Random Forest (regression) and a Logistic Regression (classification) for my modelling. More details of the project are included on the github repository, linked on the project name.

Results:

  • Random Forest was able to predict life expectancy with a Test Mean Squared Error (MSE) of 3.2212 (very good)
  • Logistic Regression with a Test Classification Error of 0.056 (also very good)

Overall, the modelling was a success, and it was fun to implement the skills and techniques I learned over the semester to a real life dataset.