World Happiness Report
This project analyzes World Happiness Report datasets from 2015 and 2023 to explore factors influencing happiness across countries. By examining the data, it aims to identify key determinants of happiness such as GDP, social support, and life expectancy, and to uncover regional trends.
Description
World Happiness Report Analysis
Dataset Overview
This project analyzes the World Happiness Report datasets to understand factors influencing happiness across countries. By examining data from 2015 and 2023, the goal is to identify decisive happiness factors, explore regional differences, and develop predictive models for happiness scores.
Data Files
The repository contains the following files:
- World_Happiness_Report_Analysis.ipynb: A Jupyter Notebook with full analysis, including data preprocessing, exploratory data analysis (EDA), correlation analysis, and machine‑learning models.
- 2015.csv: Dataset for the 2015 World Happiness Report, used for regional information gathering.
- WHR2023.csv: Dataset for the 2023 World Happiness Report, serving as the primary analysis dataset.
Data Sources
The analysis uses datasets from Kaggle:
Project Objectives
- Identify happiness drivers: Determine the most important variables influencing happiness scores, such as GDP, social support, and life expectancy.
- Regional happiness analysis: Explore happiness differences across regions and identify regional trends.
- Classification and Prediction: Develop machine‑learning models to classify and predict happiness scores based on key factors.
Analytical Methods
The analysis proceeds through the following steps:
- Data preprocessing: Handle missing values, normalize data, and merge datasets for comprehensive analysis.
- Exploratory Data Analysis (EDA): Visualize and examine data distribution, outliers, and regional trends.
- Correlation analysis: Use statistical methods to identify relationships between happiness scores and various factors.
- Machine‑learning models: Implement decision‑tree regression and linear regression to predict happiness scores.
Results Overview
- Key drivers: GDP, social support, and healthy life expectancy are the most influential factors for happiness scores.
- Regional analysis: Western Europe, North America, Australia, and New Zealand achieve the highest happiness scores, while Asia and Africa lag behind.
- Predictive model: The decision‑tree regression model explains approximately 83 % of the variance in happiness scores, demonstrating its effectiveness.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: github
Created: 8/10/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.