Skip to content

PrateekTh/development-based-clustering-countries

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Economic Development Based Analysis of Countries

This project aims to analyse the Gross Economic Data of 167 countries around the globe, to gain several insights. The analysis involves several steps, such as Data Cleaning, extensive visualisation, feature engineering and finally building the K-Prototype Clustering model.

Thus, the basic objectives pursued here are:

  • Find trends in development between various features, and look into economic implications in such relationships.
  • Perform unsupervised clustering based on all the features, and divide countries into clusters based on the features.
  • Profile and understand the thus formed clusters, and their possible significance.

Exploratory Analysis

Here are some of the Visualisations (the complete project can be found in the colab notebook):

Count Based Distributions of various features: Count-Distribution

Top and bottom countries of each Feature: GDP

Lowest Life Expectancy

Highest Life Expectancy

Pair-plots comparing each Feature:

Fertility VS Income

Results:

As a result of clustering, three major clusters are observed. Taking a broader perspective, they are named based on help needed by the country in question.

Plots with clusters based on Categories:

Child Mortality & Imports VS GDPP

World Map (Chloropleth): Result Plot

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published