www.crosshyou.info

政府統計の総合窓口のデータや、OECDやUCIやのデータを使って、Rの練習をしています。ときどき、読書記録も載せています。

OECD Purchasing power parities (PPP) data analysis 4 - Hierarchical Clustering and K-means Clustering using R.

f:id:cross_hyou:20211231081559j:plain

Photo by Mylon Ollila on Unsplash 

www.crosshyou.info

This post is following of the above post.

In this post, I will do clustering.

First, I use hierarchial clustering.

I make a matrix for clustering. I use 2018 and 2019 data.

f:id:cross_hyou:20211231091854p:plain

Then, I use dist() function to calculate distance between each data.

f:id:cross_hyou:20211231091959p:plain

Then, I use hclust() dunction to do hierarchial clustering.

f:id:cross_hyou:20211231092113p:plain

Then, I use plot() function to make a dendrogram plot.

f:id:cross_hyou:20211231092608p:plain

f:id:cross_hyou:20211231092620p:plain

I see there are 5 groups, 

Hierarchial Group 1 - LUX, IRL, CHE, AUT, BEL, GRC, PRT, NZL, FIN.

Hierarchial Group 2 - NLD, AUS, CAN, TUR, DEU, ESP, ITA, GBR, FRA.

Hierarchial Group 3 - MEX, SWE, NOR, DNK.

Hierarchial Group 4 - JPN.

Hierarchial Group 5 - ISL.

I use cutree() function to divide into 5 groups.

f:id:cross_hyou:20211231093101p:plain

Now, let's do another clustering method, k-means clustering.

I use kmeans() function.

f:id:cross_hyou:20211231093235p:plain

Let's see the result.

f:id:cross_hyou:20211231093412p:plain

Kmeans Group 1 - DNK, ISL, NOR, SWE.

Kmeans Group 2 - AUT, BEL, FIN, GRC, IRL, NLD, PRT, CHE.

Kmeans Group 3 - AUS, CAN, FRA, DEU, ITA, ESP, TUR, GBR.

Kmeans Group 4 - JPN, MEX.

Kmeans Group 5 - LUX.

I see Hierarchial Group 3 and Kmeans Group 1 are similar, Hierarchial Group 2 and Kmeans Group 3 are similar, Hierarchial Group 1 and Kmeans Group 2 are similar.

That's it. Thank you!

The next post is

 

www.crosshyou.info

 

To read from the 1st post,

 

www.crosshyou.info