www.crosshyou.info

政府統計の総合窓口のデータや、OECDやUCIやのデータを使って、Rの練習をしています。ときどき、読書記録も載せています。

OECD Total official and private flows data analysis 2 - Summary Statistics using R

UnsplashNastasia Kalininaが撮影した写真 

www.crosshyou.info

This post is following of the above post.

Let's see summary statistics.

First, let's see summary statistics by LOCATION.

Let's see summary stats by TIME.

I use group_by() function, then I use summarize() function and min(), quantile(), median(), max(), mean(), sd() and n() function.

Let's see the result.

Let's see each stats with line chart.

For minimum value, there are some extraordinaly nagative values.

For p25 (25 percentile), it is up trend.

 

For p50, it is similar to p25.

I found minimum value has different trend than other stats. So, it is good idea to analyze from minimum value point of view.

let's see df_summaly_loc again.

Some LOCATION has negative min value and some don't.

AUS, AUT, BEL, CHE, FIN, ITA and NLD have negative min.

I added a new variable to indicate having negative Value.

That's it. Thank you!

Next post is,

www.crosshyou.info

 

To read from the 1st post,

www.crosshyou.info