crosshyou

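I started this blog intending mainly to analyze cross tabulations (contingency tables), but I have not managed to do much of that. It has become a blog for practicing the R language.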

Data_Analysis

OECD International Student Mobility Data Analysis 5 - Making a slope chart using R plot(), lines(), points() and text() functions.

Photo by Aditi Jain on Unsplash www.crosshyou.info This post follows the post above. In this post, I will make a slope chart using R. We can get an idea of changes based on the slope of the lines, so we can see which country has improved…

OECD International Student Mobility Data Analysis 4 - Which country has the largest net change for International Student Mobility?

Photo by Kumiko SHIMIZU on Unsplash www.crosshyou.info This post follows the post above. Let's make a data frame that contains only the 2005 data. Then, let's make a data frame that contains only the 2018 data. Then, join these two data frames w…

OECD International Student Mobility Data Analysis 3 - t-test with t.test() function using R

Photo by Tegar Surya on Unsplash www.crosshyou.info This post follows the post above. In this post, let's do a t-test with the t.test() function in R. In the previous post, we saw that international mobility is gradually increasing. Let's confirm…

OECD International Student Mobility Data Analysis 2 - making graphs with the ggplot2 package.

Photo by Lance Anderson on Unsplash www.crosshyou.info This post follows the post above. In this post, let's make some graphs to see the data distributions. First, let's look at a histogram of the overall student_mobility data. We see there are a few o…

OECD International Student Mobility Data Analysis 1 - load CSV file data into R with read_csv() function

Photo by Kenrick Baksh on Unsplash In this post, I will analyze OECD International Student Mobility data using R. You can download the data from Students - International student mobility - OECD Data. An image of the CSV file is below. Let's upload this da…

OECD Net ODA data analysis 8 - PCA (Principal Component Analysis) shows G7 countries

Photo by Navi Photography on Unsplash www.crosshyou.info This post follows the post above. In this post, I use PCA (Principal Component Analysis). First, I make a data frame for PCA. Then, I use the prcomp() function to do PCA. We see PC1 exp…

OECD Net ODA data analysis 7 - making a correlation matrix and scatter plots.

Photo by Eugene Golovesov on Unsplash www.crosshyou.info This post follows the post above. In this post, let's make a correlation matrix. To begin with, I make a data frame of average ODAFLOWS & MLN_USD by country. Next, I make ODAF…

OECD Net ODA data analysis 6 - Turkey has the highest ODAGRANT & PC_GNI. There is no significant difference between years.

Photo by 301+ Kim on Unsplash www.crosshyou.info This post follows the post above. In this post, let's look at ODAGRANT & PC_GNI. Which country-year has the highest ODAGRANT & PC_GNI? Turkey in 2019 has the highest ODAGRANT & PC_GNI. What is…

OECD Net ODA data analysis 5 - United States has the largest ODAGRANT & MLN_USD

Photo by TOMOKO UJI on Unsplash www.crosshyou.info This post follows the post above. In this post, let's see what ODAGRANT & MLN_USD looks like. The United States dominates. Germany is second. Let's see the average value by country. United S…

OECD Net ODA data analysis 4 - Examining ODAFLOWS & PC_GNI. Sweden has the highest ODAFLOWS & PC_GNI

Photo by Jarand K. Løkeland on Unsplash www.crosshyou.info This post follows the post above. In this post, let's look at the ODAFLOWS and PC_GNI data. To begin with, let's look at the time trend. I cannot see an obvious trend. Let's calculate average val…

OECD Net ODA data analysis 3 - Visualizing data with ggplot() + geom_boxplot() and geom_line(). U.S.A. has the largest ODAFLOWS & MLN_USD.

Photo by Chris Lejarazu on Unsplash www.crosshyou.info This post follows the post above. This time, let's visualize the ODAFLOWS & MLN_USD data. First, let's look at value by time. Let's calculate the average value by time and plot a line chart. W…

OECD Net ODA data analysis 2 - the summary() function shows summary stats. The maximum ODAFLOWS is 36551.15 million USD.

Photo by kazuend on Unsplash www.crosshyou.info This post follows the post above. This time, let's import country name data too. I have the CSV file below, which contains ISO codes and country names. I use the read_csv() function to read this dat…

OECD Net ODA data analysis 1 - using read_csv() function to read CSV file data into R.

Photo by Wengang Zhai on Unsplash In this post, I will analyze OECD Net ODA data. I got the data from the OECD web site. This is what the CSV file looks like. I use R for data analysis. First, I load the tidyverse package. Next, use read_csv() functi…

OECD Doctors' consultations data analysis 6 - using lm() function for linear regression

Photo by DAVID TANG on Unsplash www.crosshyou.info This post follows the post above. In this post, I will do regression analysis using the lm() function in R. Let's go ahead. The summary() function displays the result. The p-value is 0.4024, it is grea…

OECD Doctors' consultations data analysis 4 - T-test shows doctors' consultations have increased from 2007 to 2012.

Photo by Jasper Wilde on Unsplash www.crosshyou.info This post follows the post above. This time, let's look at a boxplot by region for the 2021 data. We see the Asia region has the highest consultations and Africa has the lowest. Then, how about 20…

OECD Doctors' consultations data analysis 3 - which.max() and which.min() find which country has the max/min doctors' consultations

Photo by Harry Cunningham on Unsplash www.crosshyou.info This post follows the post above. Now, let's see which country has the maximum doctors' consultations using the which.max() function. Korea has the most consultations per capita. 16.9 ti…

OECD Doctors' consultations data analysis 2 - use inner_join() to merge two data frames

Photo by Sergey Shmidt on Unsplash www.crosshyou.info This post follows the post above. In the previous post, I read the OECD Doctors' consultations data into R. But we don't know which country each LOCATION code refers to. So I will add that information.…

OECD Doctors' consultations data analysis 1 - Read CSV file data into R

Photo by Tobias Keller on Unsplash This time, let's look at the OECD Doctors' consultations data. This indicator presents data on the number of consultations patients have with doctors in a given year. I downloaded the file below from the OECD web site (He…

OECD Threatened species data analysis 5 - Bootstrap for Confidence Interval

This post follows www.crosshyou.info. In this post, I will show you how to get a confidence interval with the bootstrap method. For BIRD, the 95% confidence interval is 18.1 ~ 25.3 by parametric calculation: average ± qt(0.975, d.f.)*S.E. We …

OECD Threatened species data analysis 4 - making bar plot with error bars in R

This post follows www.crosshyou.info. In this post, I will make a bar plot with error bars. 1. Check n (the number of observations) for each SUBJECT. We see BIRD has 36, MAMMAL has 34 and PLANT has 35 observations. 2. Calculate the average of each…

OECD Threatened species data analysis 3 - ANOVA (ANalysis Of VAriance) without lm() and anova()

www.crosshyou.info In this post, let's do ANOVA (Analysis of Variance). We see that the average Value (percentage of threatened species) differs by SUBJECT. BIRD has the highest Value and PLANT has the lowest. But this difference is statistical…

OECD Threatened species data analysis 2 - visualize data using ggplot2 in R

www.crosshyou.info This post follows the post above. This time, let's visualize the data with the ggplot2 package in R. Boxplot by SUBJECT: we see BIRD has the highest median and PLANT has the lowest median. Next, let's visualize by LOCATION. CZE…

OECD Threatened species data analysis 1 - read csv file into R

Hello. In this post, I will analyze OECD Threatened species data. First, I got the data from the OECD web site: Biodiversity - Threatened species - OECD Data. The CSV file looks like the image below. Let's read this file into R. First of all, load the tidyverse package…