www.crosshyou.info

政府統計の総合窓口のデータや、OECDのデータを使って、Rの練習をしています。ときどき、読書記録も載せています。

Data_Analysis

OECD Trust in government data analysis 4 - Simple Linear Regression part II. I am very confident that trust and per capita GDP has positive relationship.

Generated by Bing Image Creator: Photograph of Japanese natural small river and wild fishes in the morning www.crosshyou.info This post is followong of the above post. In the previous post, I made a data frame which shows country averaged …

OECD Trust in government data analysis 3 - Simple Linear Regression Using 'infer' Package Workflow.

Generated by Bing Image Creator: Photograph of spring fields with tiny flowers and blue sky and white clouds www.crosshyou.info This post is following of the above post. In this post, I will do linear regression using infer package workflo…

OECD Trust in government data analysis 2 - visualizing data with R using ggplot2. one variable distribution, two variables relationship.

Generated by Bing Image Crator: Photography of Mt. Fuji and beautiful lake and blue sky and red foreasts www.crosshyou.info This post is following of the above post.In the previous post, I load OECD Trust in govenment data and per capita G…

OECD Trust in government data analysis 1 - import CSV file into R and make a tidy data frame.

Generated by Bing Image Creator : Photograph of Japanese Shrine and Blue Sky and White Could and Beautiful Flowers In this post I an going to analyze OECD Trust in government with R. Firstly, I downloaded CSV file like below from OECD web …

OECD Influenza vaccination rates data analysis with R 6 - multiple regression

Generated by Bing Image Creator: Orion stars and beautiful forest www.crosshyou.info This post is following of the above post. In the previous post, I did simple regression. In this post I will do multiple regressions. We have 5 explnanato…

OECD Influenza vaccination rates data analysis with R 5 - Simple Regression Analysis

Generated by Bing Image Creator : Photograph, Morning, Tall Waterfall, Wild Flowers from Birds View more green tone www.crosshyou.info This post is following of the above post. In this post, I will do regression analysis. The dependent var…

OECD Influenza vaccination rates data analysis with R 4 - geom_point() and facet_wrap(), geom_shmooth() are good friends.

Generated by Bing Image Creator: Photograph, Small Birds, Biright Sky, White Clouds and country landscape www.crosshyou.info This post is following of the above post. In the above post, I merge Influenza data and GDP data. Let's check the …

OECD Influenza vaccination rates data analysis with R 3 - merge GDP data frame with inner_join().

Generated by Bing Image Creator : Pphotograph, Alps, Mountains, Green Grass, Cows www.crosshyou.info This post is following of the above post. In this post, I will add per capita GDP data to the Influenza vaccination data. From OECD websit…

OECD Influenza vaccination rates data analysis with R 2 - Drawing Histogram, Q-Q, Scatter Plot and Boxplot

Bing Image Creator : Photograph, Landscape, Beautiful Trees, Blue Sea and Beach www.crosshyou.info This post is following of above post.In this post, I will make some grpahps to see waht OECD Influenza vaccination rates looks like. First, …

OECD Influenza vaccination rates data analysis with R 1 - Load CSV file date using read_csv() function

Bing Image Creator -- Photograph of small yellow flowers, high mountains and blue sky and clouds In this post, I will analyze OECD Influenza vaccination rates with R. First, I load tidyverse. Then, I use read_csv() function to load data on…

OECD Gross pension replacement rates data analysis 6 - Linear Regression Analysis finds GDP does not matter for pension replacement rates.

Created by Bing Image Creator www.crosshyou.info This post is following of the above test. In this post I will analyze relationship between replacement rates and per capita GDP. So, I got per capita GDP data from OECD web site. I downloade…

OECD Gross pension replacement rates data analysis 5 - t-test with simulation based using "infer" package of R. There is no significant difference between 2014 and 2020.

Generated by Bing Image Creator www.crosshyou.info This post is following of the above post. In the above post, I visualize replacement rates of 2014 and 2020. And there seems no signficant difference. In this post, let's confirm that usin…

OECD Gross pension replacement rates data analysis 4 - Visualization Histogram and Boxplot using R. There is not much difference between 2014 and 2020.

Generated by Bing Image Creatorwww.crosshyou.info This post is following of the above post. In this post I will analysis difference between 2014 and 2020. First, I make a data frame to calculate difference between 2014 and 2020. filter() f…

OECD Gross pension replacement rates data analysis 3 - Is there statistically significant difference between MEN and WOMEN? Using R infer package workflow.

UnsplashのIvan Jevticが撮影した写真 www.crosshyou.info This post is followong of the above post. In this post I will check the first question, 1 - MEN and WOMEN have different replacement rate? I use 'infer' package workflow, it is not the…

OECD Gross pension replacement rates data analysis 2 - making data frame for analysis using select(), rename(), mutate() function and making a histogram with ggplot() + geom?histogram() function with R

UnsplashのAyesha Firdausが撮影した写真 www.crosshyou.info This post is following of the above post. In the previous post, I load CSV file data into R with read_csv() function. In this post I make a data frame for analysis use. First, I sel…

OECD Gross pension replacement rates data analysis 1 - read_csv() function to load CSV file data into R and skim() function to check each variables.

UnsplashのQingbao Mengが撮影した写真 In this post, I will analyze OECD Gross pension replacement rates data. According to the above definions, the higher replacement rates implies richer retirement life wit pension income.First, I download…

OECD Crop production data analysis 9 - ANOVA(ANalysis Of VAriance) analysis with "infer" package using R

UnsplashのAnisur Rahmanが撮影した写真 www.crosshyou.info This post is followoing of the above post. In this post I will do ANOVA analysis with "infer" package using R. I would like to see crop production are different by crop, maize, rice,…

OECD Crop production data analysis 8 - Hierarchical Clustering using R

UnsplashのDustin Humesが撮影した写真 www.crosshyou.info This post is following of the above post. In this post, I will do Hierarchical Clustering using R. First, I make a matrix for clustering. I start with subsetting only year 2020 data. …

OECD Crop production data analysis 7 - time series data analysis using R - creating "ts" class object and visualize with TSstudio package.

UnsplashのMelanie Kreutzが撮影した写真 www.crosshyou.info This post is following of the above post.I will see time-seires trend of productivity of crop production data in this post. I refrer to Hands-On Time Series Analysis with R: Perform…

OECD Crop production data analysis 6 - correlation matrix and scatter plots matrix using R

UnsplashのOchir-Erdene Oyunmedegが撮影した写真 www.crosshyou.info This post is following of the above post. In this post, I will analyze correlations of vatiables. Let's begin with crop field area. I can use cor() function to see correlati…

OECD Crop production data analysis 5 - Using pivot_wider(), select() and rename() function to make a new "wider format" data frame.

UnsplashのSina Baharが撮影した写真 www.crosshyou.info This post is followiing of the above post. In this post, I will make wider format data frame from "df" data frame object. First, let's see "df" data frame. I see SUBJECT variable has 4 …

OECD Crop production data analysis 4 - crop productivity ranking using R

www.crosshyou.info This post is following of the above post. In this post, I will make crop productivity ranking by country. Crop productivity is "crop volume / crop field area", it is measured as TONNE_HA. Let's start with MAIZE. ISR(Isra…

OECD Crop production data analysis 3 - crop volume ranking using R ggplot() + geom_boxplot(). China is only one country which are in top 4 countries for all crops.

UnsplashのSpenser Sembratが撮影した写真 www.crosshyou.info This post is following of the above post.The above post is ti make crop field are ranking. In this post, I will make crop volume ranling by location. First, MAIZE. USA, China, Bras…

OECD Crop production data analysis 2 - crop field area ranking using R visualization tool

UnsplashのMack Songが撮影した写真 www.crosshyou.info This post is following of the above post.In the previous post, I imported CSV file data into R. Then, let's visialize those data. First, let's see each 4 kinds of crops field area. Let's…

OECD Crop production data analysis 1 - Import CSV file into R using read_csv() function. MAIZE has the largest cropfields and volume.

UnsplashのMelissa Askewが撮影した写真 In this post, I will analyzie OECD Crop production data. Firstly, I downloaded data from "OECD (2023), Crop production (indicator). doi: 10.1787/49a4e677-en (Accessed on 01 July 2023)" I use R to analy…

OECD Adult education level data analysis 9 - regression analysis with panel data using R

UnsplashのMartin Sanchezが撮影した写真 www.crosshyou.info This post is following of the above post. In the previous post, I made a panel data frame, n = 37, T = 2, N = 74. In this post, I do regression analysis with the panel data frame. F…

OECD Adult education level data analysis 8 - Making panel data frame using R plm package

UnsplashのDaniel Pelaez Duqueが撮影した写真 www.crosshyou.info This post is following of the above post. In the previous post, I did regression analysis with time-series data. Unfortunately, I cannot make a statistically significant model.…

OECD Adult education level data analysis 7 - Time series analysis, serial correlation, cochrane-orcutt estimation using R

UnsplashのMattia Bericchiaが撮影した写真 www.crosshyou.info This post is following of the above post. In the previous post, I did time-series regression with differenced data and found these models are not valid. So, I will do with level d…

OECD Adult education level data analysis 6 - Time series regression analysis using R

UnsplashのJohannes Wが撮影した写真 www.crosshyou.info This post is following of the above post. In this post, I will do time series regression analysis. Before starting this, let's see which LOCATION has the most obserbations. USA has 41 o…

OECD Adult education level data analysis 5 - Cross sectional regression analysis using R

UnsplashのAditi Bhattが撮影した写真 www.crosshyou.info This post is following of the above post. In the previous post, I make some scatter plots to feel some sense of variables relationships. In this post, I will do corss sectional regress…