Data_Analysis
UnsplashのAlejandro Contrerasが撮影した写真 www.crosshyou.info This post is following of the above post. Let's check which LOCATION has the most observations. I see GBR has the most observations, 66 observations. USA has the 2nd most, FIN …
UnsplashのMatteo Vellaが撮影した写真 www.crosshyou.info This post is following of the above post.In the above post, I did simple linear regression analysis. This time, I will for multiple linear regression (MLR). I make a new variable, fac…
UnsplashのLaura Smetsersが撮影した写真 www.crosshyou.info This post is following of the above post. I will do regression analysis. Fist, I will do simple linear regression analysis. I use SELFEMPLOYEMENT as a dependent variable and EMPLOYE…
UnsplashのRenato Pozziが撮影した写真 www.crosshyou.info This post is following of the above post. I make small data frame that contains only 2010 and 2014 observations. First, I use filter() function to get only 2010 and 2014 data, then I …
UnsplashのMaria Tejadaが撮影した写真 www.crosshyou.info This post is following of the above post.I will calculate confidence interval. First, I make bootstrapping confidence interval. I use R infer package. I use specify(), generte() and c…
UnsplashのJames Wainscoatが撮影した写真 www.crosshyou.info This post is following of the above post. In the previous post I see mean of EMPLOYEE is 17.9 and mean of SELFEMPLOYED is 30.6. Let's see whether the difference is statistically si…
UnsplashのKumiko SHIMIZUが撮影した写真 Hello. In this post, I will analyze Gender wage gap of OECD data with R. First, I download CSV file like belo from OECD web site, https://data.oecd.org/earnwage/gender-wage-gap.htm I use R to analyze …
UnsplashのCraig Mannersが撮影した写真 www.crosshyou.info This post is following of the above post. In this post, I will do hypothesis test for one proportion. For Japan nuclear power plants proportion. In the previous post, I found Japan n…
UnsplashのMaarten van den Heuvelが撮影した写真 www.crosshyou.info This post is following of the above post. In this post, I will get confidence interval for one proportion. In this case, number of nuclear power plants in Japan / number of …
UnsplashのYan Agritが撮影した写真 www.crosshyou.info This post is following of the above post. In this post I do hypothesis testing using R with infer package. I refere to B Inference Examples | Statistical Inference via Data Science (mode…
UnsplashのEean Chenが撮影した写真 www.crosshyou.info This post is following of the above post.I will calculate confidence interval in this post. There are two ways to calclulate confidence interval, one is bootstrap method and the other is…
UnsplashのLukáš Lehotskýが撮影した写真 In this post, I will playaround with OECD Nuclear power plants data with R. OECD Nuclear power plants data is defined as the number of nuclear units in operation as of 1 January 2019. It is measured a…
UnsplashのSonika Agarwalが撮影した写真 www.crosshyou.info This post is following of the above post. In this post, I will do bootstrapping with R infer package. Suppoese df2$priv_pc_gdp is population. So true mean of priv_pc_gdp is The true…
UnsplashのArda Demirkaynakが撮影した写真 www.crosshyou.info This post is following of above post. In the previous post, I made some visualizations with R ggplot2 package. In this post. In this post I will calculate confidence intervals. Fi…
UnsplashのAlicia Steelsが撮影した写真 www.crosshyou.info This post is following of above post.In the previous post, I made a dataframe for data analysis, named 'df2'.Now, let's start data analysis with data visualization.I will make 5 Name…
UnsplashのMilos Prelevicが撮影した写真 www.crosshyou.info This post is following of the above post. In the previous post, I load OECD Social spending data into R. I also load country ISO code and continent name data like below CSV file. I …
UnsplashのAlexander Schimmeckが撮影した写真 In this post I will analyze OECD Social spending data using R. OECD (2022), Social spending (indicator). doi: 10.1787/7497563b-en (Accessed on 26 November 2022) This indicator is measured as a pe…
UnsplashのEvi T.が撮影した写真 www.crosshyou.info This post is following of the above post.In the previous post, I did sinple linear regression, it menas there is only one explanatory vatiable. In this post I will do multiple linear regres…
UnsplashのMadara Parmaが撮影した写真 www.crosshyou.info This post is following of the above post.In this post, I will do linear regression analysis. To do this, I make a small(subset) data frame. Let's check what TIME has the most observat…
UnsplashのKarsten Würthが撮影した写真 www.crosshyou.info This post is following of the above post. In this post, let's sort dataframe by variables. The smallest TOT_1000EMPLOTED observation is CHL 2009. The largest TOT_1000EMPLOYED observa…
UnsplashのPhong Nguyenが撮影した写真 www.crosshyou.info This post is following of the above post.In this post I will create 5 names graphs in R. I refer to Chapter 2 Data Visualization | Statistical Inference via Data Science (moderndive.c…
UnsplashのSakuraが撮影した写真 www.crosshyou.info This post is floowing of the above post. Let's explore gdp dataframe. gdp dataframe has more LOCATION than researcher dataframe. gdp dataframe INDICATOR has only one calue, GDP. So I can re…
UnsplashのMarek Piwnickiが撮影した写真 In this post, I will analyze OECD Researchers data. Researchers are professionals engaged in the conception or creation of new knowledge, products, processes, methos and systems, as well as in the man…
UnsplashのWolfgang Hasselmannが撮影した写真 www.crosshyou.info This post is following of the above post.In this post, I will do hierarchical clustering using R. It is very easi with R. Firstly, I make a matrix for hierarchical clustering. …
Unsplashのmartin bennieが撮影した写真 www.crosshyou.info This post is following of above post. Let's calculate difference between Y2016 and Y2015 Let's see a histogram of d2016 Then, let's calculate difference between Y2017 and Y2016 Let's…
UnsplashのHeather Wildeが撮影した写真 www.crosshyou.info This post is following of the above post. In this post, I will show some statistics of our data. Before investigation, I make data frame to wide format with pivot_wider() function. W…
UnsplashのJ Cruikshankが撮影した写真 www.crosshyou.info This post is following of the above post. In the previous post, I load CSV file data into R. Then, let's make some basic graphs using ggplot2 package. Scatter plot ggplot() + geom_poi…
UnsplashのJeremy Thomasが撮影した写真 In this post, I will use R for analysis about OECD Non-Financial Corporations Debt to Surplus Ratio. This ratio is debt outstanding / annual flow if gross operating surplus. So, the higher the ratio, t…
Photo by S. Tsuchiya on Unsplash www.crosshyou.info This post is following above post. In the previous post, I did multiple regression, s_ni_kg ~ s_po_kg + s_ni_to. Let's add 'time' variables. All time variables are not statistically signi…
Photo by Harry Gillen on Unsplash www.crosshyou.info This post is following of the above post. In the previous post, I made scaled variables in df4, let's see correlation matrix of those variables. The most highly correlated variable pair …