Introduction
When I first came to class for statistics I knew it was going to be different from all the other math that I was used to. Fast forward to the end of this semester and now I know the reasons why this project was done the way it was and why knowing sample sizes, types of samples, type of data, as well as testing the data to see if it is correct is so important. The benefits from what I have learned in statistics will go far as I am going to school of Graphics and Multimedia Software. Being able to apply the skill of using graphing solutions such as box plots and histograms will be a better tool to apply logical numbers unless I have to make a pretty categorical graph such as pie chart.
It is probability statistic data analysis project. The project is about use secondary data from any organizations, websites or etc through the computer by using R program. So, this data was collected by Crisis Preparedness and Response Centre (CPRC), Ministry of Health (MOH) and Department of Statistics, Malaysia (DOSM). Basically, this data is about Covid-19 the status of COVID-19 outbreak in Malaysia. I choose this data analysis because to do some research about status of Covid-19 in our county. For this project I use a few concept like hypothesis testing, chi-square, regression and correlation.
Reflection
Hypothesis testing of a claim is something I do not think I will use often but having to be able to calculate within a 90-99% certainty that a piece claim is true will is an awesome skill. This skill is also useful to test claims from others that what they are saying is accurate. If my results are more critical using the same confidence level then I have evidence to dispute what somebody else states. This is something I needs and now understand why statistics is required for my studies.
The Chi-square test is intended to test how likely it is that an observed distribution is due to chance. It is also called a "goodness of fit" statistic, because it measures how well the observed distribution of data fits with the distribution that is expected if the variables are independent.
Next, I use regression analysis to describe the relationships between a set of independent variables and the dependent variable. Regression analysis produces a regression equation where the coefficients represent the relationship between each independent variable and the dependent variable. You can also use the equation to make predictions.
Correlation is a term that is a measure of the strength of a linear relationship between two quantitative variables. The most common measure of correlation is Pearson’s product-moment correlation, which is commonly referred to simply as the correlation, the correlation coefficient. The correlation coefficient r measures the strength and direction of a linear relationship.
Conclusion
In conclusion to my reflection I know the requirements for a good sample. I learnt many thing tips do to statistical and how to represents it using graphical. So, what I got from the project, it is not easy to do by myself and do more research at home because of MCO. But, whats can I do ? So, just obey the instructions because from this project I realize that we need to stop the virus of Covid-19 from spreading. So, what our county do is control their movement and isolate themselves. The above findings is in parallel with instructions Movement Control Order (MCO). MCO is instructions from government that everyone. This is to prevent the spread of the virus Covid-19. The more you obey the government, the more quick our country free from this virus.
Report :
-
Download Part 1.docx
Part 1.docx Details
- Sunday, 28 June 2020 [32.9KB]
Data Set :
-
Download DatasSet.xlsx
DatasSet.xlsx Details
- Sunday, 28 June 2020 [22.3KB]
Appendix :
-
Download Part 2.docx
Part 2.docx Details
- Sunday, 28 June 2020 [44.4KB]
Video :
Video has been submitted through e-learning .