In Project 2, my team members (Chih Zhen En and Yong Kok Siong) and I need to conduct a statistical inference analysis. We need to use a total of 4 tests to perform the analysis. Each test will perform a different function and analysis. Those tests are
- 2 Sample Hypothesis Testing
- Correlation
- Regression
- Chi-square Test
First of all, we need to find a suitable dataset from the website. It's hard for us to find a suitable dataset because the dataset must fulfill the requirement of those tests. Nevertheless, we still managed to reach a consensus and choose a suitable dataset about education. It's from a trustable official website from The Department of Statistics Malaysia. Next, to do the test, we are required to apply the knowledge that we learned from the class in R studio, which is an integrated development environment for R, a programming language for statistical computing and graphics. It's used to avoid human error as all the calculations will be calculated through the program. It's an interesting experience and skill to have. It helps me to get to know more about how analyses and graphs are conducted and created. After all the calculations, we get to know the result of the aim of this analysis.
- There is no difference between the mean number of male students and female students in secondary schools in Malaysia.
- There is a strong positive relationship between the number of schools and the number of students.
- The number of male or female students is dependent on the area.