SECI2143 - 07 Probability & Statistical Data Analysis

Assignment 1

Assignment 2

Assignment 3

Assignment 4

Project 1

Project 2

Project 2 Reflection

By completing this project, I have mastered the ways to use RStudio to carry out inference statistical analysis such as hypothesis testing, correlation analysis, Chi-Square test of independence, and regression analysis. We can calculate the results of all tests more easily by using RStudio. This is due to the fact that the sample size of our group's dataset is 500, it is quite large, and it makes each calculation very inefficient and time-consuming. We can easily compute the data without having to perform complex calculations by using RStudio.

At first, we choose to do a study on life expectancy, but we found that some of the data had issues with accuracy and that some of the data were incomplete during the analysis. After that, our group choose to do a study on the features that increase the chances of having cardiovascular disease. To increase the accuracy of inference statistical analysis, We use 500 observations out of 70000 observations with people who have cardiovascular disease. I applied what I learned in this course to complete this project. I am able to comprehend how tests are carried out, state the null hypothesis do test calculations, and choose whether to reject the null hypothesis.

Throughout this group project, I also learn how to communicate better with group members. Each of us attends meetings that conducting in Google Meet actively participates in group discussions and contributes equally to this project. As we must finish the report as a team, this group project also helped me to improve my teamwork skills. I also appreciate the help I've received from my teammates when I've had difficulties with this project. I thank would like to thank Dr Nor Azizah Ali as she always answers our questions for our assignments and projects so we can complete our tasks on time.