Report Activity No.2 Probability and Statistical Data Analysis

Data Analysis of the Number of Vaccinated People in Malaysia

Manipulating and Organizing Datasets

In our project we chose a dataset that contains a record of all vaccinated people in Malaysia ever since the vaccination process started. For doing data analysis, we just take a sample of the whole population because it is easier to manage. In our project, we only picked the vaccination record from January 2022 until May 2022 to make it easier for us to test the data.

Knowing the Different Types of Test Statistics

In this course we learned different types of tests like, Hypothesis testing, Correlation, Regression, ANOVA, and many other tests. In our project, we managed to imply these tests in a real-life example where we compared the number of vaccinated people in each month and whether they took their first dose of their second dose.

R Programming

To make this project more beneficial, we had to use R programming to do the calculations needed. What is so interesting, in my opinion, is that there are thousands of built-in functions that allow us to do the calculations just by knowing the name of the function and what does it exactly do.

The Outcome of the Project

After finishing our report and tested the data given, we concluded that people started to accept the fact that vaccination will enhance their immune system and it is compulsory for everyone to take it.

 

Reflection on the Project

The project gave me an insight how to do hypothesis testing in various ways. It also taught me the principles of teamwork, cooperation as well as time management. The purpose of data analytics is to organize the data so that we can make predictions and  anticipate what is going to happen in the future. With these information, we can late come up with solutions for the issues of our daily life.