YEW RUI XIANG's Reflection /
SECI2143--PROJECT 2

What is Data Analysis? Types, Methods and Techniques [2022 Edition] |  Simplilearn

After completing PSDA project 2, I have improved both my soft skills and hard skills. In this project, I am required to cooperate with other groupmates to make our project complete successfully. The work we do the first was searching for the secondary data from various internet resources and choosing one that fits our case study. We finally found a dataset that fits our study from Kaggle online website with our determination. The dataset is about the insurance charges based on different variables such as ages, gender and so on. The purpose of our case study is to investigate whether there is a relationship between the variables ages, sex (gender), BMI, smoker and region with insurance charges. After that, we divide our tasks fairly among us to complete our project source code, report and video. During the process, we discussed and helped one another to check each other’s code.

For this project, we are required to perform the data analysis by using R programming languages. This project pushes us to self-learning R-coding to complete the inferential statistic. Through this, I gain more knowledge about the ways to conduct the hypothesis testing and interpret the result. Apart from that, I also learned the way to do well in time management to complete the project within the given time.

As a Data Engineering student, I appreciate anything that learns in this course, especially in this project as I am exposed to data analysis knowledge such as inferential statistics which may be very useful in my future either in conducting research or enhancing job opportunity. Lastly, I am here to express my gratitude to my lecturer, Dr. Nor Azizah Ali for her guidance which makes us able to successfully implement the code and complete the project on time.