PSDA Reflection
REFLECTION PROJECT 2 IN SEMESTER 2
Name: Aliya Zarena Binti Zainulanuar
Matric ID: A21EC0013
Lecturer’s Name: Dr. Nor Azizah Binti Ali
In early June, we were assigned Project 2 where we needed to make an inference statistical analysis based on a selected dataset. Here, we have the option to select our dataset, which we have obtained from the Kaggle website and is titled "Heart Attack Analysis & Prediction". First and foremost, we must first attend the online classes through the month of June, where we study the principles of how to use each of these test methods before we can acquire the technique for analysing these statistics. I personally feel overwhelmed and interested to learn how to analyse the datasets in a variety of statistical testing types by discovering the relationship between any two variables and clarifying the hypotheses. Not to mention, since statistical analysis skills are necessary to conduct a successful analysis such as data collection strategies skills and data mining techniques skills, I am also able to put my understanding of how to utilise statistical analysis software, such as R programming and Excel, to practical use. These softwares are very useful since they can handle the pre-processing step, carry out calculations more quickly, and produce any style of the graph that is intended.
Next, throughout this project, I was assigned by the group to perform the correlation test and make the conclusion for the whole inference statistical analysis report. Here, in analysing the correlation test, I'm having a few problems in order to get a better correlation coefficient where at first I used the other two variables but I get no correlation in the relation between the two of them. Then, I faced another problem where I couldn't access the whole dataset in the R programming software which made it difficult to form a scatter plot based on the whole sample. Here, I try to have a discussion with my teammates where they help in discovering the solutions where Khuzairie, who is one of my teammates, is able to discover that there is a correlation between the age of patients and the resting blood pressure of patients recorded. Moreover, as handling a large raw dataset is difficult for us humans, we are also able to discover that by assigning the new variable to each selected variable in order to make the dataset into an understandable dataset. As a result, through this project, I am able to learn more about how to utilise statistical software and, at the same time, develop effective communication with my teammates, as without it, the project wouldn't be completed successfully.
To summarize, I wanted to thank Dr. Nor Azizah Binti Ali, my Probability Statistical Data Analysis lecturer, and Universiti Teknologi Malaysia for providing me with a great education that has helped me enhance my statistical skills. My lecturer is one of the most approachable people I have ever met since she doesn’t hesitate to explain things to us whenever we are in confusion until the directions are clear and we are able to generate good analysis in this project. Surely, all of the knowledge I have gained in this course throughout semester 2 will be remembered and applied in the following semester including in the future, which I believe will be more challenging than the current one. I would like to take it as a challenge for myself because I believe in order to be a successful data analyst person, I need to go through all the hardships that will act as a barricade that blocks my way in order to achieve my goals and dreams.