INTRO
We know that nowadays, cars are important assets in our lives because we drive cars to school, work, buy groceries, etc. It is important to have our own car so that we can travel easily anywhere, anytime we want. However, a good car with good specifications is also very important as we wouldn't want to waste more money on maintaining and servicing our cars. For this project, we have to find a data set which is from secondary source, hence I have retrieved my data from the Kaggle website. I chose this dataset because I like cars and I am very interested to explore more about the car specifications. The main purpose of my study is to apply and perform statistical test analysis such as 2 sample hypothesis testing, correlation analysis, regression analysis, goodness of fit test and chi-square test of independence from a secondary data source. I hope to prove whether the selected variables for my test analysis are dependent on each other.
PROJECT 2
-
Download Project 2 Submission.zip
Project 2 Submission.zip Details
- Friday, 26 June 2020 [1MB] -
Download Project 2 Slide.pptx
Project 2 Slide.pptx Details
- Friday, 26 June 2020 [1.9MB]
CONCLUSION
In short, it was a great experience to perform test analysis using R Studio. Compared to the previous project, of course the workload is greater as this is an individual project whereas project 1 was a group project, but I can do my research, findings and analysis freely. Also, Dr Chan was always ready to help us throughout this project. I believed that this project will be very useful for my future in the computer science field. I hope that I have more projects like this in the future. With this I end my reflection on PSDA Project 2, thank you!
*REFLECTION ON PROJECT 2*
About my data set, it was a secondary data source that I retrieved from the Kaggle website. The data was collected by an analytic from the US. After being satisfied with the data set that I retrieved, I immediately consulted Dr Chan for approval as I was not sure about the test analysis that I want to carry out, and proceeded with a proposal of my project.
After getting approval from our lecturer Dr Chan, I proceeded with my data analysis process. For data analysis process, I used R Studio and during the process, I learnt how to produce box plots, linear regression model, scatter plot from the selected variables. Since we have already used R Studio before for our first project, this time around I can do my analysis more effectively.
I also did slide presentation in PowerPoint format to make my video presentation more interesting and easy for the audience to understand my findings from the research. For my video presentation, I presented on the five analysis carried out which were 2 sample hypothesis testing, correlation analysis, regression analysis, goodness of fit test and chi-square test of independence. Throughout the slides, I provided info graphics such as scatter plots, table, box plots so that my presentation is easier to understand.
Throughout the project, I found out some interesting statements from my analysis. I am surprised to actually found out that our conclusion from the analysis is related to our real world. For example, when carrying out the 2 sample hypothesis testing, I found out that the mean horsepower of turbo-aspirated cars are higher than the mean horsepower of std-aspirated cars. In real world, this statement is actually true because turbo-aspirated cars do have higher power for enhanced driving performance. Hence, besides exploring R Studio, being able to discover these real world facts are fun and interesting.
Also, I would like to thank Dr. Chan for his guidance throughout Project 2. He is always following up on our progress via WhatsApp even though we cannot have face-to-face classes at the moment. When we encountered problems throughout our project, he is always available to answer out questions patiently. In short, I am happy to have the opportunity to do this project.
---PROJECT 2 END---