INTRODUCTION
For my case study, I decided to choose data set from the topic of Malaysia crime cases index in 2017, which was collected by Ministry of Home Affairs (Kementerian dalam Negeri). The data set I chose is not entirely about the crime cases but the number of residents of each states in Malaysia in 2017 was also being recorded along with it. For the specification of the target population, I decided to look into more detail on crime cases in Johor, Kelantan, Selangor and Terengganu by analyzing some data related to them. The purpose of my study is to study the relationship between states, type of crime cases, total number of crime cases and the number of residents by states. All tests I conducted was calculated using R Studio and the conclusion were based on the result.
Hypothesis testing 2-sample was conducted to determine if there is any difference in mean of total crime between Terengganu and Kelantan at the 0.05 significance level (95% CI). Other than that, correlation analysis was also conducted to measure the statistical relationship between number of residents and total number of crime cases in all four states. Besides that, I conduct a regression analysis to examine the linear relationship between the property crime cases and the total number of crime cases at 95% confidence interval. The last test that was being conducted to test whether a relationship exists between states and type of crime (stealing cars, stealing motorcycle) is Chi-Square test of independence with alpha = 0.05.
CONCLUSION
From this case study, after the result of my tests were obtained, I can conclude that even though data of the number of crime cases in one state may defer from the other, we cannot analyse it just from the number. By using hypothesis testing 2-sample, the result was that the mean of crime cases in Terengganu is not different than the mean of crime cases in Kelantan even though the total number of crime cases in Kelantan is greater than the number of crime cases in Terengganu. Other than that, by conducting correlation analysis, the result was that total number of crime cases increases as the number of resident in one state increases. In addition, it was conclude that the number of property crime cases highly affects the total number of crime cases in each states. Lastly, I can conclude that there exists relationship between the type of crime cases (stealing cars & stealing motorcycle) and states.
VIDEO
REPORT
REFLECTION
As I completed my Project 2, I gained some knowledge and new experience. One of them is doing a project on my own is not an easy task. If back in Project 1 we can divide our workload between three people, in Project 2 we need to do all tasks for the project on our own. Choosing a data set was also quite a difficult task for me since we need to consider the variable and information it provides, thus I took quite few days searching for data set here and there on the internet. Even though I decide on the data set of Malaysia crime cases index in 2017, I am still doubting my choice, but Alhamdulillah when I asked Dr. Suhaila about my chosen data set, she approved on it and I can proceed on completing the tasks of Project 2 using this data set.
In addition, when I started doing this project, I realized using R to calculate findings on hypothesis testing 2-sample, correlation, regression and chi-square test independence are so much easier than I thought it would be. Besides, my slide and video presentation was done at 4 in the morning because I need to find a time where the house is in a quiet state. It was raining outside and my voice was quite hoarse but i believe that i need to get it done no matter what. There was so many new challenging and exciting things I experience in completing this project.