Project 2 is part of the assessment for the course Probability and Statistical Data Analysis and must be done individually. It is about inferential statistics, for which we use secondary data from other sources to do the analysis. For me, it was quite hard to find a dataset, and I had to do a lot of searching to get a suitable one. A week before Eid, I still had not found a dataset for my project, and the proposal was due on the first day of Eidul Fitri. Finally, two days before Eid, I found a dataset that I thought I could analyse. Fortunately, the lecturer accepted the proposal and I could continue with the analysis.
The topic of the dataset is the perception of Malaysian households about GST. On 1 April 2015, the government replaced the Sales and Services Tax (SST) with the Goods and Services Tax (GST). Even though GST has since been replaced by SST again, I wanted to know how Malaysians perceived GST at that time and whether they thought that GST caused the cost of living to increase. The dataset contains 14 variables and 735 respondents from across the country. The types of analysis used in my project are a single-sample hypothesis test, correlation, regression and the chi-square test, and all the calculations were done in RStudio. So far, I have not had any problems with RStudio, since the lecturer had already sent a tutorial on how to use it.
For the single-sample hypothesis test, I wanted to test whether the mean household income is less than RM5000, and the result was that the null hypothesis is rejected. Households with an income of less than RM5000 fall under the B40 category, and I think this category may have felt the effect of the implementation of GST. Next, I did four correlation analyses: household income and transport expenditure; household income and housing expenditure; household income and food expenditure; and household income and net income. The purpose of these analyses was to find out whether there is a linear relationship in each pair and how strong it is, and the results showed a linear relationship for all four pairs. For the regression analysis, I wanted to test whether household income affects housing expenditure. The null hypothesis was rejected, which means that household income does affect housing expenditure. Lastly, to find out whether there is a relationship between income strata category and the rising cost of living, I used the chi-square test, and the result showed no relationship between the two variables.
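The four analyses described above can be sketched in a few lines of code. This is only an illustration in plain Python, not the RStudio work done for the project (in R, the built-in functions t.test, cor, lm and chisq.test cover the same steps). The numbers are made-up toy values, not the survey data, and the function names are hypothetical.

```python
import math
from statistics import mean, stdev

# Hypothetical toy data (NOT the survey data): monthly household income and
# housing expenditure in RM for eight made-up households.
income = [2500, 3200, 4100, 4800, 5500, 3000, 3900, 4400]
housing = [600, 750, 900, 1100, 1300, 700, 850, 1000]


def one_sample_t(sample, mu0):
    """t statistic for H0: mean = mu0, with n - 1 degrees of freedom."""
    n = len(sample)
    return (mean(sample) - mu0) / (stdev(sample) / math.sqrt(n))


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simple_regression(x, y):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    return slope, my - slope * mx


def chi_square_statistic(table):
    """Chi-square statistic for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat


# Left-tailed test of H1: mean income < RM5000. With 8 values (df = 7) and
# alpha = 0.05, H0 is rejected when t is below -1.895 (from a t table).
t = one_sample_t(income, 5000)
print("t statistic:", round(t, 3), "reject H0:", t < -1.895)

print("correlation:", round(pearson_r(income, housing), 3))
print("slope, intercept:", simple_regression(income, housing))

# Hypothetical 2x2 counts: income group (rows) by opinion on cost of living.
print("chi-square:", round(chi_square_statistic([[30, 20], [28, 22]]), 3))
```

Each statistic still has to be compared with a critical value or turned into a p-value before a decision is made, which is the part that RStudio reports automatically.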
For this project, I had to make sure that I understood the topics in this course really well. Unlike the previous project, where my group members could help me get the correct answer, this time I had to be independent in order to finish the project, and I had to follow the rubric to get more marks. Since there is no final exam for this course, I hope that I can get a high mark for this project and the other assignments.