Project Profile 2
Project 2 Report
Video Project 2
Reflection Project 2
Project 2 is a part of the assessment in course Probability and Statistical Data Analysis and must be done individually. Project 2 is about inferential statististic which we use secondary data from other sources to do the analysis. For me, it is quite hard to find a dataset and I have to do many research to get the suitable one. A week before eid, I still did not find any dataset to be used in my project and the due date for the proposal to submit is on first day of eidul fitri. Finally two days before eid, I got one dataset that I think I can do the analysis on it. Fortunately, the proposal is accepted by the lecturer and I can continue to do analysis on the data.
The topic of the dataset is perception of Malaysian household about GST. In 1st April 2015, government has replace the Sales and Services tax (SST) with Goods and Services Tax (GST). Eventhough GST has been replaced back to SST, I want to know what is the perception of Malaysian on that time about GST and does Malaysian think that GST cause the cost of living increase. The dataset contain 14 variables and there are 735 respondents across the country. The type of analysis used in my project are hypothesis testing single sample, correlation, regression and chi-square test and all the calculation is done by using RStudio. So far, I did not have any problem with RStudio since the lecturer already sent the tutorial for using RStudio.
For hypothesis testing single sample, I want to test if the mean of household income is less RM5000 and the result is reject null hypothesis. Household with income less than RM5000 is under B40 category and I think that this category may feel affected with the implementation of GST. Next, I do 4 correlation analysis which are relationship between household income and transport expenditure; household income and housing expenditure; household income and food expenditure; and household income and net income. The purpose of these analysis to know if there is linear relationship and what is the strength for each relationship, hence the result for all relationship is linear. For regression analysis, I want to test does household income affect the housing expenditure. The result obtained is the null hypothesis is rejected which means that household income affect the housing expenditure. Lastly, to know if there is relationship between income strata category and the rising cost of living, chi-square test is use and the result obtained is no relationship between the variables.
From this project, I have to make sure that my understanding about the topic in this course is really well. Unlike the previous project, my group members can help me to get the correct answer. I have to be independent in order to finish this project and follow the rubric to get more marks. Since there is no final exam for this course, I hope that I can get a high mark for this project and other assignments.
Reflection Project 1
For this subject, we are required to do a project on descriptive data by using primary data such as survey. Our group make some discussion on topic that suitable to ask to our target respondent and finally we agreed to investigate the usage of social media and time spent on it among undergraduate students. We choose this topic because it is really related with the students which most of the students will have social media apps in their smartphone. After we choose the topic, we start to point out some question to be ask and follow the assessment rubric of the project. We must include nominal, ordinal, interval and ratio data type on our survey. To make sure our survey is acceptable, we come to Dr. Chan Weng Howe's room to consult with him about our project. After do some correction, we start to distribute the survey among UTM students and finally we manage to receive 130 respondent in a week.
Since our group stay in same college block, it is easy for us to discuss the next step of the project. We analyse the data and put in our report. By using Google Docs, we can edit the report anytime without need to meet each other. It is easy for me to analyse the data because sometimes I cannot think well when do report together. For me, the challenge part is when we have to use R programming in our project to generate graph. It is quite difficult because we never use this programming and we have to learn by ourselves to understand how R programming work. But we manage to handle it after learn some tutorial from YouTube.
Our presentation supposed to be done in class but our lecturer told that we must do the presentation through video. I thought it will be easy for us to present since we are staying in same block. However, Prime Minister announced that MCO will begin from 18th April and students can go back to their home. Since me and my group members want to back home too, we decided to record our presentation before MCO started. Although one of our group member need to go back early, she manage to record the video and finish it.
Based on this project, the pie chart shows most of the respondents is first year student and we conclude that first year students have more free time to spend on social media. The result shows that majority of the respondents agreed that social media makes communication easier and effiecient in receiving news. From the bar chart, we can see that YouTube is the only app that all the respondents ever use compare to Tik Tok or Pinterest due to its function for their study. For the time spending on social media, mean for the data is 2.8962 hours while the standard deviation of data is 0.9793 hours. Based on the standard deviation, it can be concluded that the numbers are less spread out.
From this project, i have learn something new on how to do a survey and analyse the data by using R programming and implement what we have learn in class towards our project. I really hope that we will get excellent marks for this project since there will be no final exam for this subject due to MCO.