Oon Yee Sem's Portfolio Semester 2

SECI 2143- 02 PROBABILITY AND STATISTICAL DATA ANALYSIS

Project Video

REFLECTION RSS

Reflection

  Hello, my name is Oon Yee Sem, I am a student from section 2, taught and guided by my dedicated and respectful lecture, Dr Nor Azizah Ali. Via this course, it has successfully broadened my view about the knowledge of statistical data analysis and improve my programming skills by using software which is R programming language. This course is not only teaching us the knowledge of statistical data analysis but also focusing on practical by providing us assignments, quiz, tests and project to elevate our skills and also examine our understanding.   

 In the PSDA project 1, it examinates the students’ skills of data collecting, visualising and analysing the collected data by using R programming. The first technique that I have learnt from this project is to use the proper ways to set question, collect and filter the different types of data. To illustrate, there are four types of data which are nominal, ordinal, interval and ratio and every type of data we need to use proper way to design the question in order to get the respective types of data correctly. Besides that, after collecting the data, I have to learn to use R programming language to visualise the date from complicated raw data into different types of graphs such as histogram, dot plot, bar chart and so on. For me, the R programming skill is the biggest for me in this project and I strongly believe that this skill is imperative for me to elevate my potential and competitiveness in the industry in the next future. The hardship that I encountered in this project is setting the questionnaire based on different data types and learning to use R programming language to visualise the data. Fortunately, because of the high cooperation of my teammates and the dedication of my Dr by guiding and checking our questions and data types in our project proposal and step-by-step teaching us to use R programming to draw graph. Finally, my teammates and I have successfully gained lots of knowledge from this project and completed this project satisfactorily.     

  In the PSDA project 2, it examinates the skills of students by using R programming and hypothesis testing in performing advanced data analysis. In fact, this is the first time for me to conduct a data analysis on a big dataset with 1000 entries. Of course, in the journey of conducting this project was definitely not easy and my teammates and I have encountered lots of challenges in completing this project, but for me it is all worth it. Via this PSDA project 2, I have learnt the importance of hypothesis testing and how to apply this knowledge in the real-life situation. The study that our team conducted in this project was the factors that affect the income of Korean. Through conducting this study, I have learnt a lot about the income of citizens. To illustrate, there are not only education, company, gender, occupation can affect the income of a person but the number of family members, marriage status, born-year and living area also can affect too. Besides that, I have also learnt there are lots of unforeseen factors that the income of a person for instance inflation rate. economical conditions, sectors that workers involved, and Gross Domestic Product (GDP) of a country. The most favourite and interested knowledge that I have learnt in this project were utilising correlation and linear regression techniques in performing data analysis. Correlation is a statistical method to measure the strength of linear relationship between two variables. By using this technique, it is importance for us to know the relationship of the studied variables whether it is strong, weak, positive, negative, no relationship or is a curvilinear relationship and this information and relationship are imperative for us to discover new insights and reveal interdependencies of this connection. Furthermore, regression analysis is a statistical method to assist us in predicting the effects of independent variable on dependent one and it also essential for us to know which factors matter most, which factors can be ignored and how these factors can interact which each other. After I did some studies about linear regression, I found that regression can also be used as a simple machine model to use to predict value! Last but not least, via this project, I have learnt a lot the knowledge of doing data analysis such as hypothesis two-sample test, Chi-square test, correlation as well as regression test. Besides that, I also had learnt and understood the details of the factors that affect the income of people and I strongly believe that this knowledge will be very helpful in my next future.

All good things must come to an end, before end my reflection, please allow me to thank Dr Azizah for your guidance and dedication lectures in this semester. Through this course, I have learned a lot from you. I really enjoy your class and it brings lots of happiness and challenges. Dr your dedication I really appreciate it.

Thank you, Dr! May you live in a happy life and everything goes well in the future. 

Details