SECI2143 - 02 Probability & Statistical Data Analysis

Lecturer: Dr. Chan Weng Howe

The purpose of this course is to introduce students with some statistical techniques as tools to analyze data. In the beginning, students will be exposed to various forms of data which come from different sources, either daily or from industrial activities. There are some methods of parameter estimation from different distributions that need to be explored. Besides, data analysis is conducted by introducing hypothesis testing. At the end of the course, students will be able to apply some statistical models in analyzing data using available software.

Image

Chapters

  1.  Introduction to Statistics
  2. Data Description
  3. Descriptive Statistics
  4. Probability, Random Variables and Probability Distributions
  5. Hypothesis Testing
  6. Chi-Square Test and Contingency Analysis
  7. Correlation and Regression
  8. Analysis of Variance (ANOVA)

Lab

Introduction to Statistical Tools (RStudio)

Project 2

Reflection Project 2

Since the second half of the semester, the topics that we learnt in the class are mainly about inferential statistics. Inferential statistics include topics such as hypothesis testing, chi-square test and contingency analysis, correlation, regression and analysis of variance (ANOVA). Based on those topics, we do many testing for the claims by using different kinds of methods in different kinds of situations and get to apply it into this project 2. 

According to the instructions and rubric given, there are many steps that we need to apply to complete this project 2. Firstly, we need to choose or find the data set either from what is given by our lecturer or finding it through the internet. After discussing with the team members via Whatsapps group, we choose the dataset given from our lecturer which is about Student Performance. According to the dataset, there are 1000 samples that we need to analyse. The analysis includes hypothesis testing, correlation and regression which are compulsory, and also goodness of fit test, chi square test of independence and ANOVA which are optional. My group chose the chi square test of independence among all three optional analyses provided.  

After making decisions, we divide the task to each member equally and I am in charge of doing the analysis for the correlation. Although my lecturer only gave a brief explanation for RStudio, findings through the internet for formulas are very helpful. RStudio really helps me a lot in doing the analysis as it is very time consuming and also much more accurate. It is difficult for me to do the analysis with 1000 numbers of samples, thus this tool is very convenient. 

Lastly, I am very grateful for getting to complete this project with my team members as they are  very responsible with their own parts. We also get to complete the project earlier before the due date to avoid last minute work. Lastly, I hope this project 2 and also other assessments from this subject could help me a lot in doing much more complex work in the future.

Details

Project 2 Presentation