PROBABILITY & STATISTICAL DATA ANALYSIS - Project

For this project, my groupmate and I were tasked to collect and analyze a dataset. The analysis involves descriptive and inference statistics analysis. The topic we are working on is "Sleep and Its Relation with Academic Performance". The sample dataset is collected among the students from the School of Computing University Teknologi Malaysia. The dataset that we have collected involved is as below.

Variable

Type of Variable

Level of Measurement

Age

Quantitative

Nominal

Gendre

Qualitative

Nominal

CGPA

Quantitative

Interval

Nap Duration (During Weekdays)

Quantitative

Ratio

Total Sleep Time (During Weekdays)

Quantitative

Ratio

Nap Duration (During Weekends)

Quantitative

Ratio

Total Sleep Time (During Weekends)

Quantitative

Ratio

Sleep Satisfaction

Qualitative

Ordinal

Sleep Consistency

Qualitative

Ordinal

Caffeine Intake

Quantitative

Ratio

Stimulant Pill Intake

Quantitative

Ratio

Reason to Stay Up

Qualitative

Nominal

Total Study Time per Day

Quantitative

Ratio

Figure 1. Table of Data Collection

 

For the first part of the project (which involves descriptive analysis), we are looking at the pattern of the dataset. Referring to the data that was collected, it is clear that sleep and academic performance are related. In order to get a better grade in academics, we need to get enough sleep to make sure our brain can function to its maximum potential. From the data we collected, we can see that on average, students with good academic performance mostly have a total sleep time of 7.5, and most of them have a sleep time of 12.30 a.m. This total sleep time also includes the time taken for their nap throughout the day where they have 2.5 hours of nap time during the weekdays and the weekend. 

 

For the second part of the project (which involves inference analysis),  we are using a different dataset given by our lecturer. This dataset is also related to the students' academic performance. However, the variable involved is as below.

Name

Variable

Data type

Gender

Gender

Nominal

Race/Ethnic

Race

Nominal

Parental Level of Education

Parental Level of Education

Ordinal

Lunch Aid

Lunch Aid

Nominal

Test Preparation Course

Test Preparation Course

Nominal

Math Score

Math Score

Ratio

Reading Score

Reading Score

Ratio

Writing Score

Writing Score

Ratio

Table 2: Variables in the data

From the analysis, it can be said that financial stability will not guarantee the students' academic performance, there is a strong positive linear relationship between reading and writing which shows that the performance of a certain subject might influence the performance of other subjects, and test preparation course are able to help the students to perform in their math test.

From both parts of the project, I'm able to gain a better understanding of the subject. Especially, in terms of implementation. It teaches me how to analyze the pattern of a dataset, and how to prove a claim on certain things. I realize that we can't make a claim or statement without a proper study of the topics where it can lead to a wrong statement and can result in a negative impact if someone uses the wrong statement especially to make a decision.

Last but not least, I would like to deliver my thanks to my groupmate for doing this project with me. Also to be forgotten our lecturer, Dr. Aryati Bakri. Thank you so much for the knowledge, and guidance you have given to us.