PSDA

PROJECT 1 SHORTPAPER

SCHOOL OF COMPUTING

FACULTY OF ENGINEERING

UNIVERSITI TEKNOLOGI MALAYSIA

83300 UTM JOHOR BAHRU, JOHOR

--------------------------------------------------------------------------------------------------

2nd Semester Session 2018/2019

 

SCSI2143-O9

 Probability & Statistical Data Analysis

Project 1 – Descriptive Statistic

Non-academic activities in university

--------------------------------------------------------------------------------------------------

LECTURER : DR NOR ERNE NAZIRA BINTI BAZIN

PREPARED BY :

                 1)MUHAMMAD ALIEMEEN FARHAN BIN RAMZEE          A18CS0123

                 2)MUHAMMAD AIMAN HAKIM BIN AHMAD RIDZO        A18CS0121

                 3)MUHAMMAD FIRDAUS BIN NOR AZMAN                       A18CS0140

                 4)DANIEL BIN ARIPIN                                                              B18CS0004

 

 

 

 

ABSTRACT

We were given a task to conduct a survey on a certain topics within UTM. Our chosen topic is want to focus how students react to non-academic activities in university. In addition, we will be using R studio software to support us with creating the charts and support our explanation to this survey.

 

  1. INTRODUCTION

We provide some of questions that related to our topic which is “Non-academic activities in university”. Firstly, we want to know the respondents’ personal information which is gender, years of study and previous CGPA result. Then, we continued with our topic with what their  favourite non-academic activities. From that we can know what type of activities that student like the most. After that, we ask respondents how much they satisfied with their favourite activity. So, we know if they really like the activity or just for fun. We also ask what non-academic activities that they joined last semester. Majority chose what the activities that they really like and their favourite activity. Then, we ask how they rate the activities that they have joined. If not quite good, the organizer of such activities can study from the past and do better in future. We also ask how many that they have joined non-academic activities for last semester. From that we know that students nowadays still love these types of activities. Also, the most important things if want to join these activities is about the fees. Majority students spends more money in average to join these activities. The last two questions we ask their opinions and views about these activities in university. Majority students agreed with these two statements. So, our objectives for this survey are we want to know if students love this kind of activities or not. Second, we want to know what is main cause if these activities is no longer in demand. And the last one, we want to show that this kind of activities can give benefits to organization, religion and country.

 

 

  1. METHODOLOGY

We used to conduct this topic (Non- academic activities in university) with online survey. It’s the easiest way because respondents just need to click the link provided and answer all the questionnaires. First, we discuss how to provide question by follow all the rubric of the project. So, we decide each member provide 4 questions with different data type which are nominal, ordinal, interval and ratio. So, we can get at most 4 questions each data type. Once we have done with the question, we created the survey with ‘Google Form’. This Google Form  shared to all students in UTM to give their honest opinions regarding to the non-academic activities.

 

 

  • RESULTS AND DISCUSSION

 

Q1(Gender)->NOMINAL

We got the percentage of the Male are 51.02% while Female are 48.98% from 49 respondents that we have.

GAMBAR SOALAN 1.png

 

Q2(Years of study)->RATIO

For the 1st year students, we got 73.48% from 49 respondents. Then, for 2nd and 4th year students we got the same percentage which are 8.16% while 10.20% for the 3rd year students.

GAMBAR SOALAN 2.png

 

 

Q3(Your previous CGPA result )->RATIO

A)Hist

GAMBAR SOALAN 3a.png

 

B)Cumsum

GAMBAR SOALAN 3b.png

Majority our respondent got CGPA result 2.0 and above. Mean from this data is 3.30. Mode is 4.00 while median is 3.28.

 

Q4(What is your favourite type of non-academic  activities)->NOMINAL

GAMBAR SOALAN 4.png

 

The more favourite activities is sports with percentage 44.90%. Followed by community activities with 18.37%, cultural activities with 16.34%, academic services with 10.20% and others with 8.16%.

 

 

Q5(How satisfied are you with non academic activities that you participate in university)->ORDINAL/INTERVAL

GAMBAR SOALAN 5.png

 

The most answered by the respondents is choice no 4 with 46.94% followed by choice no 3 with 20.41%. Then, choice no 5 with 18.37% followed by answer no 2 with 10.20% and choice no 1 with 4.08%.

 

 

Q6(How many ratings do you give to activities that you join in all aspects above?)->RATIO

GAMBAR SOALAN 6.png

No respondent gave answer 1. The most answer is no 4 with 48.98%. Then,  5 with 26.53%  followed by no 3 with  20.41% and no 2 with 4.08%.

 

Q7(How many non-academic activities have you participated in UTM last semester?)->INTERVAL/ORDINAL

GAMBAR SOALAN 7.png

Our respondents only 8.16% that join activities more than 15 times last semester. And around 57.14% respondent join less than 5 times. And the others that joined 5 -10 times is 24.49% and that joined 11-15 times is 10.20%.

 

 

Q8(How much average you spent the fees for the activity that you participate(in RM) ?)->RATIO

GAMBAR SOALAN 8.png

The mean of this data is RM58. The mode is RM100 and median is RM30.

 

 

Q9(Students must always join the non-academic activities.)->ORDINAL/INTERVAL

GAMBAR SOALAN 9.png

Almost 65.31% respondents agree with this statement and around 6.12% does not agree with this statement. Only 28.57% feel neither with this statement.

 

Q10(These types of activities also important for students after academical studies.)->ORDINAL/INTERVAL

GAMBAR SOALAN 10.png

Almost 83.67% respondents agree with this statement and around 4.08% does not agree with this statement. Only 12.25% feel neither with this statement.

 

 

 

 

 

 

REFERENCES

  • Lectures notes Probability Statistical Data and Analysis Chapter (1-3)
  • Azizah Ali

Project PSDA 2: Study of TYPE OF DRUGS THAT DRUG ADDICT USE

Project PSDA 2:
Study of TYPE OF DRUGS THAT DRUG ADDICT USE

 

 

 

MUHAMMAD FIRDAUS BIN NOR AZMAN                     A18CS0140

MUHAMMAD AIMAN HAKIM BIN AHMAD RIDZO        A18CS0121

MUHAMMAD ALIEMEEN FARHAN BIN RAMZEE          A18CS0123

DANIEL BIN ARIPIN                                               B18CS0004

 

Introduction

 

‘DRUGS’ based on dictionary state “a medicine or other substance which has a physiological effect when ingested or otherwise introduced into the body.”. Drugs abuse is occurring when people take over dosage of drugs because they addicted to have fun or to strengthen their body. Drug abuse is dangerous because it can cause death Somehow drug abuse is happening among people in Malaysia. Because of the concern .We use the data about “STATISTIK JENIS DADAH YANG DIGUNA, 2010-2016” from National Anti-Drugs Agency to show the type of drug that there use in harmful way.

 

 1.Hypothesis testing 2-sample

Year

Opiate

Methamphetamic

2010

11664

4026

2011

9629

7034

2012

8472

4761

2013

16035

2901

2014

14496

4117

2015

16616

8133

2016

16985

10107

 

Based on the table above. The table shows the population and type of drug(opiate and methampehtamic) that happened in Malaysia for 7 years. Use significance level, α = 0.05,  to support a claim that the mean coefficient of type of drugs used between opiate and methamphetamic

 

Test Statistical

 

H0: μ1 = μ2

H1: μ1 > μ2

Picture1.jpg

Based on the result shown, the value of test statistics is 4.583851 while the critical value, t = -2.200985

INTERPRETATION OF RESULT

d.f. = 11.12528 ~ 11

Test Statistic > Critical Value

  |4.583851| >  |-2.201|

 

 

 

CONCLUSION:

 

  • Since 4.583851 > 2.201, we Reject H0 at α = 0.05. There is insufficient evidence that there are difference in mean drug used between opiate and methamphetamic.

 

 

2. CORRELATION

Year

Population

Opiate

2010

23642

11664

2011

19531

9629

2012

15101

8472

2013

21361

16035

2014

22355

14496

2015

27479

16616

2016

31764

16985

 

  • Based on the table above. The table shows the population and type of drug(opiate) that happened in Malaysia for 7 years. Can we conclude from these data that a relationship exist between population and type of drug used? Use significance level, α = 0.01.

 

Picture2.jpg

  • Linear equation

Picture3.jpg

                               Ŷ = 1332.4872 +  0.5245x

 

CONCLUSION

 

  • So, we can conclude that the correlation coefficient, r = 0.81. Thus, this indicates that there is correlation between population and type of drug. However, this is a strong positive relationship because r falls within 0.8 to 1.

 

3. REGRESSION

 

Year

Population

Weed

2010

23642

3011

2011

19531

2026

2012

15101

1472

2013

21361

1885

2014

22355

1919

2015

27479

1389

2016

31764

1236

 

  • The following data about population and type of drug(weed) used in the Malaysia, 2010–2016.Construct a scatter plot. Does the relationship between the population and type of drug used to be linear?

 

The SCATTER PLOT

Picture4.png

  • Linear equation

Picture5.png

Ŷ = 2431.58165927 -0.02532404x

 

CONCLUSION

  • It can be seen in the graph that the total number about population increase as total number of drug(weed) is decrease. A scatter plot and regression analysis of this data indicates there is sufficient evidence that there is a relationship between the population and the weed

 

4. Chi-Squared Test

 

  • A study was conducted on the used of drug in Malaysia. Test the claim that the used of drugs with equal proportion for 7 years, from 2010 to 2016 at 0.05 significance level.

 

Chi-Square Test of Independence

  • We would like to check whether there is a relationship or not between different type of drug may effect taking of different drug(candu,opiat) through year from 2010 -2016 with significance level of ⍺ = 0.05.
  •  
  • So we can propose a hypothesis.
  • H0 = Variables are independence
  • H1 = Variables are not independence / There is relationship between variables.

Picture6.png

 

CONCLUSION

 By using chi-square test of independence we find that x^2 = 14 With the significance level of ⍺ = 0.05 and Df = 12, x^2 = 21.03 Since the test value do not fall on the critical region, we fail to reject H0. Thus, we can conclude that there is no relationship between  candu and opiat drug variable through the year from  2010 – 2016.

 

REFLECTION ON PROJECT 1& PROJECT 2

data set is a collection of numbers or values that relate to a particular subject. For example, the test scores of each student in a particular class is a data set. The number of fish eaten by each dolphin at an aquarium is a data set.

 

data_sets_1.jpg

 

Data sets can be written as a group of numbers in random order, in a table form or with curly brackets surrounding them. The data sets are usually labeled so you know what the data represents, but when working with data sets, what the data stands for is not always provided and is not always needed to finish the problem.

Our group project for this semester is project 1 and project 2. Our group consists of 4 members which are Daniel, Aiman, and Daus.The 2 project mainly about data analysis and how we manipulate raw data to data that can be easily understood by everyone.

From project 1, I know how to present our raw data which we get from a survey that we conduct and share through Whatsapp apps to UTM student to get a response about non-academic activities. We represent the data to variety graph such as a pie chart, scatter plot and others that can be easily understood by everyone. We also learn how to use Rstudio to making the graph by coding.

 

Next, we successfully did our project 2 by manipulating data base on various data analysis method such as correlation, regression, hypothesis one sample. We use the data from We use the data about “STATISTIK JENIS DADAH YANG DIGUNA, 2010-2016” from National Anti-Drugs Agency to show the type of drug that there use in a harmful way.As me, I need to find a clear relationship among them by using chi-square independent test to determine whether the data is independent or not.I also use Rstudio to calculate test statistic and critical region .

From my observation from project 1 and project 2, I know how to analyze data using various method and to get the good data representation that I can use it in the future.I would like to thank to our lecturer ,Dr Erne Nazira for guiding us on this project and teach us PSDA for this semester