PROJECT 1 SHORTPAPER
SCHOOL OF COMPUTING
FACULTY OF ENGINEERING
UNIVERSITI TEKNOLOGI MALAYSIA
83300 UTM JOHOR BAHRU, JOHOR
--------------------------------------------------------------------------------------------------
2nd Semester Session 2018/2019
SCSI2143-O9
Probability & Statistical Data Analysis
Project 1 – Descriptive Statistic
Non-academic activities in university
--------------------------------------------------------------------------------------------------
LECTURER : DR NOR ERNE NAZIRA BINTI BAZIN
PREPARED BY :
1)MUHAMMAD ALIEMEEN FARHAN BIN RAMZEE A18CS0123
2)MUHAMMAD AIMAN HAKIM BIN AHMAD RIDZO A18CS0121
3)MUHAMMAD FIRDAUS BIN NOR AZMAN A18CS0140
4)DANIEL BIN ARIPIN B18CS0004
ABSTRACT
We were given a task to conduct a survey on a certain topics within UTM. Our chosen topic is want to focus how students react to non-academic activities in university. In addition, we will be using R studio software to support us with creating the charts and support our explanation to this survey.
- INTRODUCTION
We provide some of questions that related to our topic which is “Non-academic activities in university”. Firstly, we want to know the respondents’ personal information which is gender, years of study and previous CGPA result. Then, we continued with our topic with what their favourite non-academic activities. From that we can know what type of activities that student like the most. After that, we ask respondents how much they satisfied with their favourite activity. So, we know if they really like the activity or just for fun. We also ask what non-academic activities that they joined last semester. Majority chose what the activities that they really like and their favourite activity. Then, we ask how they rate the activities that they have joined. If not quite good, the organizer of such activities can study from the past and do better in future. We also ask how many that they have joined non-academic activities for last semester. From that we know that students nowadays still love these types of activities. Also, the most important things if want to join these activities is about the fees. Majority students spends more money in average to join these activities. The last two questions we ask their opinions and views about these activities in university. Majority students agreed with these two statements. So, our objectives for this survey are we want to know if students love this kind of activities or not. Second, we want to know what is main cause if these activities is no longer in demand. And the last one, we want to show that this kind of activities can give benefits to organization, religion and country.
- METHODOLOGY
We used to conduct this topic (Non- academic activities in university) with online survey. It’s the easiest way because respondents just need to click the link provided and answer all the questionnaires. First, we discuss how to provide question by follow all the rubric of the project. So, we decide each member provide 4 questions with different data type which are nominal, ordinal, interval and ratio. So, we can get at most 4 questions each data type. Once we have done with the question, we created the survey with ‘Google Form’. This Google Form shared to all students in UTM to give their honest opinions regarding to the non-academic activities.
- RESULTS AND DISCUSSION
Q1(Gender)->NOMINAL
We got the percentage of the Male are 51.02% while Female are 48.98% from 49 respondents that we have.
Q2(Years of study)->RATIO
For the 1st year students, we got 73.48% from 49 respondents. Then, for 2nd and 4th year students we got the same percentage which are 8.16% while 10.20% for the 3rd year students.
Q3(Your previous CGPA result )->RATIO
A)Hist
B)Cumsum
Majority our respondent got CGPA result 2.0 and above. Mean from this data is 3.30. Mode is 4.00 while median is 3.28.
Q4(What is your favourite type of non-academic activities)->NOMINAL
The more favourite activities is sports with percentage 44.90%. Followed by community activities with 18.37%, cultural activities with 16.34%, academic services with 10.20% and others with 8.16%.
Q5(How satisfied are you with non academic activities that you participate in university)->ORDINAL/INTERVAL
The most answered by the respondents is choice no 4 with 46.94% followed by choice no 3 with 20.41%. Then, choice no 5 with 18.37% followed by answer no 2 with 10.20% and choice no 1 with 4.08%.
Q6(How many ratings do you give to activities that you join in all aspects above?)->RATIO
No respondent gave answer 1. The most answer is no 4 with 48.98%. Then, 5 with 26.53% followed by no 3 with 20.41% and no 2 with 4.08%.
Q7(How many non-academic activities have you participated in UTM last semester?)->INTERVAL/ORDINAL
Our respondents only 8.16% that join activities more than 15 times last semester. And around 57.14% respondent join less than 5 times. And the others that joined 5 -10 times is 24.49% and that joined 11-15 times is 10.20%.
Q8(How much average you spent the fees for the activity that you participate(in RM) ?)->RATIO
The mean of this data is RM58. The mode is RM100 and median is RM30.
Q9(Students must always join the non-academic activities.)->ORDINAL/INTERVAL
Almost 65.31% respondents agree with this statement and around 6.12% does not agree with this statement. Only 28.57% feel neither with this statement.
Q10(These types of activities also important for students after academical studies.)->ORDINAL/INTERVAL
Almost 83.67% respondents agree with this statement and around 4.08% does not agree with this statement. Only 12.25% feel neither with this statement.
REFERENCES
- Lectures notes Probability Statistical Data and Analysis Chapter (1-3)
- Azizah Ali
Project PSDA 2:Study of TYPE OF DRUGS THAT DRUG ADDICT USE
Project PSDA 2:
Study of TYPE OF DRUGS THAT DRUG ADDICT USE
MUHAMMAD FIRDAUS BIN NOR AZMAN A18CS0140
MUHAMMAD AIMAN HAKIM BIN AHMAD RIDZO A18CS0121
MUHAMMAD ALIEMEEN FARHAN BIN RAMZEE A18CS0123
DANIEL BIN ARIPIN B18CS0004
Introduction
‘DRUGS’ based on dictionary state “a medicine or other substance which has a physiological effect when ingested or otherwise introduced into the body.”. Drugs abuse is occurring when people take over dosage of drugs because they addicted to have fun or to strengthen their body. Drug abuse is dangerous because it can cause death Somehow drug abuse is happening among people in Malaysia. Because of the concern .We use the data about “STATISTIK JENIS DADAH YANG DIGUNA, 2010-2016” from National Anti-Drugs Agency to show the type of drug that there use in harmful way.
1.Hypothesis testing 2-sample
Year |
Opiate |
Methamphetamic |
2010 |
11664 |
4026 |
2011 |
9629 |
7034 |
2012 |
8472 |
4761 |
2013 |
16035 |
2901 |
2014 |
14496 |
4117 |
2015 |
16616 |
8133 |
2016 |
16985 |
10107 |
Based on the table above. The table shows the population and type of drug(opiate and methampehtamic) that happened in Malaysia for 7 years. Use significance level, α = 0.05, to support a claim that the mean coefficient of type of drugs used between opiate and methamphetamic
Test Statistical
H0: μ1 = μ2
H1: μ1 > μ2
Based on the result shown, the value of test statistics is 4.583851 while the critical value, t = -2.200985
INTERPRETATION OF RESULT
d.f. = 11.12528 ~ 11
Test Statistic > Critical Value
|4.583851| > |-2.201|
CONCLUSION:
- Since 4.583851 > 2.201, we Reject H0 at α = 0.05. There is insufficient evidence that there are difference in mean drug used between opiate and methamphetamic.
2. CORRELATION
Year |
Population |
Opiate |
2010 |
23642 |
11664 |
2011 |
19531 |
9629 |
2012 |
15101 |
8472 |
2013 |
21361 |
16035 |
2014 |
22355 |
14496 |
2015 |
27479 |
16616 |
2016 |
31764 |
16985 |
- Based on the table above. The table shows the population and type of drug(opiate) that happened in Malaysia for 7 years. Can we conclude from these data that a relationship exist between population and type of drug used? Use significance level, α = 0.01.
- Linear equation
Ŷ = 1332.4872 + 0.5245x
CONCLUSION
- So, we can conclude that the correlation coefficient, r = 0.81. Thus, this indicates that there is correlation between population and type of drug. However, this is a strong positive relationship because r falls within 0.8 to 1.
3. REGRESSION
Year |
Population |
Weed |
2010 |
23642 |
3011 |
2011 |
19531 |
2026 |
2012 |
15101 |
1472 |
2013 |
21361 |
1885 |
2014 |
22355 |
1919 |
2015 |
27479 |
1389 |
2016 |
31764 |
1236 |
- The following data about population and type of drug(weed) used in the Malaysia, 2010–2016.Construct a scatter plot. Does the relationship between the population and type of drug used to be linear?
The SCATTER PLOT
- Linear equation
Ŷ = 2431.58165927 -0.02532404x
CONCLUSION
- It can be seen in the graph that the total number about population increase as total number of drug(weed) is decrease. A scatter plot and regression analysis of this data indicates there is sufficient evidence that there is a relationship between the population and the weed
4. Chi-Squared Test
- A study was conducted on the used of drug in Malaysia. Test the claim that the used of drugs with equal proportion for 7 years, from 2010 to 2016 at 0.05 significance level.
Chi-Square Test of Independence
- We would like to check whether there is a relationship or not between different type of drug may effect taking of different drug(candu,opiat) through year from 2010 -2016 with significance level of ⍺ = 0.05.
- So we can propose a hypothesis.
- H0 = Variables are independence
- H1 = Variables are not independence / There is relationship between variables.
CONCLUSION
By using chi-square test of independence we find that x^2 = 14 With the significance level of ⍺ = 0.05 and Df = 12, x^2 = 21.03 Since the test value do not fall on the critical region, we fail to reject H0. Thus, we can conclude that there is no relationship between candu and opiat drug variable through the year from 2010 – 2016.
REFLECTION ON PROJECT 1& PROJECT 2
A data set is a collection of numbers or values that relate to a particular subject. For example, the test scores of each student in a particular class is a data set. The number of fish eaten by each dolphin at an aquarium is a data set.
![]() |
Data sets can be written as a group of numbers in random order, in a table form or with curly brackets surrounding them. The data sets are usually labeled so you know what the data represents, but when working with data sets, what the data stands for is not always provided and is not always needed to finish the problem.
Our group project for this semester is project 1 and project 2. Our group consists of 4 members which are Daniel, Aiman, and Daus.The 2 project mainly about data analysis and how we manipulate raw data to data that can be easily understood by everyone.
From project 1, I know how to present our raw data which we get from a survey that we conduct and share through Whatsapp apps to UTM student to get a response about non-academic activities. We represent the data to variety graph such as a pie chart, scatter plot and others that can be easily understood by everyone. We also learn how to use Rstudio to making the graph by coding.
Next, we successfully did our project 2 by manipulating data base on various data analysis method such as correlation, regression, hypothesis one sample. We use the data from We use the data about “STATISTIK JENIS DADAH YANG DIGUNA, 2010-2016” from National Anti-Drugs Agency to show the type of drug that there use in a harmful way.As me, I need to find a clear relationship among them by using chi-square independent test to determine whether the data is independent or not.I also use Rstudio to calculate test statistic and critical region .
From my observation from project 1 and project 2, I know how to analyze data using various method and to get the good data representation that I can use it in the future.I would like to thank to our lecturer ,Dr Erne Nazira for guiding us on this project and teach us PSDA for this semester