PSDA PROJECT 2

In this project 2, we are required to find a dataset on the internet and do some hypothesis testing on the dataset on certain claimed value/facts. I chose a dataset that contained the data about the factors of diabetes such as blood sugar level (glucose), blood pressure, pregnancies, diabetes pedigree function etc. Originally there are about 768 of sample in the dataset but I decided to remove some due to existence of null value (empty) and it became 757 in total of the sample. 

During doing the project 2, I spent a lot of time doing research of the topic that I'm doing which is "Predicting Diabetes". I researched about the factors of diabetes, the causes of diabetes, the basic little facts about diabetes. I also did research on how to do R language to make it easier for me to do the statistic of each test. I referred my lecturer, Dr. Chan Weng Howe a lot and I also referred the R notes that he gave to us in elearning. I learnt a lot of new things during completing this project 2.