(19/20) SEMESTER 2

PSDA PROJECT 2- PREDICTING DIABETES

INTRODUCTION

The dataset was published by Faysal Islam. This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The purpose of the dataset is to diagnostically predict whether a patient has diabetes, based on certain diagnostic measurements included in the dataset. 

There is 2 type of diabetes. Type 1 diabetes, when known as adolescent diabetes or insulin-subordinate diabetes, is a ceaseless condition in which the pancreas delivers practically zero insulin. Insulin is a hormone expected to permit sugar (glucose) to enter cells to create vitality. Various components, including hereditary qualities and some infections, may add to type 1 diabetes. In spite of the fact that type 1 diabetes normally shows up during youth or puberty, it can create in grown-ups (Mayo, 2017). Type 2 diabetes is an interminable, possibly weakening and frequently deadly ailment requiring standard checking of a person's glucose level and treatment. In type 2 diabetes, the body either doesn't appropriately deliver or utilize insulin, a hormone created by the pancreas that assists move with sugaring into cells. Consequently, the body gets impervious to insulin. This opposition causes high glucose levels (OAC, 2012).

            From this dataset that I chose, I want to predict diabetes by considering a few variables/factors (such as pregnancies, glucose etc) that possibly has relationship or rely on each other that gives the result to have diabetes.

REPORT

REFLECTION

In this project 2, we are required to find a dataset on the internet and do some hypothesis testing on the dataset on certain claimed value/facts. I chose a dataset that contained the data about the factors of diabetes such as blood sugar level (glucose), blood pressure, pregnancies, diabetes pedigree function etc. Originally there are about 768 of sample in the dataset but I decided to remove some due to existence of null value (empty) and it became 757 in total of the sample. 

During doing the project 2, I spent a lot of time doing research of the topic that I'm doing which is "Predicting Diabetes". I researched about the factors of diabetes, the causes of diabetes, the basic little facts about diabetes. I also did research on how to do R language to make it easier for me to do the statistic of each test. I referred my lecturer, Dr. Chan Weng Howe a lot and I also referred the R notes that he gave to us in elearning. I learnt a lot of new things during completing this project 2. 

Details

VIDEO PRESENTATION