Diabetes
Health service professionals working with the Pima people want to develop, for women of Pima heritage, a model that may be used to predict diabetes from other more readily measured variables. To begin this process the researchers collated data from the National Institute of Diabetes and Digestive and Kidney Diseases for women of Pima heritage that were at least 21 years old. Their data set is in diabetes.csv.1
From those data use plasma glucose concentration (Glucose
) to possibly develop a model to predict the presence of diabetes or not (coded as 1
=diabetes and 0
=no diabetes in the Outcome
variable). Note that glucose values of 0 in the data are errors and should be removed (see this note). Please follow the workflow and the tenor of the example analyses in the reading.