Note: Your answers to the questions below should follow the expectations for homework found here. Questions outside of class can be asked on the Module Assignments-Questions Teams channel (see link on homepage).

Diabetes

Health service professionals working with the Pima people want to develop, for women of Pima heritage, a model that may be used to predict diabetes from other more readily measured variables. To begin this process the researchers collated data from the National Institute of Diabetes and Digestive and Kidney Diseases for women of Pima heritage that were at least 21 years old. Their data set is in diabetes.csv.1

From those data use plasma glucose concentration (Glucose) to possibly develop a model to predict the presence of diabetes or not (coded as 1=diabetes and 0=no diabetes in the Outcome variable). Note that glucose values of 0 in the data are errors and should be removed (see this note). Please follow the workflow and the tenor of the example analyses in the reading.

  1. These data are originally from here