Helsinki Open Data Science

Exercise

Cross-validation

Cross-validation is a method of testing a predictive model on unseen data. In cross-validation, the value of a penalty (loss) function, such as the mean prediction error, is computed on data not used for fitting the model. The lower the value, the better.

Cross-validation gives a good estimate of the actual predictive power of the model. It can also be used to compare different models or classification methods.

Instructions

  • Define the loss function loss_func and use it to compute the mean prediction error for the training data. The high_use column in alc is the target, and the probability column holds the predictions.
  • Perform leave-one-out cross-validation and print out the mean prediction error for the testing data. nrow(alc) gives the number of observations in alc, and setting K = nrow(alc) defines the leave-one-out method. The cv.glm() function from the 'boot' library computes the error and stores it in delta; see ?cv.glm for more information.
  • Adjust the code to perform 10-fold cross-validation and print out the mean prediction error for the testing data. Is the prediction error higher or lower on the testing data than on the training data? Why?
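The steps above can be sketched as follows. This is a hedged outline, not the official exercise solution: it assumes a data frame alc with a binary (0/1 or logical) high_use column, a fitted logistic regression object m produced by glm(..., family = "binomial"), and a probability column of predicted probabilities already added to alc.

```r
# assumed setup (not shown in the exercise text):
#   alc  - data frame with a binary 'high_use' target
#   m    - a glm() logistic regression model fitted on alc
#   alc$probability - predicted probabilities from m
library(boot)

# loss function: proportion of incorrect predictions,
# i.e. cases where the probability is on the wrong side of 0.5
loss_func <- function(class, prob) {
  n_wrong <- abs(class - prob) > 0.5
  mean(n_wrong)
}

# mean prediction error on the training data
loss_func(class = alc$high_use, prob = alc$probability)

# leave-one-out cross-validation: K equal to the number of
# observations means each fold leaves out exactly one row
cv_loo <- cv.glm(data = alc, cost = loss_func, glmfit = m, K = nrow(alc))
cv_loo$delta[1]  # mean prediction error on the testing data

# 10-fold cross-validation: same call, smaller K
cv_10 <- cv.glm(data = alc, cost = loss_func, glmfit = m, K = 10)
cv_10$delta[1]
```

cv.glm() stores two error estimates in delta; the first element is the raw cross-validation estimate of the prediction error, which is the one printed here. The testing-data error is typically at least as high as the training error, because the model is evaluated on observations it was not fitted on.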