Why Combine Momentum and RMSProp?

  1. Momentum: faster convergence by smoothing gradients with an exponential moving average, which accelerates updates along directions where the gradient is consistent over time.
  2. Adaptive rates (RMSProp): per-dimension learning-rate scaling based on a running average of squared gradients, which improves stability and handles different feature scales and sparse gradients.
  3. Adam combines both: it maintains exponential moving averages of the first moment (the gradient) and the second moment (the squared gradient).
  4. Additionally, it corrects the bias in these moving averages; because both averages are initialized at zero, they underestimate the true moments in early iterations, and the correction compensates for this (see the sketch below).

Result: Adam is robust to hyperparameter choices, typically converges faster with less tuning, and often outperforms SGD with momentum in practice.
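
A minimal sketch of a single Adam update in NumPy, to make the two moving averages and the bias correction concrete. The function name `adam_step` and the toy usage are illustrative assumptions; the hyperparameter defaults (lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8) are the commonly cited defaults from the Adam paper.

```python
import numpy as np

def adam_step(params, grads, m, v, t,
              lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update. params, grads, m, v are same-shape arrays;
    t is the 1-based iteration count. (Illustrative sketch.)"""
    # First moment: exponential moving average of gradients (the momentum part).
    m = beta1 * m + (1 - beta1) * grads
    # Second moment: exponential moving average of squared gradients (the RMSProp part).
    v = beta2 * v + (1 - beta2) * grads**2
    # Bias correction: m and v start at zero, so early averages are biased
    # toward zero; dividing by (1 - beta^t) compensates for this.
    m_hat = m / (1 - beta1**t)
    v_hat = v / (1 - beta2**t)
    # Per-dimension update: step scaled by the adaptive denominator.
    params = params - lr * m_hat / (np.sqrt(v_hat) + eps)
    return params, m, v

# Toy usage (assumed example): minimize f(w) = ||w||^2.
w = np.array([1.0, -2.0, 3.0])
m, v = np.zeros_like(w), np.zeros_like(w)
for t in range(1, 201):
    grad = 2 * w          # gradient of ||w||^2
    w, m, v = adam_step(w, grad, m, v, t)
```

Note how the first moment plays the role of momentum's smoothed gradient, while dividing by the square root of the second moment reproduces RMSProp's per-dimension scaling; the bias-corrected estimates matter most in the first few iterations, when t is small and the correction factors are far from 1.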