Validation split (machine learning)

Last revised by Daniel J Bell on 3 Apr 2023

Citation, DOI, disclosures and article data

Citation:

Wang D, Bell D, Moore C, et al. Validation split (machine learning). Reference article, Radiopaedia.org (Accessed on 16 Apr 2024) https://doi.org/10.53347/rID-61721

DOI:

https://doi.org/10.53347/rID-61721

Permalink:

https://radiopaedia.org/articles/61721?iframe=true

rID:

61721

Article created:

16 Jul 2018, David John Wang

Disclosures:

At the time the article was created David John Wang had no recorded disclosures.

View David John Wang's current disclosures

Last revised:

3 Apr 2023, Daniel J Bell ◉

Disclosures:

At the time the article was last revised Daniel J Bell had no financial relationships to ineligible companies to disclose.

View Daniel J Bell's current disclosures

Revisions:

5 times, by 4 contributors - see full revision history and disclosures

Tags:

In order to ensure that machine learning models are able to generalize well to new data not seen before by the model, it is important to have several sets of data including training data, test data, and cross-validation split data for the original set of data to obtain the best possible predictive model.

Training set

When conducting machine learning, data collection is critical to generate accurate algorithms to make good predictions. A predictive model is created after undergoing training utilizing a training set of known examples.

Test set

A credible method is required to test the accuracy of the model after training. Using the same training examples for testing is unlikely to give an accurate representation of the predictive accuracy of the model as the model is likely to be biased towards the training set. Thus, the original data set is usually split to make a test set. The test set is usually used to select the algorithm with the best performance.

Cross-validation set

Selecting an algorithm based on the test set could lead to further biases. As the algorithm is selected from the best performance based on the same test set, this is not an accurate representation of generalized accuracy to examples never seen before by the algorithm (as a test set is finite and does not necessarily cover the wide variety of real examples). The algorithm selected will likely have an optimistic estimation of the generalization error. Consequently, the original dataset is further split to include a cross-validation set. The cross-validation set is used to select the best performing algorithm, and the test set is used to estimate the generalization error from this algorithm.

training set
- data points used to train the algorithm
cross-validation set
- data points used to select the best algorithm
test set
- data points used to test the selected algorithm for the generalization error/accuracy.

A typical split of the original dataset is 60% training, 20% cross-validation and 20% test sets.

Validation split (machine learning)

Citation, DOI, disclosures and article data

Training set

Test set

Cross-validation set

References

Related articles: Artificial intelligence

Promoted articles (advertising)