
Improvement of Predictive Ability by Uniform Coverage of the Target Genetic Space

Authors :: Daniela Bustos-Korts
Marcos Malosetti
Scott Chapman
Ben Biddulph
Fred van Eeuwijk
Source :: G3: Genes, Genomes, Genetics, Vol 6, Iss 11, Pp 3733-3747 (2016)
Publication Year :: 2016
Publisher :: Oxford University Press, 2016.
Abstract: Genome-enabled prediction provides breeders with the means to increase the number of genotypes that can be evaluated for selection. One of the major challenges in genome-enabled prediction is how to construct a training set of genotypes from a calibration set that represents the target population of genotypes, where the calibration set is composed of a training and a validation set. A random sampling protocol of genotypes from the calibration set will lead to low-quality coverage of the total genetic space by the training set when the calibration set contains population structure. As a consequence, predictive ability will be affected negatively, because some parts of the genotypic diversity in the target population will be under-represented in the training set, whereas other parts will be over-represented. Therefore, we propose a training set construction method that uniformly samples the genetic space spanned by the target population of genotypes, thereby increasing predictive ability. To evaluate our method, we constructed training sets, and identified the corresponding genomic prediction models, for four genotype panels that differed in the amount of population structure they contained (maize Flint, maize Dent, wheat, and rice). Training sets were constructed using uniform sampling, stratified-uniform sampling, stratified sampling, and random sampling. We compared these methods with a method that maximizes the generalized coefficient of determination (CD). Several training set sizes were considered. We investigated four genomic prediction models: multi-locus QTL models, GBLUP models, combinations of QTL and GBLUPs, and Reproducing Kernel Hilbert Space (RKHS) models. For the maize and wheat panels, construction of the training set under uniform sampling led to a larger predictive ability than under stratified and random sampling. The results of our methods were similar to those of the CD method. For the rice panel, all training set construction methods led to similar predictive ability, a reflection of the very strong population structure in this panel.
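
Illustration: the abstract does not spell out the sampling algorithm, so the sketch below is not the authors' procedure. It only illustrates the general idea of spreading a training set evenly over genetic space. It assumes that "genetic space" can be approximated by the leading principal components of a marker matrix, and it swaps in greedy farthest-point (maximin) selection as a stand-in for uniform sampling. The function names, the synthetic two-subpopulation marker data, and the training set size are all hypothetical.

```python
import numpy as np

def pca_scores(markers, n_components=2):
    """Project a genotypes-by-markers matrix onto its leading principal components."""
    centered = markers - markers.mean(axis=0)
    # SVD of the centered matrix; u * s gives the principal component scores.
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    return u[:, :n_components] * s[:n_components]

def farthest_point_sample(scores, n_train):
    """Greedy maximin selection (an assumed stand-in for uniform sampling):
    each new pick is the genotype farthest from all genotypes chosen so far."""
    # Deterministic start: the genotype closest to the centroid of the scores.
    start = int(np.argmin(np.linalg.norm(scores - scores.mean(axis=0), axis=1)))
    chosen = [start]
    # Distance from every genotype to its nearest already-chosen genotype.
    min_dist = np.linalg.norm(scores - scores[start], axis=1)
    while len(chosen) < n_train:
        nxt = int(np.argmax(min_dist))
        chosen.append(nxt)
        dist_to_new = np.linalg.norm(scores - scores[nxt], axis=1)
        min_dist = np.minimum(min_dist, dist_to_new)
    return chosen

# Synthetic calibration set (hypothetical): a large and a small subpopulation
# with different allele frequencies, coded as 0/1/2 allele counts.
rng = np.random.default_rng(0)
large_group = rng.binomial(2, 0.2, size=(180, 50))
small_group = rng.binomial(2, 0.8, size=(20, 50))
markers = np.vstack([large_group, small_group]).astype(float)

scores = pca_scores(markers)
train_idx = farthest_point_sample(scores, n_train=20)
n_from_small = sum(i >= 180 for i in train_idx)
print(f"{n_from_small} of {len(train_idx)} training genotypes come from the small group")
```

Because each pick maximizes the distance to the genotypes already selected, the small subpopulation is reached early rather than being left to chance, which is the coverage property the abstract contrasts with random sampling.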

Subjects :: genomic prediction
population structure
genetic space
training set
RKHS model
GenPred
Shared Data Resources
Genomic Selection
Genetics
QH426-470

Details

Language :: English
ISSN :: 21601836
Volume :: 6
Issue :: 11
Database :: Directory of Open Access Journals
Journal :: G3: Genes, Genomes, Genetics
Publication Type :: Academic Journal
Accession number :: edsdoj.8aaaffed2aef48efbf6c4aa546ef1e3a
Document Type :: article
Full Text :: https://doi.org/10.1534/g3.116.035410

