The accuracy of different strategies for building training sets for genomic predictions in segregating soybean populations

Document Type

Article

Publication Date

11-1-2020

Abstract

The design of the training set is a key factor in the success of the genomic selection approach. The nature of line inclusion in soybean [Sorghum bicolor (L.) Moench.] breeding programs is highly dynamic, so generating a training set that endures across the years and regions is challenging. Therefore, we aimed to define the best strategies for building training sets to apply genomic selection in segregating soybean populations for traits with different genetic architectures. We used two datasets for grain yield (GY) and maturity group (MG) from two different soybean breeding regions in Brazil. Five training set schemes were tested. In addition, we included a training set formed by an optimization algorithm based on the predicted error variance. The predictions achieved good values for both traits, reaching 0.5 in some scenarios. The best scenario changed according to the trait. Although the best performance was achieved with the use of full-sibs in the MG dataset, for GY, full-sibs and a set of advanced lines were equivalent. For both traits, no improvement in predictive ability resulted from training set optimization. Furthermore, the use of advanced lines from the same breeding program is recommended as a training set for GY, so the training set is continually renewed and closely related to the breeding populations, and no additional phenotyping is needed. On the other hand, to improve prediction accuracies for MG, it is necessary to use training sets with less genetic variability but with more segregation resolution.

Publication Source (Journal or Book title)

Crop Science

First Page

3115

Last Page

3126

This document is currently not available here.

Share

COinS