Adrian O. Stec, Robert M. Stupar, Guillem Rigaill, Marc André Lemay, Davoud Torkamaneh, François Belzile, Brian Boyle, Université Laval [Québec] (ULaval), Department of Plant Agriculture, University of Guelph, Institut des Sciences des Plantes de Paris-Saclay (IPS2 (UMR_9213 / UMR_1403)), Institut National de la Recherche Agronomique (INRA)-Université Paris-Sud - Paris 11 (UP11)-Université Paris Diderot - Paris 7 (UPD7)-Université d'Évry-Val-d'Essonne (UEVE)-Centre National de la Recherche Scientifique (CNRS), Laboratoire de Mathématiques et Modélisation d'Evry (LaMME), Institut National de la Recherche Agronomique (INRA)-Université d'Évry-Val-d'Essonne (UEVE)-ENSIIE-Centre National de la Recherche Scientifique (CNRS), Institut de Biologie intégrative et des systèmes, Department of Agronomy and Plant Genetics, University of Minnesota [Twin Cities] (UMN), University of Minnesota System-University of Minnesota System, Genome Quebec, Genome CanadaGenome Canada, government of Canada, Ministere de l'Economie, Science et Innovation du Quebec, Semences Prograin Inc., Syngenta Canada Inc., Saskatchewan Pulse Growers, Manitoba Pulse & Soybean Growers, Canadian Field Crop Research Alliance, Producteurs de grains du Quebec, National Science FoundationNational Science Foundation (NSF) [1444581], NSERC Canada Vanier Graduate Scholarship, AgroPhytoSciences NSERC CREATE Training Program, SoyaGen grant, and Belzile, Francois
Background The effective use of mutant populations for reverse genetic screens relies on the population-wide characterization of the induced mutations. Genome- and population-wide characterization of the mutations found in fast neutron populations has been hindered, however, by the wide range of mutations generated and the lack of affordable technologies to detect DNA sequence changes. In this study, we therefore aimed to test whether genotyping-by-sequencing (GBS) technology could be used to characterize copy number variation (CNV) induced by fast neutrons in a soybean mutant population. Results We called CNVs from GBS data in 79 soybean mutants and assessed the sensitivity and precision of this approach by validating our results against array comparative genomic hybridization (aCGH) data for 19 of these mutants as well as targeted PCR and ddPCR assays for a representative subset of the smallest events detected by GBS. Our GBS pipeline detected 55 of the 96 events found by aCGH, with approximate detection thresholds of 60 kb, 500 kb and 1 Mb for homozygous deletions, hemizygous deletions and duplications, respectively. Among the whole set of 79 mutants, the GBS data revealed 105 homozygous deletions, 32 hemizygous deletions and 19 duplications. This included several extremely large events, exhibiting maximum sizes of ~ 11.2 Mb for a homozygous deletion, ~ 11.6 Mb for a hemizygous deletion, and ~ 50 Mb for a duplication. Conclusions This study provides a proof of concept that GBS can be used as an affordable high-throughput method for assessing CNVs in fast neutron mutants. The modularity of this GBS approach allows combining as many different libraries or sequencing runs as is necessary for reaching the goals of a particular study. This method should enable the low-cost genome-wide characterization of hundreds to thousands of individuals in fast neutron mutant populations or any population with large genomic deletions and duplications. Electronic supplementary material The online version of this article (10.1186/s12864-019-5998-1) contains supplementary material, which is available to authorized users.