Genomic Signatures of North American Soybean Improvement Inform Diversity Enrichment Strategies and Clarify the Impact of Hybridization

G3 (Bethesda). 2016 Sep 8;6(9):2693-705. doi: 10.1534/g3.116.029215.

Abstract

Crop improvement represents a long-running experiment in artificial selection on a complex trait, namely yield. How such selection relates to natural populations is unclear, but the analysis of domesticated populations could offer insights into the relative role of selection, drift, and recombination in all species facing major shifts in selective regimes. Because of the extreme autogamy exhibited by soybean (Glycine max), many "immortalized" genotypes of elite varieties spanning the last century have been preserved and characterized using ∼50,000 single nucleotide polymorphic (SNP) markers. Also due to autogamy, the history of North American soybean breeding can be roughly divided into pre- and posthybridization eras, allowing for direct interrogation of the role of recombination in improvement and selection. Here, we report on genome-wide characterization of the structure and history of North American soybean populations and the signature of selection in these populations. Supporting previous work, we find that maturity defines population structure. Though the diversity of North American ancestors is comparable to available landraces, prehybridization line selections resulted in a clonal structure that dominated early breeding and explains many of the reductions in diversity found in the initial generations of soybean hybridization. The rate of allele frequency change does not deviate sharply from neutral expectation, yet some regions bare hallmarks of strong selection, suggesting a highly variable range of selection strengths biased toward weak effects. We also discuss the importance of haplotypes as units of analysis when complex traits fall under novel selection regimes.

Keywords: detecting selection; haplotype frequencies; maturity groups; selection on standing variation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genetic Variation*
  • Genome, Plant
  • Genome-Wide Association Study*
  • Genomics
  • Genotype
  • Glycine max / genetics*
  • Haplotypes
  • Linkage Disequilibrium
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci / genetics*
  • United States