Genome-wide association genetics of an adaptive trait in lodgepole pine
THOMAS L. PARCHMAN
Department of Botany, University of Wyoming, Laramie, WY 82071, USA
Search for more papers by this authorZACHARIAH GOMPERT
Department of Botany, University of Wyoming, Laramie, WY 82071, USA
Search for more papers by this authorJOANN MUDGE
National Center for Genome Resources, Santa Fe, NM, USA
Search for more papers by this authorFAYE D. SCHILKEY
National Center for Genome Resources, Santa Fe, NM, USA
Search for more papers by this authorCRAIG W. BENKMAN
Department of Zoology and Physiology, University of Wyoming, Laramie, WY 82071, USA
Search for more papers by this authorC. ALEX BUERKLE
Department of Botany, University of Wyoming, Laramie, WY 82071, USA
Search for more papers by this authorTHOMAS L. PARCHMAN
Department of Botany, University of Wyoming, Laramie, WY 82071, USA
Search for more papers by this authorZACHARIAH GOMPERT
Department of Botany, University of Wyoming, Laramie, WY 82071, USA
Search for more papers by this authorJOANN MUDGE
National Center for Genome Resources, Santa Fe, NM, USA
Search for more papers by this authorFAYE D. SCHILKEY
National Center for Genome Resources, Santa Fe, NM, USA
Search for more papers by this authorCRAIG W. BENKMAN
Department of Zoology and Physiology, University of Wyoming, Laramie, WY 82071, USA
Search for more papers by this authorC. ALEX BUERKLE
Department of Botany, University of Wyoming, Laramie, WY 82071, USA
Search for more papers by this authorAbstract
Pine cones that remain closed and retain seeds until fire causes the cones to open (cone serotiny) represent a key adaptive trait in a variety of pine species. In lodgepole pine, there is substantial geographical variation in serotiny across the Rocky Mountain region. This variation in serotiny has evolved as a result of geographically divergent selection, with consequences that extend to forest communities and ecosystems. An understanding of the genetic architecture of this trait is of interest owing to the wide-reaching ecological consequences of serotiny and also because of the repeated evolution of the trait across the genus. Here, we present and utilize an inexpensive and time-effective method for generating population genomic data. The method uses restriction enzymes and PCR amplification to generate a library of fragments that can be sequenced with a high level of multiplexing. We obtained data for more than 95 000 single nucleotide polymorphisms across 98 serotinous and nonserotinous lodgepole pines from three populations. We used a Bayesian generalized linear model (GLM) to test for an association between genotypic variation at these loci and serotiny. The probability of serotiny varied by genotype at 11 loci, and the association between genotype and serotiny at these loci was consistent in each of the three populations of pines. Genetic variation across these 11 loci explained 50% of the phenotypic variation in serotiny. Our results provide a first genome-wide association map of serotiny in pines and demonstrate an inexpensive and efficient method for generating population genomic data.
References
- 454 Life Sciences Corp. (2009) Using multiplex identifier (MID) adaptors for the GS FLX Titanium chemistry-extended MID set. Tech. rep., Technical Bulletin: Genome Sequencer FLX System.
- Albrechtsen A, Nielsen FC, Nielsen R (2010) Ascertainment biases in snp chips affect measures of population divergence. Molecular Biology and Evolution, 27, 2534–2547.
- Andolfatto P, Davison D, Erezyilmaz D et al. (2011) Multiplexed shotgun genotyping for rapid and efficient genetic mapping. Genome Research, 21, 610–617.
- Arno SF (1980) Forest fire history in the northern rockies. Journal of Forestry, 78, 460–465.
- Ayliffe MA, Timmis JN (1992) Tobacco nuclear DNA contains long tracts of homology to chloroplast DNA. TAG Theoretical and Applied Genetics, 85, 229–238.
- Balding DJ (2003) Likelihood-based inference for genetic correlation coefficients. Theoretical Population Biology, 63, 221–230.
- Balding DJ, Nichols RA (1995) A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity. Genetica, 96, 3–12.
- Baxter SW, Davey JW, Johnston JS et al. (2011) Linkage mapping and comparative genomics using next-generation RAD sequencing of a non-model organism. PLoS One, 6, e19315.
- Benkman CW, Siepielski AM (2004) A keystone selective agent? Pine squirrels and the frequency of serotiny in lodgepole pine. Ecology, 85, 2082–2087.
- Benkman CW, Holimon WC, Smith JW (2001) The influence of a competitor on the geographic mosaic of coevolution between crossbills and lodgepole pine. Evolution, 55, 282–294.
- Benkman CW, Parchman TL, Favis A, Siepielski AM (2003) Reciprocal selection causes a coevolutionary arms race between crossbills and lodgepole pine. American Naturalist, 162, 182–194.
- Benkman CW, Siepielski AM, Parchman TL (2008) The local introduction of strongly interacting species and the loss of geographic variation in species and species interactions. Molecular Ecology, 17, 395–404.
- Brown GR, Gill GP, Kuntz RJ, Langley CH, Neale DB (2004) Nucleotide diversity and linkage disequilibrium in loblolly pine. Proceedings of National Academy of Sciences of the United States of America, 101, 15255–15260.
- Carlson CS, Eberle MA, Kruglyak L et al. (2003) Additional SNPs and linkage-disequilibrium analyses are necessary for whole-genome association studies in humans. Nature Genetics, 33, 518-521.
- Clark AG, Hubisz MJ, Bustamante CD, Williamson SH, Nielsen R (2005) Ascertainment bias in studies of human genome-wide polymorphism. Genome Research, 15, 1496–1502.
- Cosart T, Beja-Pereira A, Chen S et al. (2011) Exome-wide DNA capture and next generation sequencing in domestic and wild species. BMC Genomics, 12, 347.
- Craig DW, Pearson JV, Szelinger S et al. (2008) Identification of genetic variants using bar-coded multiplexed sequencing. Nature Methods, 5, 887–893.
- Critchfield WB (1980) Genetics of lodgepole pine. United States Forest Service Research Paper WO-37.
- Cumbie WP, Eckert A, Wegrzyn J et al. (2011) Association genetics of carbon isotope discrimination, height and foliar nitrogen in a natural population of Pinus taeda L. Heredity, 107, 105–114.
- Dancik BP, Yeh FC (1983) Allozyme variability and evolution of lodgepole pine (Pinus contorta var latifolia) and Jack pine (Pinus banksiana) in Alberta. Canadian Journal of Genetics and Cytology, 25, 57–64.
- Dong JS, Wagner DB (1994) Paternally inherited chloroplast polymorphism in Pinus– estimation of diversity and population subdivision, and tests of disequilibrium with a maternally inherited mitochondrial polymorphism. Genetics, 136, 1187–1194.
- Doyle J (1991) DNA protocols for plants: a CTAB total DNA isolation. In: Molecular Techniques in Taxonomy (eds GM Hewitt, A Johnston), pp. 283–293. Springer, New York.
- Eckert AJ, Bower AD, Wegrzyn JL et al. (2009a) Association genetics of coastal douglas fir (Pseudotsuga menziesii var. menziesii, Pinaceae). I. cold-hardiness related traits. Genetics, 182, 1289–1302.
- Eckert AJ, Pande B, Ersoz ES et al. (2009b) High-throughput genotyping and mapping of single nucleotide polymorphisms in loblolly pine (Pinus taeda L.). Tree Genetics & Genomes, 5, 225–234.
- Eckert AJ, Wegrzyn JL, Pande B et al. (2009c) Multilocus patterns of nucleotide diversity and divergence reveal positive selection at candidate genes related to cold hardiness in coastal Douglas fir (Pseudotsuga menziesii var. menziesii). Genetics, 183, 289–98.
- Eckert AJ, van Heerwaarden J, Wegrzyn JL et al. (2010) Patterns of population structure and environmental associations to aridity across the range of loblolly pine (Pinus taeda L., Pinaceae). Genetics, 185, 969–982.
- Elshire RJ, Glaubitz JC, Sun Q et al. (2011) A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One, 6, e19379.
- Epperson BK, Allard RW (1989) Spatial auto-correlation analysis of the distribution of genotypes within populations of lodgepole pine. Genetics, 121, 369–377.
- Falush D, Stephens M, Pritchard JK (2003) Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics, 164, 1567–1587.
- Givnish TJ (1981) Serotiny, geography, and fire in the pine barrens of New Jersey. Evolution, 35, 101–123.
- Godbout J, Fazekas A, Newton CH, Yeh FC, Bousquet J (2008) Glacial vicariance in the Pacific Northwest: evidence from a lodgepole pine mitochondrial DNA minisatellite for multiple genetically distinct and widely separated refugia. Molecular Ecology, 17, 2463–2475.
- Gompert Z, Forister ML, Fordyce JA et al. (2010) Bayesian analysis of molecular variance in pyrosequences quantifies population genetic structure across the genome of Lycaeides butterflies. Molecular Ecology, 19, 2455–2473.
- Gompert Z, Lucas LK, Nice CC et al. (2012) Reproductive isolation between two butterfly species evolved by divergent selection. Evolution, DOI:10.1111/j.1558-5646.2012.01587.x.
- González-Martínez SC, Huber D, Ersoz E, Davis JM, Neale DB (2008) Association genetics in Pinus taeda L. II. Carbon isotope discrimination. Heredity, 101, 19–26.
- González-Martínez SC, Krutovsky KV, Neale DB (2006) Forest-tree population genomics and adaptive evolution. New Phytologist, 170, 227–238.
- González-Martínez SC, Wheeler NC, Ersoz E, Nelson CD, Neale DB (2007) Association genetics in Pinus taeda L. I. Wood property traits. Genetics, 175, 399–409.
- Grotkopp E, Rejmánek M, Sanderson MJ, Rost TL (2004) Evolution of genome size in pines (pinus) and its life-history correlates: supertree analyses. Evolution, 58, 1705–1729.
- Gupta P, Rustgi S, Kulwal P (2005) Linkage disequilibrium and association studies in higher plants: present status and future prospects. Plant Molecular Biology, 57, 461–485.
- Hall SE, Dvorak WS, Johnston JS, Price HJ, Williams CG (2000) Flow cytometric analysis of dna content for tropical and temperate new world pines. Annals of Botany, 86, 1081–1086.
- Heuertz M, De Paoli E et al. (2006) Multilocus patterns of nucleotide diversity, linkage disequilibrium and demographic history of norway spruce [Picea abies (l.) karst]. Genetics, 174, 2095–2105.
- Hirschhorn JN, Daly MJ (2005) Genome-wide association studies for common diseases and complex traits. Nature Reviews. Genetics, 6, 95–108.
- Hohenlohe PA, Bassham S, Etter PD et al. (2010) Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genetics, 6, e1000862.
- Holliday JA, Ritland K, Aitken SN (2010) Widespread, ecologically relevant genetic markers developed from association mapping of climate-related traits in Sitka spruce (Picea sitchensis). New Phytologist, 188, 501–514.
- Huang CY, Ayliffe MA, Timmis JN (2003) Direct measurement of the transfer rate of chloroplast dna into the nucleus. Nature, 422, 72–76.
- Hudson ME (2008) Sequencing breakthroughs for genomic ecology and evolutionary biology. Molecular Ecology Resources, 8, 3–17.
- Joyner KL, Wang XR, Johnston JS, Price HJ, Williams CG (2001) DNA content for asian pines parallels new world relatives. Canadian Journal of Botany, 79, 192–196.
- Krutovsky KV, Neale DB (2005) Nucleotide diversity and linkage disequilibrium in cold-hardiness- and wood quality-related candidate genes in Douglas-fir. Genetics, 171, 2029–2041.
- Krutovsky KV, Clair JBS, Saich R, Hipkins VD, Neale DB (2009) Estimation of population structure in coastal Douglas-fir [Pseudotsuga menziesii (mirb.) franco var. menziesii] using allozyme and microsatellite markers. Tree Genetics & Genomes, 5, 641–658.
- Kuhner MK, Beerli P, Yamato J, Felsenstein J (2000) Usefulness of single nucleotide polymorphism data for estimating population parameters. Genetics, 156, 439–447.
- Li JZ, Absher DM, Tang H et al. (2008) Worldwide human relationships inferred from genome-wide patterns of variation. Science, 319, 1100–1104.
- Li J, Li Q, Hou W et al. (2009) An algorithmic model for constructing a linkage and linkage disequilibrium map in outcrossing plant populations. Genetics Research, 91, 9–21.
- Lotan JE (1975) The role of cone serotiny in lodgepole pine forests. In: Symposium Proceedings, Washington State University, Pullman, Washington, USA (ed. DM Baumgartner), pp. 471–495.
- Lynch M (2007) The Origins of Genome Architecture. Sinauer Associates, Sunderland, MA.
- McCarthy MI, Abecasis GR, Cardon LR et al. (2008) Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nature Reviews. Genetics, 9, 356–369.
- McVean G (2009) A genealogical interpretation of principal components analysis. Plos Genetics, 5, e1000686.
- Meyer M, Kircher M (2010) Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harbor Protocols, 2010, DOI:10/1101/pdb.prot5448.
10.1101/pdb.prot5448 Google Scholar
- Morse AM, Peterson DG, Islam-Faridi MN et al. (2009) Evolution of genome size and complexity in Pinus. PLoS One, 4, e4332.
- Moya D, Saracino A, Salvatore R et al. (2008) Anatomic basis and insulation of serotinous cones in Pinus halepensis. Trees – Structure and Function, 22, 511–519.
- Neale DB (2007) Genomics to tree breeding and forest health. Current Opinion in Genetics & Development, 17, 539–544.
- Neale D, Kremer A (2011) Forest tree genomics: growing resources and applications. Nature Reviews. Genetics, 12, 111–122.
- Neale DB, Savolainen O (2004) Association genetics of complex traits in conifers. Trends in Plant Science, 9, 325–330.
- Nicholson G, Smith AV, Jonsson F et al. (2002) Assessing population differentiation and isolation from single-nucleotide polymorphism data. Journal of the Royal Statistical Society. Series B, Statistical Methodology, 64, 695–715.
- Novembre J, Stephens M (2008) Interpreting principal component analyses of spatial population genetic variation. Nature Genetics, 40, 646–649.
- Novembre J, Johnson T, Bryc K et al. (2008) Genes mirror geography within Europe. Nature, 456, 98–101.
- van Orsouw NJ, Hogers RCJ, Janssen A et al. (2007) Complexity reduction of polymorphic sequences (CRoPS): a novel approach for large-scale polymorphism discovery in complex genomes. PLoS One, 2, e1172.
- Parchman TL, Geist KS, Grahnen JA, Benkman CW, Buerkle CA (2010) Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics, 11, 180.
- Parchman TL, Benkman CW, Jenkins B, Buerkle CA (2011) Low levels of population genetic structure in Pinus contorta (Pinaceae) across a geographic mosaic of co-evolution. American Journal of Botany, 98, 669–679.
- Perry DA, Lotan JE (1979) Opening temperatures of serotinous cones of lodgepole pine. Tech. rep., Forest Service, U.S. Department of Agriculture Research Note INT-228.
- Quesada T, Vikneswaran G, Cumbie WP et al. (2010) Association mapping of quantitative disease resistance in a natural population of Loblolly Pine (Pinus taeda L.). Genetics, 186, 677–686.
- R Development Core Team (2011) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, ISBN 3-900051-07-0.
- Rannala B, Hartigan JA (1996) Estimating gene flow in island populations. Genetical Research, 67, 147–158.
- Rudolph TC, Schoenike RE, Schantz-Hanzen T (1959) Results of one-parent progeny tests relating to the inheritance of open and closed cones in Jack pine. Minnesota Forestry Notes, 78, 1–2.
- Stinchcombe JR, Hoekstra HE (2007) Combining population genomics and quantitative genetics: finding the genes underlying ecologically important traits. Heredity, 100, 158–170.
- Tapias R, Climent J, Pardos J, Gil L (2004) Life histories of mediterranean pines. Plant Ecology, 171, 53–68.
- Teich AH (1970) Cone serotiny and inbreeding in natural populations of Pinus banksiana and Pinus contorta. Canadian Journal of Botany, 48, 1805–1809.
- Turner MG, Hargrove WW, Gardner RH, Romme WH (1994) Effects of fire on landscape heterogeneity in Yellowstone National Park, Wyoming. Journal of Vegetation Science, 5, 731–742.
- Turner MG, Romme WH, Gardner RH, Hargrove WW (1997) Effects of fire size and patterns on early succession in Yellowstone National Park. Ecological Monographs, 67, 411–433.
- Turner MG, Romme WH, Tinker DB (2003) Surprises and lessons from the 1988 yellowstone fires. Frontiers in Ecology and the Environment, 1, 351–358.
- Vos P, Hogers R, Bleeker M et al. (1995) AFLP – a new technique for DNA-fingerprinting. Nucleic Acids Research, 23, 4407–4414.
- Weir BS (1979) Inferences about linkage disequilibrium. Biometrics, 35, 235–254.
- Wheeler NC, Guries RP (1982) Biogeography of lodgepole pine. Canadian Journal of Botany, 60, 1805–1814.
- Wymore AS, Keeley ATH, Yturralde KM et al. (2011) Genes to ecosystems: exploring the frontiers of ecology with one of the smallest biological units. New Phytologist, 191, 19–36.
- Yeh FC, Cheliak WM, Dancik BP et al. (1985) Population differentiation in lodgepole pine, Pinus contorta spp latifolia– a discriminant-analysis of allozyme variation. Canadian Journal of Genetics and Cytology, 27, 210–218.
- Yu J, Pressoir W, Briggs W et al. (2006) A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nature Genetics, 38, 203–208.
- Yuan Q, Hill J, Hsiao J et al. (2002) Genome sequencing of a 239-kb region of rice chromosome 10L reveals a high frequency of gene duplication and a large chloroplast dna insertion. Molecular Genetics and Genomics, 267, 713–720.