Molecular Ecology

Volume 13, Issue 11 p. 3261-3273

INVITED REVIEW

How to track and assess genotyping errors in population genetics studies

A. BONIN,

Corresponding Author

A. BONIN

Laboratoire d’Ecologie Alpine, CNRS-UMR 5553, Université Joseph Fourier, BP 53, 38041 Grenoble Cedex 09, France,

A. Bonin. Fax: +33 0 4 76 51 42 79; E-mail: [email protected]Search for more papers by this author

E. BELLEMAIN,

E. BELLEMAIN

Laboratoire d’Ecologie Alpine, CNRS-UMR 5553, Université Joseph Fourier, BP 53, 38041 Grenoble Cedex 09, France,

Department of Ecology and Natural Resource Management, Agricultural University of Norway, Box 5003, NO-1432 Ås, Norway,

Search for more papers by this author

P. BRONKEN EIDESEN,

P. BRONKEN EIDESEN

National Centre for Biosystematics, Natural History Museums and Botanical Garden, University of Oslo, PO Box 1172 Blindern, NO-0318 Oslo, Norway

Search for more papers by this author

F. POMPANON,

F. POMPANON

Laboratoire d’Ecologie Alpine, CNRS-UMR 5553, Université Joseph Fourier, BP 53, 38041 Grenoble Cedex 09, France,

Search for more papers by this author

C. BROCHMANN,

C. BROCHMANN

National Centre for Biosystematics, Natural History Museums and Botanical Garden, University of Oslo, PO Box 1172 Blindern, NO-0318 Oslo, Norway

Search for more papers by this author

P. TABERLET,

P. TABERLET

Laboratoire d’Ecologie Alpine, CNRS-UMR 5553, Université Joseph Fourier, BP 53, 38041 Grenoble Cedex 09, France,

Search for more papers by this author

A. BONIN,

Corresponding Author

A. BONIN

Laboratoire d’Ecologie Alpine, CNRS-UMR 5553, Université Joseph Fourier, BP 53, 38041 Grenoble Cedex 09, France,

A. Bonin. Fax: +33 0 4 76 51 42 79; E-mail: [email protected]Search for more papers by this author

E. BELLEMAIN,

E. BELLEMAIN

Laboratoire d’Ecologie Alpine, CNRS-UMR 5553, Université Joseph Fourier, BP 53, 38041 Grenoble Cedex 09, France,

Department of Ecology and Natural Resource Management, Agricultural University of Norway, Box 5003, NO-1432 Ås, Norway,

Search for more papers by this author

P. BRONKEN EIDESEN,

P. BRONKEN EIDESEN

National Centre for Biosystematics, Natural History Museums and Botanical Garden, University of Oslo, PO Box 1172 Blindern, NO-0318 Oslo, Norway

Search for more papers by this author

F. POMPANON,

F. POMPANON

Laboratoire d’Ecologie Alpine, CNRS-UMR 5553, Université Joseph Fourier, BP 53, 38041 Grenoble Cedex 09, France,

Search for more papers by this author

C. BROCHMANN,

C. BROCHMANN

National Centre for Biosystematics, Natural History Museums and Botanical Garden, University of Oslo, PO Box 1172 Blindern, NO-0318 Oslo, Norway

Search for more papers by this author

P. TABERLET,

P. TABERLET

Laboratoire d’Ecologie Alpine, CNRS-UMR 5553, Université Joseph Fourier, BP 53, 38041 Grenoble Cedex 09, France,

Search for more papers by this author

First published: 15 October 2004

https://doi.org/10.1111/j.1365-294X.2004.02346.x

Citations: 1,105

Read the full text

About

PDF

Tools

Share a link

Email
Facebook
Twitter
LinkedIn
Reddit
Wechat

Abstract

Genotyping errors occur when the genotype determined after molecular analysis does not correspond to the real genotype of the individual under consideration. Virtually every genetic data set includes some erroneous genotypes, but genotyping errors remain a taboo subject in population genetics, even though they might greatly bias the final conclusions, especially for studies based on individual identification. Here, we consider four case studies representing a large variety of population genetics investigations differing in their sampling strategies (noninvasive or traditional), in the type of organism studied (plant or animal) and the molecular markers used [microsatellites or amplified fragment length polymorphisms (AFLPs)]. In these data sets, the estimated genotyping error rate ranges from 0.8% for microsatellite loci from bear tissues to 2.6% for AFLP loci from dwarf birch leaves. Main sources of errors were allelic dropouts for microsatellites and differences in peak intensities for AFLPs, but in both cases human factors were non-negligible error generators. Therefore, tracking genotyping errors and identifying their causes are necessary to clean up the data sets and validate the final results according to the precision required. In addition, we propose the outline of a protocol designed to limit and quantify genotyping errors at each step of the genotyping process. In particular, we recommend (i) several efficient precautions to prevent contaminations and technical artefacts; (ii) systematic use of blind samples and automation; (iii) experience and rigor for laboratory work and scoring; and (iv) systematic reporting of the error rate in population genetics studies.

References

Ajmone-Marsan P, Negrini R, Crepaldi P et al. (2001) Assessing genetic diversity in Italian goat populations using AFLP markers. Animal Genetics, 32, 281–288.
10.1046/j.1365-2052.2001.00789.x
CASPubMedWeb of Science®Google Scholar
Ajmone-Marsan P, Valentini A, Cassandro M et al. (1997) AFLP markers for DNA fingerprinting in cattle. Animal Genetics, 28, 418–426.
10.1111/j.1365-2052.1997.00204.x
CASPubMedWeb of Science®Google Scholar
Akey JM, Zhang K, Xiong M, Doris P, Jin L (2001) The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures. American Journal of Human Genetics, 68, 1447–1456.
10.1086/320607
CASPubMedWeb of Science®Google Scholar
Akey JM, Zhang G, Zhang K, Jin L, Shriver MD (2002) Interrogating a high-density SNP map for signatures of natural selection. Genome Research, 12, 1805–1814.
10.1101/gr.631202
CASPubMedWeb of Science®Google Scholar
Bagley MJ, Anderson SL, May B (2001) Choice of methodology for assessing genetic impacts of environmental stressors: polymorphism and reproducibility of RAPD and AFLP fingerprints. Ecotoxicology, 10, 239–244.
10.1023/A:1016625612603
CASPubMedWeb of Science®Google Scholar
Beaumont MA, Nichols RA (1996) Evaluating loci for use in the genetic analysis of population structure. Proceedings of the Royal Society of London, 263, 1619–1626.
10.1098/rspb.1996.0237
Google Scholar
Bellemain E, Swenson JE, Tallmon DA, Brunberg S, Taberlet P (2004) Estimating population size of elusive animals using DNA from hunter-collected feces: comparing four methods for brown bears. Conservation Biology, in press.

Google Scholar
Bellemain E, Taberlet P (2004) Improved non invasive genotyping method: application to brown bear (Ursus arctos) faeces. Molecular Ecology Notes, 4, 519–522.
10.1111/j.1471-8286.2004.00711.x
CASWeb of Science®Google Scholar
Benham J, Jeung JU, Jasieniuk M, Kanazin V, Blake T (1999) Genographer: a graphical tool for automated fluorescent AFLP and microsatellite analysis. Journal of Agricultural Genomics, 4, 399.

Google Scholar
Bradley BJ, Vigilant L (2002) False alleles derived from microbial DNA pose a potential source of error in microsatellite genotyping of DNA from faeces. Molecular Ecology Notes, 2, 602–605.
10.1046/j.1471-8286.2002.00302.x
CASWeb of Science®Google Scholar
Buetow KH (1991) Influence of aberrant observations on high-resolution linkage analysis outcomes. American Journal of Human Genetics, 49, 985–994.

CASPubMedWeb of Science®Google Scholar
Cercueil A, Bellemain E, Manel S (2002) parente: computer program for parentage analysis. Journal of Heredity, 93, 458–459.
10.1093/jhered/93.6.458
CASPubMedWeb of Science®Google Scholar
Constable JL, Ashley MV, Goodall J, Pusey AE (2001) Noninvasive paternity assignment in Gombe chimpanzees. Molecular Ecology, 10, 1279–1300.
10.1046/j.1365-294X.2001.01262.x
CASPubMedWeb of Science®Google Scholar
Creel S, Spong G, Sands JL et al. (2003) Population size estimation in Yellowstone wolves with error-prone noninvasive microsatellite genotypes. Molecular Ecology, 12, 2003–2009.
10.1046/j.1365-294X.2003.01868.x
PubMedWeb of Science®Google Scholar
Davison A, Chiba S (2003) Laboratory temperature variation is a previously unrecognized source of genotyping error during capillary electrophoresis. Molecular Ecology Notes, 3, 321–323.
10.1046/j.1471-8286.2003.00418.x
CASWeb of Science®Google Scholar
Delmotte F, Leterme N, Simon JC (2001) Microsatellite allele sizing: difference between automated capillary electrophoresis and manual technique. Biotechniques, 31, 810, 814–816, 818.

CASPubMedWeb of Science®Google Scholar
Douglas JA, Skol AD, Boehnke M (2002) Probability of detection of genotyping errors and mutations as inheritance inconsistencies in nuclear-family data. American Journal of Human Genetics, 70, 487–495.
10.1086/338919
CASPubMedWeb of Science®Google Scholar
Duchesne P, Godbout MH, Bernatchez L (2002) papa (package for the analysis of parental allocation): a computer program for simulated and real parental allocation. Molecular Ecology Notes, 2, 191–193.
10.1046/j.1471-8286.2002.00164.x
CASWeb of Science®Google Scholar
Dyer AT, Leonard KJ (2000) Contamination, error, and nonspecific molecular tools. Phytopathology, 90, 565–567.
10.1094/PHYTO.2000.90.6.565
CASPubMedWeb of Science®Google Scholar
Ewen KR, Bahlo M, Treloar SA et al. (2000) Identification and analysis of error types in high-throughput genotyping. American Journal of Human Genetics, 67, 727–736.
10.1086/303048
CASPubMedWeb of Science®Google Scholar
Fernando P, Evans BJ, Morales JC, Melnick DJ (2001) Electrophoresis artifacts — a previously unrecognized cause of error in microsatellite analysis. Molecular Ecology Notes, 1, 325–328.
10.1046/j.1471-8278.2001.00083.x
CASWeb of Science®Google Scholar
Gagneux P, Woodruff DS, Boesch C (1997a) Furtive mating in female chimpanzees. Nature, 387, 358–359.
10.1038/387358a0
CASPubMedWeb of Science®Google Scholar
Gagneux P, Boesch C, Woodruff DS (1997b) Microsatellite scoring errors associated with noninvasive genotyping based on nuclear DNA amplified from shed hair. Molecular Ecology, 6, 861–868.
10.1111/j.1365-294X.1997.tb00140.x
CASPubMedWeb of Science®Google Scholar
Gaudeul M, Taberlet P, Till-Bottraud I (2000) Genetic diversity in an endangered alpine plant, Eryngium alpinum L. (Apiaceae), inferred from amplified fragment length polymorphism markers. Molecular Ecology, 9, 1625–1637.
10.1046/j.1365-294x.2000.01063.x
CASPubMedWeb of Science®Google Scholar
Gomes I, Collins A, Lonjou C et al. (1999) Hardy–Weinberg quality control. Annals of Human Genetics, 63, 535–538.
10.1046/j.1469-1809.1999.6360535.x
CASPubMedWeb of Science®Google Scholar
Goossens B, Waits LP, Taberlet P (1998) Plucked hair samples as a source of DNA: reliability of dinucleotide microsatellite genotyping. Molecular Ecology, 7, 1237–1241.
10.1046/j.1365-294x.1998.00407.x
CASPubMedWeb of Science®Google Scholar
Gordon D, Finch SJ, Nothnagel M, Ott J (2002) Power and sample size calculations for case–control genetic association tests when errors are present: application to single nucleotide polymorphisms. Human Heredity, 54, 22–33.
10.1159/000066696
PubMedWeb of Science®Google Scholar
Hackett CA, Broadfoot LB (2003) Effects of genotyping errors, missing values and segregation distortion in molecular marker data on the construction of linkage maps. Heredity, 90, 33–38.
10.1038/sj.hdy.6800173
CASPubMedWeb of Science®Google Scholar
Hansen M, Kraft T, Christiansson M, Nilsson NO (1999) Evaluation of AFLP in Beta. Theoretical and Applied Genetics, 98, 845–852.
10.1007/s001220051143
CASWeb of Science®Google Scholar
Hofreiter M, Serre D, Poinar HN, Kuch M, Paabo S (2001) Ancient DNA. Nature Reviews Genetics, 2, 353–359.
10.1038/35072071
CASPubMedWeb of Science®Google Scholar
Jeffery KJ, Keller LF, Arcese P, Bruford MW (2001) The development of microsatellite loci in the song sparrow, Melospiza melodia (Aves), and genotyping errors associated with good quality DNA. Molecular Ecology Notes, 1, 11–13.
10.1046/j.1471-8278.2000.00005.x
CASWeb of Science®Google Scholar
Jones CJ, Edwards KJ, Castaglione S et al. (1997) Reproducibility testing of RAPD, AFLP and SSR markers in plants by a network of European laboratories. Molecular Breeding, 3, 381–390.
10.1023/A:1009612517139
CASPubMedWeb of Science®Google Scholar
Kauer M, Dieringer D, Schlotterer C (2003) A microsatellite variability screen for positive selection associated with the ‘Out of Africa’ habitat expansion of Drosophila melanogaster. Genetics, 165, 1137–1148.
10.1093/genetics/165.3.1137
CASPubMedWeb of Science®Google Scholar
Kennedy GC, Matsuzaki H, Dong S et al. (2003) Large-scale genotyping of complex DNA. Nature Biotechnology, 21, 1233–1237.
10.1038/nbt869
CASPubMedWeb of Science®Google Scholar
Koonjul PK, Brandt WF, Farrant JM, Lindsey GG (1999) Inclusion of polyvinylpyrrolidone in the polymerase chain reaction reverses the inhibitory effects of polyphenolic contamination of RNA. Nucleic Acids Research, 27, 915–916.
10.1093/nar/27.3.915
CASPubMedWeb of Science®Google Scholar
Lincoln SE, Lander ES (1992) Systematic detection of errors in genetic linkage data. Genomics, 14, 604–610.
10.1016/S0888-7543(05)80158-2
CASPubMedWeb of Science®Google Scholar
Matthes MC, Daly A, Edwards KJ (1998) Amplified length polymorphism (AFLP). In: Molecular Tools for Screening Biodiversity: Plants and Animals (eds A Karp, PG Isaac, D Ingram S ), pp. 183–192. Chapman & Hall, London.
10.1007/978-94-009-0019-6_36
Google Scholar
McKelvey KS, Schwartz MK (2004) Genetic errors associated with population estimation using non-invasive molecular tagging: problems and new solutions. Journal of Wildlife Management, 68, 439–448.
10.2193/0022-541X(2004)068[0439:GEAWPE]2.0.CO;2
Web of Science®Google Scholar
Miller CR, Joyce P, Waits LP (2002) Assessing allelic dropout and genotype reliability using maximum likelihood. Genetics, 160, 357–366.
10.1093/genetics/160.1.357
PubMedWeb of Science®Google Scholar
Mitchell AA, Cutler DJ, Chakravarti A (2003) Undetected genotyping errors cause apparent overtransmission of common alleles in the transmission/disequilibrium test. American Journal of Human Genetics, 72, 598–610.
10.1086/368203
CASPubMedWeb of Science®Google Scholar
Mowat G, Paetkau D (2002) Estimating marten Martes americana population size using hair capture and genetic tagging. Wildlife Biology, 8, 201–209.
10.2981/wlb.2002.034
Web of Science®Google Scholar
O'Hanlon PC, Peakall R (2000) A simple method for the detection of size homoplasy among amplified fragment length polymorphism fragments. Molecular Ecology, 9, 815–816.
10.1046/j.1365-294x.2000.00924.x
CASPubMedWeb of Science®Google Scholar
Paetkau D (2003) An empirical exploration of data quality in DNA-based population inventories. Molecular Ecology, 12, 1375–1387.
10.1046/j.1365-294X.2003.01820.x
CASPubMedWeb of Science®Google Scholar
Paetkau D, Calvert W, Stirling I, Strobeck C (1995) Microsatellite analysis structure of population structure in Canadian polar bears. Molecular Ecology, 4, 347–354.
10.1111/j.1365-294X.1995.tb00227.x
CASPubMedWeb of Science®Google Scholar
Paetkau D, Strobeck C (1994) Microsatellite analysis of genetic variation in black bear populations. Molecular Ecology, 3, 489–495.
10.1111/j.1365-294X.1994.tb00127.x
CASPubMedWeb of Science®Google Scholar
Papa R, Troggio M, Ajmone-Marsan P, Nonnis Marzano F (2004) An improved protocol for the production of AFLP markers in complex genomes by means of capillary electrophoresis. Journal of Animal Breeding and Genetics, in press.

Google Scholar
Pigott M, Bellemain E, Taberlet P, Taylor A (2004) A multiplex pre-amplification method that significantly improves microsatellite amplification and error rates for faecal DNA in limiting conditions. Conservation Genetics, 5, 417–420.
10.1023/B:COGE.0000031138.67958.44
Web of Science®Google Scholar
Polisky B, Greene P, Garfin DE et al. (1975) Specificity of substrate recognition by the EcoRI restriction endonuclease. Proceedings of the National Academy of Sciences USA, 72, 3310–3314.
10.1073/pnas.72.9.3310
CASPubMedWeb of Science®Google Scholar
Rodriguez S, Visedo G, Zapata C (2001) Detection of errors in dinucleotide repeat typing by nondenaturing electrophoresis. Electrophoresis, 22, 2656–2664.
10.1002/1522-2683(200108)22:13<2656::AID-ELPS2656>3.0.CO;2-6
CASPubMedWeb of Science®Google Scholar
Savelkoul PH, Aarts HJ, De Haas J et al. (1999) Amplified-fragment length polymorphism analysis: the state of an art. Journal of Clinical Microbiology, 37, 3083–3091.
10.1128/JCM.37.10.3083-3091.1999
CASPubMedWeb of Science®Google Scholar
Segovia-Lerma A, Cantrell RG, Conway JM, Ray IM (2003) AFLP-based assessment of genetic diversity among nine alfalfa germplasms using bulk DNA templates. Genome, 46, 51–58.
10.1139/g02-100
CASPubMedWeb of Science®Google Scholar
Smith JR, Carpten JD, Brownstein MJ et al. (1995) Approach to genotyping errors caused by nontemplated nucleotide addition by Taq DNA-polymerase. Genome Research, 5, 312–317.
10.1101/gr.5.3.312
CASPubMedWeb of Science®Google Scholar
Sobel E, Papp JC, Lange K (2002) Detection and integration of genotyping errors in statistical genetics. American Journal of Human Genetics, 70, 496–508.
10.1086/338920
PubMedWeb of Science®Google Scholar
Swenson JE, Sandegren F, Bjärvall A, Wabakken P (1998) Living with success: research needs for an expanding brown bear population. Ursus, 10, 17–23.

Web of Science®Google Scholar
Taberlet P, Camarra JJ, Griffin S et al. (1997) Noninvasive genetic tracking of the endangered Pyrenean brown bear population. Molecular Ecology, 6, 869–876.
10.1111/j.1365-294X.1997.tb00141.x
CASPubMedWeb of Science®Google Scholar
Taberlet P, Griffin S, Goossens B et al. (1996) Reliable genotyping of samples with very low DNA quantities using PCR. Nucleic Acids Research, 24, 3189–3194.
10.2307/1940795
CASPubMedWeb of Science®Google Scholar
Taberlet P, Luikart G (1999) Non-invasive genetic sampling and individual identification. Biological Journal of the Linnean Society, 68, 41–55.
10.1111/j.1095-8312.1999.tb01157.x
Web of Science®Google Scholar
Taberlet P, Waits LP, Luikart G (1999) Noninvasive genetic sampling: look before you leap. Trends in Ecology and Evolution, 14, 323–327.
10.1016/S0169-5347(99)01637-7
CASPubMedWeb of Science®Google Scholar
Valière N (2002) gimlet: a computer program for analysing genetic individual identification data. Molecular Ecology Notes, 2, 377–379.
10.1046/j.1471-8286.2002.00228.x-i2
CASPubMedGoogle Scholar
Valière N, Berthier P, Mouchiroud D, Pontier D (2002) gemini: software for testing the effects of genotyping errors and multitubes approach for individual identification. Molecular Ecology Notes, 2, 83–86.
10.1046/j.1471-8286.2002.00134.x
CASWeb of Science®Google Scholar
Vekemans X, Beauwens T, Lemaire M, Roldan-Ruiz I (2002) Data from amplified fragment length polymorphism (AFLP) markers show indication of size homoplasy and of a relationship between degree of homoplasy and fragment size. Molecular Ecology, 11, 139–151.
10.1046/j.0962-1083.2001.01415.x
CASPubMedWeb of Science®Google Scholar
Vigilant L, Hofreiter M, Siedel H, Boesch C (2001) Paternity and relatedness in wild chimpanzee communities. Proceedings of the National Academy of Sciences USA, 98, 12890–12895.
10.1073/pnas.231320498
CASPubMedWeb of Science®Google Scholar
Vos P, Hogers R, Bleeker M et al. (1995) AFLP: a new technique for DNA fingerprinting. Nucleic Acids Research, 23, 4407–4414.
10.1111/j.1365-2699.2006.01462.x
CASPubMedWeb of Science®Google Scholar
Waits JL, Leberg PL (2000) Biases associated with population estimation using molecular tagging. Animal Conservation, 3, 191–199.
10.1111/j.1469-1795.2000.tb00103.x
Web of Science®Google Scholar
Waits L, Taberlet P, Swenson JE, Sandegren F, Franzen R (2000) Nuclear DNA microsatellite analysis of genetic diversity and gene flow in the Scandinavian brown bear Ursus arctos. Molecular Ecology, 9, 421–431.
10.1046/j.1365-294x.2000.00892.x
CASPubMedWeb of Science®Google Scholar
Wang J (2004) Sibship reconstruction from genetic data with typing errors. Genetics, 166, 1963–1979.
10.1093/genetics/166.4.1963
PubMedWeb of Science®Google Scholar
Wang DG, Fan JB, Siao CJ et al. (1998) Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science, 280, 1077–1082.
10.1126/science.280.5366.1077
CASPubMedWeb of Science®Google Scholar
Xu J, Turner A, Little J, Bleecker ER, Meyers DA (2002) Positive results in association studies are associated with departure from Hardy–Weinberg equilibrium: hint for genotyping error? Human Genetics, 111, 573–574.
10.1007/s00439-002-0819-y
PubMedWeb of Science®Google Scholar
Yoder AD, Delefosse T (2002) The rise and fall and rise of ancient DNA studies. In: Ancient DNA, pp. 9–14. McGraw-Hill/Yearbook of Science and Technology, New York.

Web of Science®Google Scholar

Citing Literature

Volume13, Issue11

November 2004

Pages 3261-3273

How to track and assess genotyping errors in population genetics studies

Abstract

References

Citing Literature

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

How to track and assess genotyping errors in population genetics studies

Abstract

References

Citing Literature

References

Related

Information