Ultra-deep Illumina sequencing accurately identifies MHC class IIb alleles and provides evidence for copy number variation in the guppy (Poecilia reticulata)

Mol Ecol Resour. 2014 Jul;14(4):753-67. doi: 10.1111/1755-0998.12225. Epub 2014 Feb 5.

Abstract

We address the bioinformatic issue of accurately separating amplified genes of the major histocompatibility complex (MHC) from artefacts generated during high-throughput sequencing workflows. We fit observed ultra-deep sequencing depths (hundreds to thousands of sequences per amplicon) of allelic variants to expectations from genetic models of copy number variation (CNV). We provide a simple, accurate and repeatable method for genotyping multigene families, evaluating our method via analyses of 209 b of MHC class IIb exon 2 in guppies (Poecilia reticulata). Genotype repeatability for resequenced individuals (N = 49) was high (100%) within the same sequencing run. However, repeatability dropped to 83.7% between independent runs, either because of lower mean amplicon sequencing depth in the initial run or random PCR effects. This highlights the importance of fully independent replicates. Significant improvements in genotyping accuracy were made by greatly reducing type I genotyping error (i.e. accepting an artefact as a true allele), which may occur when using low-depth allele validation thresholds used by previous methods. Only a small amount (4.9%) of type II error (i.e. rejecting a genuine allele as an artefact) was detected through fully independent sequencing runs. We observed 1-6 alleles per individual, and evidence of sharing of alleles across loci. Variation in the total number of MHC class II loci among individuals, both among and within populations was also observed, and some genotypes appeared to be partially hemizygous; total allelic dosage added up to an odd number of allelic copies. Collectively, observations provide evidence of MHC CNV and its complex basis in natural populations.

Keywords: amplicon; copy number variation; guppy; major histocompatibility complex; multigene family; next-generation sequencing.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • Animals
  • Gene Dosage*
  • Genotyping Techniques / methods*
  • High-Throughput Nucleotide Sequencing
  • Histocompatibility Antigens Class II / genetics*
  • Molecular Sequence Data
  • Multigene Family*
  • Poecilia / genetics*
  • Reproducibility of Results

Substances

  • Histocompatibility Antigens Class II

Associated data

  • GENBANK/KF321642
  • GENBANK/KF321643
  • GENBANK/KF321644
  • GENBANK/KF321645
  • GENBANK/KF321646
  • GENBANK/KF321647
  • GENBANK/KF321648
  • GENBANK/KF321649
  • GENBANK/KF321650
  • GENBANK/KF321651
  • GENBANK/KF321652
  • GENBANK/KF321653
  • GENBANK/KF321654
  • GENBANK/KF321655
  • GENBANK/KF321656
  • GENBANK/KF321657
  • GENBANK/KF321658
  • GENBANK/KF321659
  • GENBANK/KF321660
  • GENBANK/KF321661
  • GENBANK/KF321662
  • GENBANK/KF321663
  • GENBANK/KF321664
  • GENBANK/KF321665
  • GENBANK/KF321666
  • GENBANK/KF321667
  • GENBANK/KF321668
  • GENBANK/KF321669
  • GENBANK/KF321670
  • GENBANK/KF321671
  • GENBANK/KF321672
  • GENBANK/KF321673
  • GENBANK/KF321674
  • GENBANK/KF321675
  • GENBANK/KF321676
  • GENBANK/KF321677
  • GENBANK/KF321678
  • GENBANK/KF321679
  • GENBANK/KF321680
  • GENBANK/KF321681
  • GENBANK/KF321682
  • GENBANK/KF321683
  • GENBANK/KF321684
  • GENBANK/KF321685
  • GENBANK/KF321686
  • GENBANK/KF321687
  • GENBANK/KF321688
  • GENBANK/KF321689
  • GENBANK/KF321690
  • GENBANK/KF321691
  • GENBANK/KF321692
  • GENBANK/KF321693
  • GENBANK/KF321694
  • GENBANK/KF321695
  • GENBANK/KF321696
  • GENBANK/KF321697
  • GENBANK/KF321698
  • GENBANK/KF321699
  • GENBANK/KF321700
  • GENBANK/KF321701
  • GENBANK/KF321702
  • GENBANK/KF321703
  • GENBANK/KF321704
  • GENBANK/KF321705
  • GENBANK/KF321706
  • GENBANK/KF321707
  • GENBANK/KF321708
  • GENBANK/KF321709
  • GENBANK/KF321710
  • GENBANK/KF321711
  • GENBANK/KF321712
  • GENBANK/KF321713
  • GENBANK/KF321714
  • GENBANK/KF321715
  • GENBANK/KF321716
  • GENBANK/KF321717
  • GENBANK/KF321718
  • GENBANK/KF321719
  • GENBANK/KF321720
  • GENBANK/KF321721
  • GENBANK/KF321722
  • GENBANK/KF321723
  • GENBANK/KF321724
  • GENBANK/KF321725
  • GENBANK/KF321726
  • GENBANK/KF321727
  • GENBANK/KF321728