Genomic screening by 454 pyrosequencing identifies a new human IGHV gene and sixteen other new IGHV allelic variants

Immunogenetics. 2011 May;63(5):259-65. doi: 10.1007/s00251-010-0510-8. Epub 2011 Jan 20.

Abstract

Complete and accurate knowledge of the genes and allelic variants of the human immunoglobulin gene loci is critical for studies of B cell repertoire development and somatic point mutation, but evidence from studies of VDJ rearrangements suggests that our knowledge of the available immunoglobulin gene repertoire is far from complete. The reported repertoire has changed little over the last 15 years. This is, in part, a consequence of the inefficiencies involved in searching for new members of large, multigenic gene families by cloning and sequencing. The advent of high-throughput sequencing provides a new avenue by which the germline repertoire can be explored. In this report, we describe pyrosequencing studies of the heavy chain IGHV1, IGHV3 and IGHV4 gene subgroups in ten Papua New Guineans. Thousands of 454 reads aligned with complete identity to 51 previously reported functional IGHV genes and allelic variants. A new gene, IGHV3-NL1*01, was identified, which differs from the nearest previously reported gene by 15 nucleotides. Sixteen new IGHV alleles were also identified, 15 of which varied from previously reported functional IGHV genes by between one and four nucleotides, while one sequence appears to be a functional variant of the pseudogene IGHV3-25. BLAST searches suggest that at least six of these new genes are carried within the relatively well-studied populations of North America, Europe or Asia. This study substantially expands the known immunoglobulin gene repertoire and demonstrates that genetic variation of immunoglobulin genes can now be efficiently explored in different human populations using high-throughput pyrosequencing.

MeSH terms

  • Alleles
  • Base Sequence
  • Genes, Immunoglobulin Heavy Chain*
  • Genetic Loci
  • Genetic Testing / methods*
  • Genetic Variation
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Immunoglobulin Variable Region / genetics*
  • Molecular Sequence Data
  • Multigene Family
  • Papua New Guinea
  • Sequence Analysis, DNA / methods*

Substances

  • Immunoglobulin Variable Region