Abstract
We present an electronic format for exchanging data for HLA and KIR genotyping with extensions for next-generation sequencing (NGS). This format addresses NGS data exchange by refining the Histoimmunogenetics Markup Language (HML) to conform to the proposed Minimum Information for Reporting Immunogenomic NGS Genotyping (MIRING) reporting guidelines (miring.immunogenomics.org). Our refinements of HML include two major additions. First, NGS is supported by new XML structures to capture additional NGS data and metadata required to produce a genotyping result, including analysis-dependent (dynamic) and method-dependent (static) components. A full genotype, consensus sequence, and the surrounding metadata are included directly, while the raw sequence reads and platform documentation are externally referenced. Second, genotype ambiguity is fully represented by integrating Genotype List Strings, which use a hierarchical set of delimiters to represent allele and genotype ambiguity in a complete and accurate fashion. HML also continues to enable the transmission of legacy methods (e.g. site-specific oligonucleotide, sequence-specific priming, and sequence based typing (SBT)), adding features such as allowing multiple group-specific sequencing primers, and fully leveraging techniques that combine multiple methods to obtain a single result, such as SBT integrated with NGS.
- BRIDG
- Biomedical Research Integrated Domain Group
- CDISC
- Clinical Data Interchange Standards Consortium
- DaSH
- Data Standard Hackathon
- EMBL
- European Molecular Biology Laboratory
- ENA
- European Nucleotide Archive
- FDA
- Food and Drug Administration
- GL
- Genotype List
- HML
- Histoimmunogenetics Markup Language
- HLA
- Human Leucocyte Antigen
- IMGT
- ImMunoGeneTics
- ISO
- International Organization for Standardization
- LSDAM
- Life Sciences Domain Analysis Model
- KIR
- Killer-cell Immunoglobulin-like Receptor
- MHC
- Major Histocompatibility Complex
- MIRING
- Minimum Information for Reporting Immunogenomic NGS Genotyping
- NCI
- National Cancer Institute
- NGS
- Next Generation Sequencing
- NMDP
- National Marrow Donor Program
- OID
- Object Identifier
- SBT
- Sequence Based Typing
- SSO
- Sequence Specific Oligonucleotide
- SSP
- Sequence Specific Primer
- URI
- Uniform Resource Identifier
- XML
- eXtensible Markup Language