An improved pig reference genome sequence to enable pig genetics and genomics research

Abstract
The domestic pig (Sus scrofa) is important both as a food source and as a biomedical model with high anatomical and immunological similarity to humans. The draft reference genome (Sscrofa10.2) represents a purebred female pig from a commercial pork production breed (Duroc) and was established using older clone-based sequencing methods. The Sscrofa10.2 assembly was incomplete, and its utility was limited by unresolved redundancies, short-range order and orientation errors, and associated misassembled genes. We present two genome assemblies created with more recent long-read technologies and a whole-genome shotgun strategy: one for the same Duroc female (Sscrofa11.1) and one for an outbred, composite-breed male animal commonly used for commercial pork production (USMARCv1.0). Both assemblies are of substantially higher (>90-fold) continuity and accuracy than the earlier reference, and the availability of two independent assemblies provided an opportunity to identify large-scale variants and to check how accurately each assembly represents the genome. We propose that the improved Duroc breed assembly (Sscrofa11.1) become the reference genome for genomic research in pigs.