An accurate assignment test for extremely low-coverage whole-genome sequence data

Giada Ferrari; Lane M. Atmore; Sissel Jentoft; Kjetill S. Jakobsen; Daniel Makowiecki; James H. Barrett; Bastiaan Star

doi:10.1101/2021.06.04.447098

Abstract

Genomic assignment tests can provide important diagnostic biological characteristics, such as population of origin or ecotype. In ancient DNA research, such characters can provide further information on population continuity, evolution, climate change, species migration, or trade, depending on archaeological context. Yet, assignment tests often rely on moderate- to high-coverage sequence data, which can be difficult to obtain for many ancient specimens and in ecological studies, which often use sequencing techniques such as ddRAD to bypass the need for costly whole-genome sequencing. We have developed a novel approach that efficiently assigns biologically relevant information (such as population identity or structural variants) in extremely low-coverage sequence data. First, we generate databases from existing reference data using a subset of diagnostic Single Nucleotide Polymorphisms (SNPs) associated with a biological characteristic. Low coverage alignment files from ancient specimens are subsequently compared to these databases to ascertain allelic state yielding a joint probability for each association. To assess the efficacy of this approach, we assigned inversion haplotypes and population identity in several species including Heliconius butterflies, Atlantic herring, and Atlantic cod. We used both modern and ancient specimens, including the first whole-genome sequence data recovered from ancient herring bones. The method accurately assigns biological characteristics, including population membership, using extremely low-coverage (e.g. 0.0001x fold) based on genome-wide SNPs. This approach will therefore increase the number of ancient samples in ecological and bioarchaeological research for which relevant biological information can be obtained.

Introduction

Despite advances in methodologies that allow for the recovery of higher yields of endogenous ancient DNA (aDNA) (e.g., Boessenkool et al., 2017; Carpenter et al., 2013; Gamba et al., 2014; Pinhasi et al., 2015), DNA preservation in sub-fossil, archaeological, historical, or degraded biological material remains variable and is often context specific (Ferrari et al., 2021; Keighley et al., 2021; Tin et al., 2014). In order to account for such unpredictability, aDNA sequencing projects typically screen many specimens from which a subset with the best DNA preservation is selected for deeper sequencing (e.g. Martínez-García et al., 2021; Star et al., 2018; van der Valk et al., 2021). Similarly, DNA (organelle) reference databases are increasingly being generated through “genome skimming” sequencing strategies (e.g. Dodsworth, 2015; Marcus, 2021; Nevill et al., 2020; Zeng et al., 2018). These practices result in a proliferation of specimens for which (extremely) sparse genome-wide data is obtained (Bohmann et al., 2020). Due to their low coverage, these data are difficult to jointly analyze with specimens that have obtained higher coverage without introducing various types of statistical bias, particularly when dealing with heterochronous datasets (e.g. François & Jay, 2020; Lee et al., 2010; Patterson et al., 2006; Skoglund et al., 2014). For this reason, specimens with low coverage genome-wide data are typically discarded from further bioinformatic analyses, leading to the destruction of unique zooarchaeological specimens for which no meaningful information is obtained. Efforts to obtain as much relevant information as possible from such specimens are therefore important from a biological and an ethical perspective (Pálsdóttir et al., 2019).

Although high-coverage whole genome data allow a plethora of detailed bioinformatic analyses, high coverage is not necessarily required for the determination of basic biological characteristics. Moreover, depending on archaeological context, knowledge of such characteristics can provide highly relevant information on population continuity, species migration and distributions, hunting, and historic trade and/or burial practices. For instance, the genetic sex of ancient mammals can easily be determined from sparse sequencing data due to its association with extensive genomic differentiation on a chromosomal scale. Sexing has been applied to ancient low-coverage sequences to infer burial practices (Fages et al., 2020; Nistelberger et al., 2019), the impact of historic hunting (Barrett et al., 2020), and the behaviour of extinct species (Pečnerová et al., 2017).

Other relevant biological characteristics may also be associated with large scale genomic differentiation. In particular, structural variants (e.g. chromosomal inversions) have been increasingly identified as major drivers of evolutionary and ecological processes (Wellenreuther & Bernatchez, 2018), playing important roles in population structure and evolution. For instance, inversions are involved in the evolution of sex chromosomes (Hughes et al., 2010; Lemaitre et al., 2009) and speciation (Noor et al., 2001), and are critical for within-species adaptation to local environments (Ayala et al., 2013; Barth et al., 2017; Jones et al., 2012; Leitwein et al., 2017; Lowry & Willis, 2010; Morales et al., 2019; Nadeau et al., 2016; Pettersson et al., 2019; Todesco et al., 2020; Twyford & Friedman, 2015). Chromosomal inversions can affect megabase sized genomic regions (e.g. P. R. Berg et al., 2017; Fang et al., 2012; Twyford & Friedman, 2015), and are often characterized by high levels of linkage disequilibrium (LD) (Hoffmann & Rieseberg, 2008) due to inhibited recombination between non-collinear inversion haplotypes. Thus, genotyping of such haplotypes using a subset of segregating genetic markers is feasible using whole genome sequencing data (Donnelly et al., 2010; Salm et al., 2012).

Several methods have been developed for assigning inversion haplotypes in order to facilitate GWAS analysis for SNPs within inversions in the human genome (scoreInvHap, Ruiz-Arenas et al., 2019; pfido Salm et al., 2012; InvClust, Cáceres & González, 2015; inveRsion, Cáceres et al., 2012). These methods rely on LD break-points and structural variation (e.g. InvClust, inveRsion, and scoreInvHap, as well as methods proposed by (Bansal et al., 2007; Sindi & Raphael, 2010) or haplotype tagging (inveRsion) to identify inversion sites and then conduct various types of SNP calling within those sites. All of these methods are specifically developed for identifying inversions in human genomes (e.g., Ma & Amos, 2012) and their use in disease- and other phenotype-association studies (Ruiz-Arenas et al., 2019; Salm et al., 2012). They have not been tested with sparse genomic data and are specific to use with inversions; indeed, pfido was designed for just one inversion in the human genome (Salm et al., 2012). Because of their reliance on signatures of structural variation, they cannot be applied to other types of variation, such as genome-wide population differentiation. There is currently no approach specifically designed to classify extremely low-coverage data with a broad applicability to score different types of large-scale genomic differentiation in a range of species.

Here, we developed a new method that allows efficient assignment of different biological characteristics using extremely low-coverage sequence data. The approach is similar to scoreInvHap (Ruiz-Arenas et al., 2019) in that scoring is a two-step process, yet there are some key differences. First, a database is created with the allele frequency association of individual SNP loci with a specific biological character (e.g. an inversion type or population membership). These databases are based on moderate- to high-coverage sequences of a subset of specimens (Figure 1a). Second, sequence alignment data of (ancient) specimens are compared to this database and a joint probability (e.g. see Star et al., 2017) is calculated based on the binomial distribution of their frequency association (Figure 1b). Importantly, in contrast to earlier approaches, this probability calculation does not make any assumptions regarding specific signatures of structural variation and can therefore be applied to different types of genetic differentiation. This includes differentiation between inversion haplotypes or genome-wide differences associated with ecotype or population structure. Our program depends solely on freely available, commonly used software and file formats and is freely available for download at: https://github.com/laneatmore/BAMscorer.

Figure 1. The BAMscorer pipeline.

The BAMscorer pipeline has two main modules -- reference database creation and alignment file scoring. a) sequence data must be pre-processed and input into the pipeline as a VCF file. smartPCA is used to generate eigenvalues and SNP loading weights, which are then used to assign haplotypes in the reference dataset and create a database of highly-divergent loci in a given region of interest. b) These positions are called from the alignment files to be scored. The positions are then compared to the database for allelic similarity. The likelihood of a given allele at a locus belonging to a haplotype is coded as the frequency of that allele at the locus in each database. AB allele frequencies are calculated as the average of frequencies present in AA and BB haplotypes. A joint probability is estimated for each alignment file belonging to each of the three haplotypes (for genome-wide assignment only AA and BB are used) and these values are scaled to one, outputting a probability index of genomic assignment for each individual.

We investigated the efficiency of our approach in assigning haplotypes for three chromosomal inversions in species that differ in their availability of reference specimens (P3, Heliconius numata, n = 20; Chr12, Clupea harengus, n = 19, and LG01, Gadus morhua, n = 276). These inversions display clinal distributions that are associated with biological characters such as wing-pattern phenotypes (Joron et al., 2006, 2011; Nadeau, 2016), adaptation to water temperature and salinity (Pettersson et al., 2019) and migratory behaviour (Paul R. Berg et al., 2016). Finally, we investigated the accuracy of this approach for the genome-wide population assignment of western and eastern Atlantic cod specimens (Barth et al., 2019; Pinsky et al., 2021).

Materials and methods

Ancient DNA extraction and sequencing

Nine Atlantic herring bones from two sites, dated between the 9th and 15th century CE (Table S1), were UV-treated for 10 minutes per side and cleaned with ultra-pure water. DNA was extracted including a pre-digestion step, following Damgaard et al. (2015). 10-40 mg of bone were pulverized with micro pestles in the digestion buffer (1 ml 0.5 M EDTA, 0.5 mg/ml proteinase K and 0.5% N-Laurylsarcosine). Following overnight digestion, DNA was extracted with 9 volumes of a 3:2 mixture of QG buffer (QIAGEN) and isopropanol. MinElute purification was carried out using the QIAvac 24 Plus vacuum manifold system (QIAGEN) in a final elution volume of 65 μl. Parallel non-template controls were included. Single-indexed blunt-end sequencing libraries were built from 16 μl of DNA extract or non-template extraction blank, following the single-tube (BEST) protocol (Carøe et al., 2018) with the modifications described in Mak et al. (2017). All laboratory protocols up to indexing of sequencing libraries were carried out in a dedicated aDNA clean laboratory at the University of Oslo following standard anti-contamination and authentication protocols (Cooper & Poinar, 2000; Gilbert et al., 2005; Llamas et al., 2017). Library quality and concentration were inspected with a High Sensitivity DNA Assay on the Bioanalyzer 2100 (Agilent) and sequenced on an Illumina HiSeq 2500 platform at the Norwegian Sequencing Centre with paired-end 125 bp reads, demultiplexed allowing zero mismatches in the index tag.

Data processing

For each species investigated, two different datasets were used. The first dataset was used to create the reference SNPs database and the second dataset, containing different individuals, was scored utilizing the BAMscorer program. All the datasets used in this manuscript —including the newly generated archaeological Atlantic herring data— are publicly available.

Heliconius butterflies

The heliconius reference database was created using a set of 20 individuals from various H. numata subspecies described in Nadeau et al. (2016). A set of 40 unrelated individuals to be scored were obtained from Jay et al. (2019). Both datasets were aligned to the Hmel2.5 reference assembly (http://ensembl.lepbase.org) using PALEOMIX v.1.2.13 (Schubert et al., 2014) with BWA-mem. Genotypes for the reference database were called using the GATK4 pipeline (Van der Auwera & O’Connor, 2020) and the following filtering parameters: FS<60.0 && SOR<4 && MQ>30.0 && QD > && INFO/DP<5500, SnpGap 10, minGQ 15 minDP 3, maf 0.001, with indels removed and biallelic variants selected. The P3 inversion at the supergene P mimicry locus, located on chromosome 15 and associated with wing pattern types in Heliconius numata subspecies (Jay et al., 2018; Joron et al., 2011), was investigated. The ~1.1Mb P3 inversion is found on scaffold Hmel215003o (between 2000001-3100000bp).

Atlantic herring

The Atlantic herring reference database was created using a set of 21 individuals described in Han et al. (2020), representing all but one of the major herring populations in the eastern Atlantic. The nine ancient Atlantic herring specimens dating from the 9th-15th century in Poland (Domagała & Franczuk, 1992; Iwaszkiewicz, 1991; Makowiecki, 2003; Makowiecki et al., 2016) were scored. Modern herring reads were aligned to the Atlantic herring reference genome (GCA_900700415.1) (Pettersson et al., 2019) as above. Ancient herring reads were aligned as described in Ferrari et al. (2021), using BWA-backtrack. Genotypes for the reference database were called and filtered as described above. Two individual outliers were observed and checked for relatedness using KING (Manichaikul et al., 2010, Supplementary Note 1). These individuals appeared to be duplicates and were removed from the dataset. An ~8Mb inversion on chromosome 12 was investigated, which is associated with different Atlantic herring ecotypes (Pettersson et al., 2019). The inversion is located at chr12:17900000-25600000bp. Ancient herring alignment files were downsampled to 100K reads. Most specimens have excellent DNA preservation (Supplementary Table 1) and all show the typical aDNA fragmentation and misincorporation patterns of authentic ancient DNA data (Supplementary Figure 1).

Atlantic cod

The Atlantic cod reference dataset was created using 276 Atlantic cod individuals representing most major geographical locations (western Atlantic, eastern Atlantic, and Baltic sea) in the species’ range (Barth et al., 2019; Pinsky et al., 2021). A dataset of 15 unrelated archaeological specimens were obtained from Star et al. (2017). Modern and ancient reads were aligned to the gadMor2 reference genome (Star et al., 2011; Tørresen et al., 2017) as above. Genotype calling and filtering for the reference was performed as described in Barth et al. (2019) using the GATK haplotype caller v.3.4.46 (McKenna et al., 2010), bcftools v.1.3 (Li, 2011), VCFtools v.0.1.14 (Danecek et al., 2011). An ~16Mb double chromosomal inversion on LG01 which is associated with differences in migratory behaviour (Paul R. Berg et al., 2016; Kirubakaran et al., 2016; Sodeland et al., 2016) was investigated. This inversion is located at LG01:9100000-26200000. Finally, genome-wide data separating 24 western Atlantic from 252 eastern Atlantic cod specimens was analysed excluding the location of four major inversions (on LG01, LG02, LG07, and LG12) following Star et al. (2017).

Analyses

The BAMscorer pipeline operates as follows:

Module 1: Creation of SNP reference databases (Figure 1a)

The initial step of the BAMscorer pipeline is to create a reference database of divergent SNPs associated with each haplotype or population in a set of focal individuals. These divergent SNPs are referred to as “AA” and “BB” haplotypes/groups. SNP databases are created as follows:

The VCF file is first prepared with VCFtools v.0.1.16 (Danecek et al., 2011) and PLINK v.1.9 (Purcell et al., 2007), selecting only those regions of interest (i.e. where inversions are located or genome-wide).
A Principal Component Analysis (PCA) is run as implemented in smartPCA (EIGENSOFT v.7.2.1, Patterson et al., 2006; Price et al., 2006) to calculate axes of differentiation and individual SNP loadings between inversion haplotypes or populations. As a default, the BAMscorer pipeline selects diagnostic loci in the top and bottom 5% of the SNP loading distribution, although the optimal SNP loading cut-off value should be determined by the user. Visualization of the SNP-loading profile can help decide such cut-offs (see further below).
SNPs that pass cut-off filters form the divergent SNPs database for each haplotype or population. To help make relevant selection of individuals, heterozygosity is calculated per individual based on SNPs in the divergent database.
Individuals from the VCF file are scored for PC1 and heterozygosity values and manually classified into types: homozygous haplotypes AA and BB, and, if applicable — i.e. in the case of inversions —, heterozygous AB. Inversion haplotypes are known to fall into specific clusters in PCA analysis (see Figure 1a), which allow for easy identification using separation on PC1 and assessing levels of individual heterozygosity.
For individuals in AA and BB haplotypes, allele frequencies of the divergent SNPs are calculated. Two databases are created, containing the allelic state (e.g. A, C, G, T) and allele frequencies of the major (first database) and minor (second database) alleles in the AA and BB haplotypes. Databases containing few individuals can contain fixed alleles due to limited sampling. This uncertainty in sampling fixed alleles is addressed by calculating an expected frequency of (1/((2*N)+ 1)) where N is the number of individuals in the reference database for fixed alleles in the region of interest. When scoring inversions, allele frequencies for AB haplotypes are averaged to approximate the probability of observing a random set of alleles coming from either AA or BB haplotype

Once optimal database parameters have been identified (a full list of parameters can be found at https://github.com/laneatmore/BAMscorer), the SNP database can be reused for BAM scoring on many different datasets of the same species.

Module 2: BAM scoring (Figure 1b)

The divergent SNPs databases are used to score alignment files (BAM format) for a given set of (low-coverage) individuals. For each locus in the divergent SNPs database, matching reads are pulled from the BAM file using the python module pysam (https://github.com/pysam-developers/pysam). Allelic state is determined based on the most highly-represented allele in all reads for each position. In the event that there are equal numbers of reads for multiple alleles at a given locus, one allele is then chosen at random. This process provides a subset of observed alleles at divergent loci in each inversion or population for each individual BAM file.
The probability of observing an SNP variant associated with inversion haplotype or populations is based on allele frequencies of matching positions in the reference databases, e.g. if the position in the BAM file matches the dominant allele in haplotype AA, the probability for that locus in the BAM file is coded as the allele frequency of the dominant allele in haplotype AA. For each position, three probabilities are recorded -- the frequency of that allele in haplotypes AA, BB, and AB (only AA and BB for genome-wide analysis).
Joint probabilities of all observed alleles belonging to a particular haplotype or population are calculated for each individual using the following equation: Whereby the probability (p) of the scored individual (i) and genotype (g) is the product of allele frequencies (f) of the number (n) of observed SNP loci (l) in each database.
Finally, the joint probability scores for all genotypes are scaled to one to provide a final probability estimate of an individual belonging to a certain haplotype or population. We also report the number of SNPs in the reference dataset that were recovered from each individual BAM file to provide information on how well scored a specific individual is.

Assessing scoring certainty

To investigate the sensitivity of the BAMscorer pipeline, we downsampled each BAM file in the three datasets (Heliconius, Jay et al. 2019, Atlantic herring and Atlantic cod, Star et al. 2017). Following an approach described in Nistelberger et al. (2019), BAM files containing whole genome shotgun data were downsampled to contain between 1K and 40K reads (in most instances this is a mere fraction of the available data). At each read interval, and for each individual, the down-sampling was randomly iterated 20 times. We compared accuracy of the scoring results of the extremely down-sampled Heliconius data by comparing results to an independent PCA analysis of the complete dataset (Supplementary Figure 2). For Atlantic herring and Atlantic cod accuracy of results was confirmed by prior knowledge of the inversion haplotypes or geographic origin of specimens.

Results

We investigated three chromosomal inversions and one genome-wide analysis using BAMscorer. The Heliconius P3 inversion is the smallest (1.1Mb) inversion, followed by the Atlantic herring Chr12 (8Mb) and Atlantic cod LG01 inversion (16Mb, Table 1). Principal Component Analysis (PCA) as implemented through BAMscorer -select_snps separates the three main inversion genotypes along PC1 for the Heliconius P3, Atlantic herring Chr12 and Atlantic cod LG01 datasets (Figure 2a), reproducing earlier observations (Barth et al., 2019; Han et al., 2020; Nadeau et al., 2016; Pinsky et al., 2021). Similarly, the whole genome analysis separates western from eastern Atlantic cod specimens along PC1 (Figure 2a, Pinksy et al. 2021). For the data analysed here, BAMscorer -select_snps typically runs within 15 minutes. The SNP weight loading distribution underlying genetic divergence between inversion haplotypes of populations is either approximately symmetrical (e.g. Heliconius or Atlantic herring) or asymmetrical (Atlantic cod, Figure 2b). SNP weights are proportional to the correlation (across samples) between each SNP and each PC (Patterson et al., 2006; Price et al., 2006). SNPs that are strongly associated with divergence will have the highest SNP weight loading values and are therefore biologically informative.

View this table:

Table 1. Inversion and genome characteristics of Heliconius, Atlantic herring and Atlantic cod.

Each comparison differs in terms of size of inversion, overall genome size and relative size of inversion in terms of species-specific genome size, as well as in terms of the optimum number of divergent SNPs (see methods) and individuals used for the reference databases and scoring.

Figure 2. Inversion and population assignment for H. numata (P3 inversion), C. harengus (Chr12 inversion) and G. morhua (LG01 inversion, population differentiation) using extreme low coverage data.

a) Inversion and population PCA plots generated for the three species (silhouettes) using smartPCA (ref). The number of individuals (red) and the genome-wide Fst differentiation for G. morhua (red arrow) is indicated. b) SNPs most associated with either inversion (A or B) haplotype or large-scale population differentiation (western or eastern Atlantic) are selected based on their SNP weight loading distribution along PC1. Those with lowest and highest loadings are most associated with differentiation along PC1. SNP weight indicates the percentage of SNPs selected from the most extreme end(s) of the distribution (red). c) Assignment probability for individual specimens generated by down-sampling BAM files 1000 to 40 000 reads. At each interval, and for each individual, the down-sampling is iterated 20 times in order to generate box plots. Probabilities are calculated based on the joint binomial distribution of observing divergent SNPs associated with either genotype or population. Also indicated is the number of individuals scored (red, note these are not the same individuals used to create the original databases) and fold coverage (red dotted line, x coverage) at which more than 0.99 median assignment probability is obtained.

An important consideration of our approach therefore lies in the selection of loci based on their SNP-loading distribution patterns. In order to maximize the probability of observing loci in low-coverage sequencing data, as many loci as possible should be included in the database. Yet adding those loci that are not significantly associated with either inversion haplotype or specific population will add noise and uncertainty. We therefore tested the accuracy of our approach using a range of SNP-loading filtering parameters. For inversions, databases were created using cut-off values between 1 and 25%, depending on the species under investigation. For our genome-wide analyses, we set the SNP-loading cut-off weights between 1 and 5%. The default parameter in the BAMscorer pipeline is to take symmetrical portions from each side of the SNP-loading distribution (the 5% cut-off value, therefore, takes the top and bottom 5% of SNPs), yet we also noticed asymmetrical SNP-loading distribution values. We therefore also investigated the effect of selecting SNPs from either the top or bottom of the SNP-loading distribution.

For instance, for the Heliconius P3 inversion, the ability to confidently score heterozygous individuals (Jay et al. 2019) erodes with increasing SNP weight values (Figure 3), and the optimal cut-off to simultaneously score all possible genotypes lies at 2% and 1701 SNPs. For Atlantic herring Chr12, not all haplotypes are observed in the ancient read data, yet no major increase in ability of scoring is obtained after a SNP weight of 10% and 28205 SNPs (Supplementary Figure 3). For Atlantic cod, best separation of ancient data (Star et al., 2017) was obtained by selecting SNPs from the single, most extreme end of the SNP weight loading distribution (Figure 2B, Supplementary Figures 4-6). For Atlantic cod LG01, SNP selection is similar to the Heliconius P3 in that the optimal cut-off is a trade-off in scoring homozygotes and heterozygotes, which for cod lies at 15% and 47564 SNPs (Supplementary Figure 5). Finally, best population separation for Atlantic cod is obtained at 5% and 221790 SNPs (Supplementary Figure 6).

Figure 3. SNP selection by varying SNP weight in H. numata (P3 inversion).

SNP weight is here defined as the percentage of SNPs with the most extreme values at both sides of the SNP loading distribution. Confidence in probability assignment is obtained by down-sampling BAM files 1000 to 40 000 reads. At each interval, and for each individual, the down-sampling is iterated 20 times in order to generate box plots. Probabilities are calculated based on the joint binomial distribution of observing divergent SNPs associated with either genotype. Also indicated is the number of individuals (n, red) and number of SNPs (SNPn, red) and the chosen cut-off value (red dotted lines) at which all three genotypes can be efficiently separated.

After deciding the best-possible cut-off values, several observations can be made regarding the scoring accuracy of BAMscorer -score_bams depending on the number of reads for each of the comparisons. First, accurate scoring is obtained in extremely low coverage data for all comparisons (Figure 2c). For Heliconius, accurate genotype determination was obtained with 20K reads and 0.009x fold nuclear coverage. For all other comparisons, even less reads —by an order of magnitude— were required. Second, the scoring accuracy of heterozygote genotypes requires more reads compared to homozygous genotypes (e.g. see Heliconius P3 and Atlantic cod LG01, Figure 2c). Thus, depending on the type of haplotype of the sample, different levels of accuracy are obtained. Third, an increase in scoring accuracy at lower numbers of reads is observed for those comparisons for which more SNPs could be obtained (Table1, Figure 2c). Best scoring accuracy is obtained for the population comparison of Atlantic cod, for which population of origin can be determined with 1000 reads or less than 0.0001x fold nuclear coverage (Figure 2c). Finally, BAMscorer -score_bams takes—on average— less than five minutes to complete each comparison.

Discussion

The BAMscorer program allows genomic assignment on extremely low-coverage sequence data, thereby greatly increasing the capacity for conducting population genomics analysis on poor-quality data. Not only will this expand the amount of information that can be gleaned from such sequences, but it will also reduce waste in the aDNA research pipeline. Sequence data with coverage as low as 0.009x is often discarded as there is little usable information that can be recovered from such poor-quality data. Applying our method will allow these sequences to be used, both reducing waste in the laboratory, and providing a higher degree of confidence that usable information can be recovered when conducting destructive sampling. The method is, additionally, quite fast and can be applied to large quantities of data at one time, providing an efficient overview of the biological characteristics of a large dataset. Such analysis can provide crucial information on population of origin, past trade routes and/or migration patterns, and on species’ ecology and evolution, depending on research question and context.

Additionally, BAMscorer shows promise for application to disciplines outside or adjacent to the field of ancient DNA (e.g. Bohmann et al., 2020). As the underlying methodology is generalistic in design, it is not specific to use with ancient samples. It could therefore be applied to such sequence data as ddRAD (Peterson et al., 2012) or hyRAD (Suchan et al., 2016), two common methods for cost-efficient sequencing used in ecology and evolution studies for modern and historic specimens, respectively. The capacity to quickly identify population of origin, determine between domestic and wild types, assess ecotype distribution, and to identify hybrids could be an incredibly useful tool in the fields of wildlife forensics and conservation genomics.

Our results show that there are several things to take into account when applying the BAMscorer program. Each of our three species showed differentiation in filtering parameters, such as minimum required reads and SNP loading weight cut-off value. We further saw differences in these parameters between cod when looking at the inversion on LG01 as opposed to the genome-wide data. This implies that an understanding of the biological system in question is important for assessing the efficacy of the BAMscorer program. It is further recommended that users explore the filtering parameters as we have done above to ascertain the appropriate parameters for their biological system. The BAMscorer program is further unable to identify de novo inversions and is reliant on existing reference data to create the database from which alignment files are scored.

We have here introduced a novel software program that can be used to increase the information gleaned from extremely low-coverage sequence data. We have found that biological characteristics and genomic assignment can be recovered from sequences with as little as 1000 aligned reads (at ~0.0001x coverage in the case of Atlantic cod). The method is flexible and can be used on various types of genomic data. It is further scalable for BAM files from 1K to 50M reads and can handle up to hundreds of thousands of SNPs without sacrificing computational efficiency. We have shown that it can differentiate between subspecies, ecotypes, and genomic inversions. We expect this approach to be widely applicable in the fields of ancient DNA, conservation genomics and wildlife forensics (Ogden, 2011; Runa & Harbison, 2021).

Data availability

Reference data for all species have been publicly released earlier and are available from the European Nucleotide Archive (ENA) with the following accession numbers: Heliconius; PRJEB12740 and PRJEB40136, Atlantic herring; PRJNA642736, Atlantic cod; PRJEB29231 and PRJEB41431. The nine ancient Atlantic herring sequences are available at ENA under accession number PRJEB45393.

Program availability

The full software package is available for download at: https://github.com/laneatmore/BAMscorer

Author Contributions

G.F., L.M.A., and BS wrote the manuscript. B.S. conceived of the project. G.F., L.M.A., and B.S. developed the method. L.M.A. wrote the code for the software program. G.F., L.M.A., and B.S. conducted data analysis and visualization. J.H.B. and D.M. provided archaeological material for sequencing. S.J. and K.S.J provided early access genomic sequence data. All authors have read and approved the manuscript.

Acknowledgements

We thank A. Gondek-Wyrozemska for processing the ancient Atlantic herring specimens. We are grateful for the computational resources provided by Saga through allocations to the Centre for Ecological and Evolutionary Synthesis at the University of Oslo. We also thank M. Skage, S. Kollias, M.S. Hansen, and A. Tooming-Klunderud from the Norwegian Sequencing Centre for sequencing and processing of samples. The project benefited from data generated by the RCN-funded project “The Aqua Genome Project” (221734/O30)”. Finally, this project received funding from RCN project “Catching the Past” (262777) and the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 813383. The European Research Agency is not responsible for any use that may be made of the information it contains.

References

↵
Ayala, D., Guerrero, R. F., & Kirkpatrick, M. (2013). Reproductive isolation and local adaptation quantified for a chromosome inversion in a malaria mosquito. Evolution; International Journal of Organic Evolution, 67(4), 946–958.
OpenUrl CrossRef PubMed
↵
Bansal, V., Bashir, A., & Bafna, V. (2007). Evidence for large inversion polymorphisms in the human genome from HapMap data. Genome Research, 17(2), 219–230.
OpenUrl Abstract/FREE Full Text
↵
Barrett, J. H., Boessenkool, S., Kneale, C. J., O’Connell, T. C., & Star, B. (2020). Ecological globalisation, serial depletion and the medieval trade of walrus rostra. Quaternary Science Reviews, 229, 106122.
OpenUrl
↵
Barth, J. M. I., Berg, P. R., Jonsson, P. R., Bonanomi, S., Corell, H., Hemmer-Hansen, J., Jakobsen, K. S., Johannesson, K., Jorde, P. E., Knutsen, H., Moksnes, P.-O., Star, B., Stenseth, N. C., Svedäng, H., Jentoft, S., & André, C. (2017). Genome architecture enables local adaptation of Atlantic cod despite high connectivity. Molecular Ecology, 26(17), 4452–4466.
OpenUrl
↵
Barth, J. M. I., Villegas-Ríos, D., Freitas, C., Moland, E., Star, B., André, C., Knutsen, H., Bradbury, I., Dierking, J., Petereit, C., Righton, D., Metcalfe, J., Jakobsen, K. S., Olsen, E. M., & Jentoft, S. (2019). Disentangling structural genomic and behavioural barriers in a sea of connectivity. Molecular Ecology, 28(6), 1394–1411.
OpenUrl CrossRef
↵
Berg, P. R., Star, B., Pampoulie, C., Bradbury, I. R., Bentzen, P., Hutchings, J. A., Jentoft, S., & Jakobsen, K. S. (2017). Trans-oceanic genomic divergence of Atlantic cod ecotypes is associated with large inversions. Heredity, 119(6), 418–428.
OpenUrl
↵
Berg, P. R., Star, B., Pampoulie, C., Sodeland, M., Barth, J. M. I., Knutsen, H., Jakobsen, K. S., & Jentoft, S. (2016). Three chromosomal rearrangements promote genomic divergence between migratory and stationary ecotypes of Atlantic cod. Scientific Reports, 6, 23246.
OpenUrl
↵
Boessenkool, S., Hanghøj, K., Nistelberger, H. M., Der Sarkissian, C., Gondek, A. T., Orlando, L., Barrett, J. H., & Star, B. (2017). Combining bleach and mild predigestion improves ancient DNA recovery from bones. Molecular Ecology Resources, 17(4), 742–751.
OpenUrl
↵
Bohmann, K., Mirarab, S., Bafna, V., & Gilbert, M. T. P. (2020). Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification. Molecular Ecology, 29(14), 2521–2534.
OpenUrl
↵
Cáceres, A., & González, J. R. (2015). Following the footprints of polymorphic inversions on SNP data: from detection to association tests. Nucleic Acids Research, 43(8), e53.
OpenUrl CrossRef PubMed
↵
Cáceres, A., Sindi, S. S., Raphael, B. J., Cáceres, M., & González, J. R. (2012). Identification of polymorphic inversions from genotypes. BMC Bioinformatics, 13, 28.
OpenUrl CrossRef PubMed
↵
Carøe, C., Gopalakrishnan, S., Vinner, L., Mak, S. S. T., Sinding, M. H. S., Samaniego, J. A., Wales, N., Sicheritz-Pontén, T., & Gilbert, M. T. P. (2018). Single-tube library preparation for degraded DNA. Methods in Ecology and Evolution / British Ecological Society, 9(2), 410–419.
OpenUrl
↵
Carpenter, M. L., Buenrostro, J. D., Valdiosera, C., Schroeder, H., Allentoft, M. E., Sikora, M., Rasmussen, M., Gravel, S., Guillén, S., Nekhrizov, G., Leshtakov, K., Dimitrova, D., Theodossiev, N., Pettener, D., Luiselli, D., Sandoval, K., Moreno-Estrada, A., Li, Y., Wang, J., … Bustamante, C. D. (2013). Pulling out the 1%: whole-genome capture for the targeted enrichment of ancient DNA sequencing libraries. American Journal of Human Genetics, 93(5), 852–864.
OpenUrl CrossRef PubMed
↵
Cooper, A., & Poinar, H. N. (2000). Ancient DNA: do it right or not at all. Science, 289(5482), 1139–1139.
OpenUrl CrossRef PubMed Web of Science
↵
Damgaard, P. B., Margaryan, A., Schroeder, H., Orlando, L., Willerslev, E., & Allentoft, M. E. (2015). Improving access to endogenous DNA in ancient bones and teeth. Scientific Reports, 5, 11184.
OpenUrl
↵
Danecek, P., Auton, A., Abecasis, G., Albers, C. A., Banks, E., DePristo, M. A., Handsaker, R. E., Lunter, G., Marth, G. T., Sherry, S. T., McVean, G., Durbin, R., & 1000 Genomes Project Analysis Group. (2011). The variant call format and VCFtools. Bioinformatics, 27(15), 2156–2158.
OpenUrl CrossRef PubMed Web of Science
↵
Dodsworth, S. (2015). Genome skimming for next-generation biodiversity analysis. Trends in Plant Science, 20(9), 525–527.
OpenUrl CrossRef PubMed
↵
Domagała, R., & Franczuk, R. (1992). Wyniki badań archeologiczno-architektonicznych na zamku w Małej Nieszawce. Rocznik Muzeum W Toruniu, 9, 41–53.
OpenUrl
↵
Donnelly, M. P., Paschou, P., Grigorenko, E., Gurwitz, D., Mehdi, S. Q., Kajuna, S. L. B., Barta, C., Kungulilo, S., Karoma, N. J., Lu, R.-B., Zhukova, O. V., Kim, J.-J., Comas, D., Siniscalco, M., New, M., Li, P., Li, H., Manolopoulos, V. G., Speed, W. C., … Kidd, K. K. (2010). The distribution and most recent common ancestor of the 17q21 inversion in humans. American Journal of Human Genetics, 86(2), 161–171.
OpenUrl CrossRef PubMed Web of Science
↵
Fages, A., Seguin-Orlando, A., Germonpré, M., & Orlando, L. (2020). Horse males became over-represented in archaeological assemblages during the Bronze Age. Journal of Archaeological Science: Reports, 31, 102364.
OpenUrl
↵
Fang, Z., Pyhäjärvi, T., Weber, A. L., Dawe, R. K., Glaubitz, J. C., González, J. de J. S., Ross-Ibarra, C., Doebley, J., Morrell, P. L., & Ross-Ibarra, J. (2012). Megabase-scale inversion polymorphism in the wild ancestor of maize. Genetics, 191(3), 883–894.
OpenUrl Abstract/FREE Full Text
↵
Ferrari, G., Cuevas, A., Gondek-Wyrozemska, A. T., Ballantyne, R., Kersten, O., Pálsdóttir, A. H., van der Jagt, I., Hufthammer, A. K., Ystgaard, I., Wickler, S., Bigelow, G. F., Harland, J., Nicholson, R., Orton, D., Clavel, B., Boessenkool, S., Barrett, J. H., & Star, B. (2021). The preservation of ancient DNA in archaeological fish bone. Journal of Archaeological Science, 126, 105317.
OpenUrl
↵
François, O., & Jay, F. (2020). Factor analysis of ancient population genomic samples. Nature Communications, 11(1), 4661.
OpenUrl
↵
Gamba, C., Jones, E. R., Teasdale, M. D., McLaughlin, R. L., Gonzalez-Fortes, G., Mattiangeli, V., Domboróczki, L., Kővári, I., Pap, I., Anders, A., Whittle, A., Dani, J., Raczky, P., Higham, T. F. G., Hofreiter, M., Bradley, D. G., & Pinhasi, R. (2014). Genome flux and stasis in a five millennium transect of European prehistory. Nature Communications, 5, 5257.
OpenUrl
↵
Gilbert, M. T. P., Bandelt, H.-J., Hofreiter, M., & Barnes, I. (2005). Assessing ancient DNA studies. Trends in Ecology & Evolution, 20(10), 541–544.
OpenUrl
↵
Han, F., Jamsandekar, M., Pettersson, M. E., Su, L., Fuentes-Pardo, A. P., Davis, B. W., Bekkevold, D., Berg, F., Casini, M., Dahle, G., Farrell, E. D., Folkvord, A., & Andersson, L. (2020). Ecological adaptation in Atlantic herring is associated with large shifts in allele frequencies at hundreds of loci. eLife, 9. https://doi.org/10.7554/eLife.61076
↵
Hoffmann, A. A., & Rieseberg, L. H. (2008). Revisiting the Impact of Inversions in Evolution: From Population Genetic Markers to Drivers of Adaptive Shifts and Speciation? Annual Review of Ecology, Evolution, and Systematics, 39, 21–42.
OpenUrl CrossRef Web of Science
↵
Hughes, J. F., Skaletsky, H., Pyntikova, T., Graves, T. A., van Daalen, S. K. M., Minx, P. J., Fulton, R. S., McGrath, S. D., Locke, D. P., Friedman, C., Trask, B. J., Mardis, E. R., Warren, W. C., Repping, S., Rozen, S., Wilson, R. K., & Page, D. C. (2010). Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content. Nature, 463(7280), 536–539.
OpenUrl CrossRef PubMed Web of Science
↵
Iwaszkiewicz, M. (1991). Szczątki ryb z zamku krzyżackiego w Małej Nieszawce (woj. toruńskie), Roczniki Akademii Rolniczej w Poznaniu 227. Archeozoologia, 16, 3–5.
OpenUrl
↵
Jay, P., Chouteau, M., Whibley, A., Bastide, H., Llaurens, V., Parrinello, H., & Joron, M. (2019). Mutation accumulation in chromosomal inversions maintains wing pattern polymorphism in a butterfly. In Cold Spring Harbor Laboratory (p. 736504). https://doi.org/10.1101/736504
↵
Jay, P., Whibley, A., Frézal, L., Rodríguez de Cara, M. Á., Nowell, R. W., Mallet, J., Dasmahapatra, K. K., & Joron, M. (2018). Supergene Evolution Triggered by the Introgression of a Chromosomal Inversion. Current Biology: CB, 28(11), 1839–1845.e3.
OpenUrl
↵
Jones, F. C., Grabherr, M. G., Chan, Y. F., Russell, P., Mauceli, E., Johnson, J., Swofford, R., Pirun, M., Zody, M. C., White, S., Birney, E., Searle, S., Schmutz, J., Grimwood, J., Dickson, M. C., Myers, R. M., Miller, C. T., Summers, B. R., Knecht, A. K., … Kingsley, D. M. (2012). The genomic basis of adaptive evolution in threespine sticklebacks. Nature, 484(7392), 55–61.
OpenUrl CrossRef PubMed Web of Science
↵
Joron, M., Frezal, L., Jones, R. T., Chamberlain, N. L., Lee, S. F., Haag, C. R., Whibley, A., Becuwe, M., Baxter, S. W., Ferguson, L., Wilkinson, P. A., Salazar, C., Davidson, C., Clark, R., Quail, M. A., Beasley, H., Glithero, R., Lloyd, C., Sims, S., … ffrench-Constant, R. H. (2011). Chromosomal rearrangements maintain a polymorphic supergene controlling butterfly mimicry. Nature, 477(7363), 203–206.
OpenUrl CrossRef PubMed Web of Science
↵
Joron, M., Papa, R., Beltrán, M., Chamberlain, N., Mavárez, J., Baxter, S., Abanto, M., Bermingham, E., Humphray, S. J., Rogers, J., Beasley, H., Barlow, K., ffrench-Constant, R. H., Mallet, J., McMillan, W. O., & Jiggins, C. D. (2006). A conserved supergene locus controls colour pattern diversity in Heliconius butterflies. PLoS Biology, 4(10), e303.
OpenUrl CrossRef PubMed
↵
Keighley, X., Bro-Jørgensen, M. H., Ahlgren, H., Szpak, P., Ciucani, M. M., Sánchez Barreiro, F., Howse, L., Gotfredsen, A. B., Glykou, A., Jordan, P., Lidén, K., & Olsen, M. T. (2021). Predicting sample success for large-scale ancient DNA studies on marine mammals. Molecular Ecology Resources, 21(4), 1149–1166.
OpenUrl
↵
Kirubakaran, T. G., Grove, H., Kent, M. P., Sandve, S. R., Baranski, M., Nome, T., De Rosa, M. C., Righino, B., Johansen, T., Otterå, H., Sonesson, A., Lien, S., & Andersen, Ø. (2016). Two adjacent inversions maintain genomic differentiation between migratory and stationary ecotypes of Atlantic cod. Molecular Ecology, 25(10), 2130–2143.
OpenUrl CrossRef
↵
Lee, S., Zou, F., & Wright, F. A. (2010). Convergence and prediction of principal component scores in high-dimensional settings. Annals of Statistics, 38(6), 3605–3629.
OpenUrl
↵
Leitwein, M., Garza, J. C., & Pearse, D. E. (2017). Ancestry and adaptive evolution of anadromous, resident, and adfluvial rainbow trout (Oncorhynchus mykiss) in the San Francisco bay area: application of adaptive genomic variation to conservation in a highly impacted landscape. Evolutionary Applications, 10(1), 56–67.
OpenUrl
↵
Lemaitre, C., Braga, M. D. V., Gautier, C., Sagot, M.-F., Tannier, E., & Marais, G. A. B. (2009). Footprints of inversions at present and past pseudoautosomal boundaries in human sex chromosomes. Genome Biology and Evolution, 1, 56–66.
OpenUrl CrossRef PubMed
↵
Li, H. (2011). A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics, 27(21), 2987–2993.
OpenUrl CrossRef PubMed Web of Science
↵
Llamas, B., Valverde, G., Fehren-Schmitz, L., Weyrich, L. S., Cooper, A., & Haak, W. (2017). From the field to the laboratory: Controlling DNA contamination in human ancient DNA research in the high-throughput sequencing era. STAR: Science & Technology of Archaeological Research, 3(1), 1–14.
OpenUrl
↵
Lowry, D. B., & Willis, J. H. (2010). A Widespread Chromosomal Inversion Polymorphism Contributes to a Major Life-History Transition, Local Adaptation, and Reproductive Isolation. PLoS Biology, 8(9), e1000500.
OpenUrl CrossRef PubMed
↵
Ma, J., & Amos, C. I. (2012). Investigation of inversion polymorphisms in the human genome using principal components analysis. PloS One, 7(7), e40224.
OpenUrl CrossRef PubMed
↵
Makowiecki, D. (2003). Historia ryb i rybołówstwa w holocenie na Niżu Polskim w świetle badań archeoichtiologicznych. Poznań: Institute of Archaeology and Ethnology, Polish Academy of Sciences.
↵
1. J. H. Barrett &
2. D. Orton
Makowiecki, D., Orton, D. C., & Barrett, J. H. (2016). Cod and Herring in Medieval Poland. In J. H. Barrett & D. Orton (Eds.), Cod & Herring: The Archaeology & History of Medieval Sea Fishing (pp. 117–132). Oxbow Books: Oxford & Philadelphia.
↵
Mak, S. S. T., Gopalakrishnan, S., Carøe, C., Geng, C., Liu, S., Sinding, M.-H. S., Kuderna, L. F. K., Zhang, W., Fu, S., Vieira, F. G., Germonpré, M., Bocherens, H., Fedorov, S., Petersen, B., Sicheritz-Pontén, T., Marques-Bonet, T., Zhang, G., Jiang, H., & Gilbert, M. T. P. (2017). Comparative performance of the BGISEQ-500 vs Illumina HiSeq2500 sequencing platforms for palaeogenomic sequencing. GigaScience, 6(8), 1–13.
OpenUrl CrossRef PubMed
↵
Manichaikul, A., Mychaleckyj, J. C., Rich, S. S., Daly, K., Sale, M., & Chen, W.-M. (2010). Robust relationship inference in genome-wide association studies. Bioinformatics, 26(22), 2867–2873.
OpenUrl CrossRef PubMed Web of Science
↵
Marcus, J. M. (2021). Our love-hate relationship with DNA barcodes, the Y2K problem, and the search for next generation barcodes. AIMS Genetics, 05(01), 001–023.
OpenUrl
↵
Martínez-García, L., Ferrari, G., Oosting, T., Ballantyne, R., van der Jagt, I., Ystgaard, I., Harland, J., Nicholson, R., Hamilton-Dyer, S., Baalsrud, H. T., Brieuc, M. S. O., Atmore, L. M., Burns, F., Schmölcke, U., Jakobsen, K. S., Jentoft, S., Orton, D., Hufthammer, A. K., Barrett, J. H., & Star, B. (2021). Historical Demographic Processes Dominate Genetic Variation in Ancient Atlantic Cod Mitogenomes. Frontiers in Ecology and Evolution, 9, 342.
OpenUrl
↵
McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., Garimella, K., Altshuler, D., Gabriel, S., Daly, M., & DePristo, M. A. (2010). The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Research, 20(9), 1297–1303.
OpenUrl Abstract/FREE Full Text
↵
Morales, H. E., Faria, R., Johannesson, K., Larsson, T., Panova, M., Westram, A. M., & Butlin, R. K. (2019). Genomic architecture of parallel ecological divergence: Beyond a single environmental contrast. Science Advances, 5(12), eaav9963.
OpenUrl FREE Full Text
↵
Nadeau, N. J. (2016). Genes controlling mimetic colour pattern variation in butterflies. Current Opinion in Insect Science, 17, 24–31.
OpenUrl
↵
Nadeau, N. J., Pardo-Diaz, C., Whibley, A., Supple, M. A., Saenko, S. V., Wallbank, R. W. R., Wu, G. C., Maroja, L., Ferguson, L., Hanly, J. J., Hines, H., Salazar, C., Merrill, R. M., Dowling, A. J., ffrench-Constant, R. H., Llaurens, V., Joron, M., McMillan, W. O., & Jiggins, C. D. (2016). The gene cortex controls mimicry and crypsis in butterflies and moths. Nature, 534(7605), 106–110.
OpenUrl CrossRef PubMed
↵
Nevill, P. G., Zhong, X., Tonti-Filippini, J., Byrne, M., Hislop, M., Thiele, K., van Leeuwen, S., Boykin, L. M., & Small, I. (2020). Large scale genome skimming from herbarium material for accurate plant identification and phylogenomics. Plant Methods, 16, 1.
OpenUrl
↵
Nistelberger, H. M., Pálsdóttir, A. H., Star, B., Leifsson, R., Gondek, A. T., Orlando, L., Barrett, J. H., Hallsson, J. H., & Boessenkool, S. (2019). Sexing Viking Age horses from burial and non-burial sites in Iceland using ancient DNA. Journal of Archaeological Science, 101, 115–122.
OpenUrl
↵
Noor, M. A. F., Grams, K. L., Bertucci, L. A., & Reiland, J. (2001). Chromosomal inversions and the reproductive isolation of species. Proceedings of the National Academy of Sciences of the United States of America, 98(21), 12084–12088.
OpenUrl Abstract/FREE Full Text
↵
Ogden, R. (2011). Unlocking the potential of genomic technologies for wildlife forensics. Molecular Ecology Resources, 11 Suppl 1, 109–116.
OpenUrl
↵
Pálsdóttir, A. H., Bläuer, A., Rannamäe, E., Boessenkool, S., & Hallsson, J. H. (2019). Not a limitless resource: ethics and guidelines for destructive sampling of archaeofaunal remains. Royal Society Open Science, 6(10), 191059.
OpenUrl
↵
Patterson, N., Price, A. L., & Reich, D. (2006). Population structure and eigenanalysis. PLoS Genetics, 2(12), e190.
OpenUrl
↵
Pečnerová, P., Díez-Del-Molino, D., Dussex, N., Feuerborn, T., von Seth, J., van der Plicht, J., Nikolskiy, P., Tikhonov, A., Vartanyan, S., & Dalén, L. (2017). Genome-Based Sexing Provides Clues about Behavior and Social Structure in the Woolly Mammoth. Current Biology: CB, 27(22), 3505–3510.e3.
OpenUrl
↵
Peterson, B. K., Weber, J. N., Kay, E. H., Fisher, H. S., & Hoekstra, H. E. (2012). Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PloS One, 7(5), e37135.
OpenUrl CrossRef PubMed
↵
Pettersson, M. E., Rochus, C. M., Han, F., Chen, J., Hill, J., Wallerman, O., Fan, G., Hong, X., Xu, Q., Zhang, H., Liu, S., Liu, X., Haggerty, L., Hunt, T., Martin, F. J., Flicek, P., Bunikis, I., Folkvord, A., & Andersson, L. (2019). A chromosome-level assembly of the Atlantic herring genome—detection of a supergene and other signals of selection. Genome Research. https://doi.org/10.1101/gr.253435.119
↵
Pinhasi, R., Fernandes, D., Sirak, K., Novak, M., Connell, S., Alpaslan-Roodenberg, S., Gerritsen, F., Moiseyev, V., Gromov, A., Raczky, P., Anders, A., Pietrusewsky, M., Rollefson, G., Jovanovic, M., Trinhhoang, H., Bar-Oz, G., Oxenham, M., Matsumura, H., & Hofreiter, M. (2015). Optimal Ancient DNA Yields from the Inner Ear Part of the Human Petrous Bone. PloS One, 10(6), e0129102.
OpenUrl CrossRef PubMed
↵
Pinsky, M. L., Eikeset, A. M., Helmerson, C., Bradbury, I. R., Bentzen, P., Morris, C., Gondek-Wyrozemska, A. T., Baalsrud, H. T., Brieuc, M. S. O., Kjesbu, O. S., Godiksen, J. A., Barth, J. M. I., Matschiner, M., Stenseth, N. C., Jakobsen, K. S., Jentoft, S., & Star, B. (2021). Genomic stability through time despite decades of exploitation in cod on both sides of the Atlantic. Proceedings of the National Academy of Sciences of the United States of America, 118(15). https://doi.org/10.1073/pnas.2025453118
↵
Price, A. L., Patterson, N. J., Plenge, R. M., Weinblatt, M. E., Shadick, N. A., & Reich, D. (2006). Principal components analysis corrects for stratification in genome-wide association studies. Nature Genetics, 38(8), 904–909.
OpenUrl CrossRef PubMed Web of Science
↵
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A. R., Bender, D., Maller, J., Sklar, P., de Bakker, P. I. W., Daly, M. J., & Sham, P. C. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics, 81(3), 559–575.
OpenUrl CrossRef PubMed
↵
Ruiz-Arenas, C., Cáceres, A., López-Sánchez, M., Tolosana, I., Pérez-Jurado, L., & González, J. R. (2019). scoreInvHap: Inversion genotyping for genome-wide association studies. PLoS Genetics, 15(7), e1008203.
OpenUrl
↵
Runa, D., & Harbison, S. (2021). Sequencing Technology in Forensic Science: Next-Generation Sequencing. In Forensic DNA Analysis (pp. 149–199).
↵
Salm, M. P. A., Horswell, S. D., Hutchison, C. E., Speedy, H. E., Yang, X., Liang, L., Schadt, E. E., Cookson, W. O., Wierzbicki, A. S., Naoumova, R. P., & Shoulders, C. C. (2012). The origin, global distribution, and functional impact of the human 8p23 inversion polymorphism. Genome Research, 22(6), 1144–1153.
OpenUrl Abstract/FREE Full Text
↵
Schubert, M., Ermini, L., Der Sarkissian, C., Jónsson, H., Ginolhac, A., Schaefer, R., Martin, M. D., Fernández, R., Kircher, M., McCue, M., Willerslev, E., & Orlando, L. (2014). Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX. Nature Protocols, 9(5), 1056–1082.
OpenUrl
↵
Sindi, S. S., & Raphael, B. J. (2010). Identification and frequency estimation of inversion polymorphisms from haplotype data. Journal of Computational Biology: A Journal of Computational Molecular Cell Biology, 17(3), 517–531.
OpenUrl
↵
Skoglund, P., Sjödin, P., Skoglund, T., Lascoux, M., & Jakobsson, M. (2014). Investigating population history using temporal genetic differentiation. Molecular Biology and Evolution, 31(9), 2516–2527.
OpenUrl CrossRef PubMed
↵
Sodeland, M., Jorde, P. E., Lien, S., Jentoft, S., Berg, P. R., Grove, H., Kent, M. P., Arnyasi, M., Olsen, E. M., & Knutsen, H. (2016). “Islands of Divergence” in the Atlantic Cod Genome Represent Polymorphic Chromosomal Rearrangements. Genome Biology and Evolution, 8(4), 1012–1022.
OpenUrl CrossRef PubMed
↵
Star, B., Barrett, J. H., Gondek, A. T., & Boessenkool, S. (2018). Ancient DNA reveals the chronology of walrus ivory trade from Norse Greenland. Proceedings. Biological Sciences / The Royal Society, 285(1884), 20180978.
OpenUrl
↵
Star, B., Boessenkool, S., Gondek, A. T., Nikulina, E. A., Hufthammer, A. K., Pampoulie, C., Knutsen, H., André, C., Nistelberger, H. M., Dierking, J., Petereit, C., Heinrich, D., Jakobsen, K. S., Stenseth, N. C., Jentoft, S., & Barrett, J. H. (2017). Ancient DNA reveals the Arctic origin of Viking Age cod from Haithabu, Germany. Proceedings of the National Academy of Sciences of the United States of America, 114(34), 9152–9157.
OpenUrl Abstract/FREE Full Text
↵
Star, B., Nederbragt, A. J., Jentoft, S., Grimholt, U., Malmstrøm, M., Gregers, T. F., Rounge, T. B., Paulsen, J., Solbakken, M. H., Sharma, A., Wetten, O. F., Lanzén, A., Winer, R., Knight, J., Vogel, J.-H., Aken, B., Andersen, O., Lagesen, K., Tooming-Klunderud, A., … Jakobsen, K. S. (2011). The genome sequence of Atlantic cod reveals a unique immune system. Nature, 477(7363), 207–210.
OpenUrl CrossRef PubMed Web of Science
↵
Suchan, T., Pitteloud, C., Gerasimova, N. S., Kostikova, A., Schmid, S., Arrigo, N., Pajkovic, M., Ronikier, M., & Alvarez, N. (2016). Hybridization Capture Using RAD Probes (hyRAD), a New Tool for Performing Genomic Analyses on Collection Specimens. PloS One, 11(3), e0151651.
OpenUrl CrossRef PubMed
↵
Tin, M. M.-Y., Economo, E. P., & Mikheyev, A. S. (2014). Sequencing degraded DNA from non-destructively sampled museum specimens for RAD-tagging and low-coverage shotgun phylogenetics. PloS One, 9(5), e96793.
OpenUrl CrossRef PubMed
↵
Todesco, M., Owens, G. L., Bercovich, N., Légaré, J.-S., Soudi, S., Burge, D. O., Huang, K., Ostevik, K. L., Drummond, E. B. M., Imerovski, I., Lande, K., Pascual-Robles, M. A., Nanavati, M., Jahani, M., Cheung, W., Staton, S. E., Muños, S., Nielsen, R., Donovan, L. A., … Rieseberg, L. H. (2020). Massive haplotypes underlie ecotypic differentiation in sunflowers. Nature, 584(7822), 602–607.
OpenUrl
↵
Tørresen, O. K., Star, B., Jentoft, S., Reinar, W. B., Grove, H., Miller, J. R., Walenz, B. P., Knight, J., Ekholm, J. M., Peluso, P., Edvardsen, R. B., Tooming-Klunderud, A., Skage, M., Lien, S., Jakobsen, K. S., & Nederbragt, A. J. (2017). An improved genome assembly uncovers prolific tandem repeats in Atlantic cod. BMC Genomics, 18(1), 95.
OpenUrl
↵
Twyford, A. D., & Friedman, J. (2015). Adaptive divergence in the monkey flower Mimulus guttatus is maintained by a chromosomal inversion. Evolution; International Journal of Organic Evolution, 69(6), 1476–1486.
OpenUrl CrossRef PubMed
↵
Van der Auwera, G. A., & O’Connor, B. D. (2020). Genomics in the Cloud: Using Docker, GATK, and WDL. In Terra (1st Edition). O’Reilly Media.
↵
van der Valk, T., Pečnerová, P., Díez-Del-Molino, D., Bergström, A., Oppenheimer, J., Hartmann, S., Xenikoudakis, G., Thomas, J. A., Dehasque, M., Sağlıcan, E., Fidan, F. R., Barnes, I., Liu, S., Somel, M., Heintzman, P. D., Nikolskiy, P., Shapiro, B., Skoglund, P., Hofreiter, M., … Dalén, L. (2021). Million-year-old DNA sheds light on the genomic history of mammoths. Nature, 591(7849), 265–269.
OpenUrl CrossRef PubMed
↵
Wellenreuther, M., & Bernatchez, L. (2018). Eco-Evolutionary Genomics of Chromosomal Inversions. Trends in Ecology & Evolution, 33(6), 427–440.
OpenUrl
↵
Zeng, C.-X., Hollingsworth, P. M., Yang, J., He, Z.-S., Zhang, Z.-R., Li, D.-Z., & Yang, J.-B. (2018). Genome skimming herbarium specimens for DNA barcoding and phylogenomics. Plant Methods, 14, 43.
OpenUrl

View the discussion thread.

Posted June 07, 2021.

Download PDF

Citation Tools

Subject Area

Bioinformatics

Subject Areas

All Articles

Animal Behavior and Cognition (5214)
Biochemistry (11745)
Bioengineering (8751)
Bioinformatics (29195)
Biophysics (14971)
Cancer Biology (12095)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18306)
Genetics (12245)
Genomics (16801)
Immunology (11867)
Microbiology (28083)
Molecular Biology (11592)
Neuroscience (60965)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2885)
Systems Biology (7339)
Zoology (1651)

[1] ↵
Ayala, D., Guerrero, R. F., & Kirkpatrick, M. (2013). Reproductive isolation and local adaptation quantified for a chromosome inversion in a malaria mosquito. Evolution; International Journal of Organic Evolution, 67(4), 946–958.
OpenUrl CrossRef PubMed

[2] ↵
Bansal, V., Bashir, A., & Bafna, V. (2007). Evidence for large inversion polymorphisms in the human genome from HapMap data. Genome Research, 17(2), 219–230.
OpenUrl Abstract/FREE Full Text

[3] ↵
Barrett, J. H., Boessenkool, S., Kneale, C. J., O’Connell, T. C., & Star, B. (2020). Ecological globalisation, serial depletion and the medieval trade of walrus rostra. Quaternary Science Reviews, 229, 106122.
OpenUrl

[4] ↵
Barth, J. M. I., Berg, P. R., Jonsson, P. R., Bonanomi, S., Corell, H., Hemmer-Hansen, J., Jakobsen, K. S., Johannesson, K., Jorde, P. E., Knutsen, H., Moksnes, P.-O., Star, B., Stenseth, N. C., Svedäng, H., Jentoft, S., & André, C. (2017). Genome architecture enables local adaptation of Atlantic cod despite high connectivity. Molecular Ecology, 26(17), 4452–4466.
OpenUrl

[5] ↵
Barth, J. M. I., Villegas-Ríos, D., Freitas, C., Moland, E., Star, B., André, C., Knutsen, H., Bradbury, I., Dierking, J., Petereit, C., Righton, D., Metcalfe, J., Jakobsen, K. S., Olsen, E. M., & Jentoft, S. (2019). Disentangling structural genomic and behavioural barriers in a sea of connectivity. Molecular Ecology, 28(6), 1394–1411.
OpenUrl CrossRef

[6] ↵
Berg, P. R., Star, B., Pampoulie, C., Bradbury, I. R., Bentzen, P., Hutchings, J. A., Jentoft, S., & Jakobsen, K. S. (2017). Trans-oceanic genomic divergence of Atlantic cod ecotypes is associated with large inversions. Heredity, 119(6), 418–428.
OpenUrl

[7] ↵
Berg, P. R., Star, B., Pampoulie, C., Sodeland, M., Barth, J. M. I., Knutsen, H., Jakobsen, K. S., & Jentoft, S. (2016). Three chromosomal rearrangements promote genomic divergence between migratory and stationary ecotypes of Atlantic cod. Scientific Reports, 6, 23246.
OpenUrl

[8] ↵
Boessenkool, S., Hanghøj, K., Nistelberger, H. M., Der Sarkissian, C., Gondek, A. T., Orlando, L., Barrett, J. H., & Star, B. (2017). Combining bleach and mild predigestion improves ancient DNA recovery from bones. Molecular Ecology Resources, 17(4), 742–751.
OpenUrl

[9] ↵
Bohmann, K., Mirarab, S., Bafna, V., & Gilbert, M. T. P. (2020). Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification. Molecular Ecology, 29(14), 2521–2534.
OpenUrl

[10] ↵
Cáceres, A., & González, J. R. (2015). Following the footprints of polymorphic inversions on SNP data: from detection to association tests. Nucleic Acids Research, 43(8), e53.
OpenUrl CrossRef PubMed

[11] ↵
Cáceres, A., Sindi, S. S., Raphael, B. J., Cáceres, M., & González, J. R. (2012). Identification of polymorphic inversions from genotypes. BMC Bioinformatics, 13, 28.
OpenUrl CrossRef PubMed

[12] ↵
Carøe, C., Gopalakrishnan, S., Vinner, L., Mak, S. S. T., Sinding, M. H. S., Samaniego, J. A., Wales, N., Sicheritz-Pontén, T., & Gilbert, M. T. P. (2018). Single-tube library preparation for degraded DNA. Methods in Ecology and Evolution / British Ecological Society, 9(2), 410–419.
OpenUrl

[13] ↵
Carpenter, M. L., Buenrostro, J. D., Valdiosera, C., Schroeder, H., Allentoft, M. E., Sikora, M., Rasmussen, M., Gravel, S., Guillén, S., Nekhrizov, G., Leshtakov, K., Dimitrova, D., Theodossiev, N., Pettener, D., Luiselli, D., Sandoval, K., Moreno-Estrada, A., Li, Y., Wang, J., … Bustamante, C. D. (2013). Pulling out the 1%: whole-genome capture for the targeted enrichment of ancient DNA sequencing libraries. American Journal of Human Genetics, 93(5), 852–864.
OpenUrl CrossRef PubMed

[14] ↵
Cooper, A., & Poinar, H. N. (2000). Ancient DNA: do it right or not at all. Science, 289(5482), 1139–1139.
OpenUrl CrossRef PubMed Web of Science

[15] ↵
Damgaard, P. B., Margaryan, A., Schroeder, H., Orlando, L., Willerslev, E., & Allentoft, M. E. (2015). Improving access to endogenous DNA in ancient bones and teeth. Scientific Reports, 5, 11184.
OpenUrl

[16] ↵
Danecek, P., Auton, A., Abecasis, G., Albers, C. A., Banks, E., DePristo, M. A., Handsaker, R. E., Lunter, G., Marth, G. T., Sherry, S. T., McVean, G., Durbin, R., & 1000 Genomes Project Analysis Group. (2011). The variant call format and VCFtools. Bioinformatics, 27(15), 2156–2158.
OpenUrl CrossRef PubMed Web of Science

[17] ↵
Dodsworth, S. (2015). Genome skimming for next-generation biodiversity analysis. Trends in Plant Science, 20(9), 525–527.
OpenUrl CrossRef PubMed

[18] ↵
Domagała, R., & Franczuk, R. (1992). Wyniki badań archeologiczno-architektonicznych na zamku w Małej Nieszawce. Rocznik Muzeum W Toruniu, 9, 41–53.
OpenUrl

[19] ↵
Donnelly, M. P., Paschou, P., Grigorenko, E., Gurwitz, D., Mehdi, S. Q., Kajuna, S. L. B., Barta, C., Kungulilo, S., Karoma, N. J., Lu, R.-B., Zhukova, O. V., Kim, J.-J., Comas, D., Siniscalco, M., New, M., Li, P., Li, H., Manolopoulos, V. G., Speed, W. C., … Kidd, K. K. (2010). The distribution and most recent common ancestor of the 17q21 inversion in humans. American Journal of Human Genetics, 86(2), 161–171.
OpenUrl CrossRef PubMed Web of Science

[20] ↵
Fages, A., Seguin-Orlando, A., Germonpré, M., & Orlando, L. (2020). Horse males became over-represented in archaeological assemblages during the Bronze Age. Journal of Archaeological Science: Reports, 31, 102364.
OpenUrl

[21] ↵
Fang, Z., Pyhäjärvi, T., Weber, A. L., Dawe, R. K., Glaubitz, J. C., González, J. de J. S., Ross-Ibarra, C., Doebley, J., Morrell, P. L., & Ross-Ibarra, J. (2012). Megabase-scale inversion polymorphism in the wild ancestor of maize. Genetics, 191(3), 883–894.
OpenUrl Abstract/FREE Full Text

[22] ↵
Ferrari, G., Cuevas, A., Gondek-Wyrozemska, A. T., Ballantyne, R., Kersten, O., Pálsdóttir, A. H., van der Jagt, I., Hufthammer, A. K., Ystgaard, I., Wickler, S., Bigelow, G. F., Harland, J., Nicholson, R., Orton, D., Clavel, B., Boessenkool, S., Barrett, J. H., & Star, B. (2021). The preservation of ancient DNA in archaeological fish bone. Journal of Archaeological Science, 126, 105317.
OpenUrl

[23] ↵
François, O., & Jay, F. (2020). Factor analysis of ancient population genomic samples. Nature Communications, 11(1), 4661.
OpenUrl

[24] ↵
Gamba, C., Jones, E. R., Teasdale, M. D., McLaughlin, R. L., Gonzalez-Fortes, G., Mattiangeli, V., Domboróczki, L., Kővári, I., Pap, I., Anders, A., Whittle, A., Dani, J., Raczky, P., Higham, T. F. G., Hofreiter, M., Bradley, D. G., & Pinhasi, R. (2014). Genome flux and stasis in a five millennium transect of European prehistory. Nature Communications, 5, 5257.
OpenUrl

[25] ↵
Gilbert, M. T. P., Bandelt, H.-J., Hofreiter, M., & Barnes, I. (2005). Assessing ancient DNA studies. Trends in Ecology & Evolution, 20(10), 541–544.
OpenUrl

[26] ↵
Han, F., Jamsandekar, M., Pettersson, M. E., Su, L., Fuentes-Pardo, A. P., Davis, B. W., Bekkevold, D., Berg, F., Casini, M., Dahle, G., Farrell, E. D., Folkvord, A., & Andersson, L. (2020). Ecological adaptation in Atlantic herring is associated with large shifts in allele frequencies at hundreds of loci. eLife, 9. https://doi.org/10.7554/eLife.61076

[27] ↵
Hoffmann, A. A., & Rieseberg, L. H. (2008). Revisiting the Impact of Inversions in Evolution: From Population Genetic Markers to Drivers of Adaptive Shifts and Speciation? Annual Review of Ecology, Evolution, and Systematics, 39, 21–42.
OpenUrl CrossRef Web of Science

[28] ↵
Hughes, J. F., Skaletsky, H., Pyntikova, T., Graves, T. A., van Daalen, S. K. M., Minx, P. J., Fulton, R. S., McGrath, S. D., Locke, D. P., Friedman, C., Trask, B. J., Mardis, E. R., Warren, W. C., Repping, S., Rozen, S., Wilson, R. K., & Page, D. C. (2010). Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content. Nature, 463(7280), 536–539.
OpenUrl CrossRef PubMed Web of Science

[29] ↵
Iwaszkiewicz, M. (1991). Szczątki ryb z zamku krzyżackiego w Małej Nieszawce (woj. toruńskie), Roczniki Akademii Rolniczej w Poznaniu 227. Archeozoologia, 16, 3–5.
OpenUrl

[30] ↵
Jay, P., Chouteau, M., Whibley, A., Bastide, H., Llaurens, V., Parrinello, H., & Joron, M. (2019). Mutation accumulation in chromosomal inversions maintains wing pattern polymorphism in a butterfly. In Cold Spring Harbor Laboratory (p. 736504). https://doi.org/10.1101/736504

[31] ↵
Jay, P., Whibley, A., Frézal, L., Rodríguez de Cara, M. Á., Nowell, R. W., Mallet, J., Dasmahapatra, K. K., & Joron, M. (2018). Supergene Evolution Triggered by the Introgression of a Chromosomal Inversion. Current Biology: CB, 28(11), 1839–1845.e3.
OpenUrl

[32] ↵
Jones, F. C., Grabherr, M. G., Chan, Y. F., Russell, P., Mauceli, E., Johnson, J., Swofford, R., Pirun, M., Zody, M. C., White, S., Birney, E., Searle, S., Schmutz, J., Grimwood, J., Dickson, M. C., Myers, R. M., Miller, C. T., Summers, B. R., Knecht, A. K., … Kingsley, D. M. (2012). The genomic basis of adaptive evolution in threespine sticklebacks. Nature, 484(7392), 55–61.
OpenUrl CrossRef PubMed Web of Science

[33] ↵
Joron, M., Frezal, L., Jones, R. T., Chamberlain, N. L., Lee, S. F., Haag, C. R., Whibley, A., Becuwe, M., Baxter, S. W., Ferguson, L., Wilkinson, P. A., Salazar, C., Davidson, C., Clark, R., Quail, M. A., Beasley, H., Glithero, R., Lloyd, C., Sims, S., … ffrench-Constant, R. H. (2011). Chromosomal rearrangements maintain a polymorphic supergene controlling butterfly mimicry. Nature, 477(7363), 203–206.
OpenUrl CrossRef PubMed Web of Science

[34] ↵
Joron, M., Papa, R., Beltrán, M., Chamberlain, N., Mavárez, J., Baxter, S., Abanto, M., Bermingham, E., Humphray, S. J., Rogers, J., Beasley, H., Barlow, K., ffrench-Constant, R. H., Mallet, J., McMillan, W. O., & Jiggins, C. D. (2006). A conserved supergene locus controls colour pattern diversity in Heliconius butterflies. PLoS Biology, 4(10), e303.
OpenUrl CrossRef PubMed

[35] ↵
Keighley, X., Bro-Jørgensen, M. H., Ahlgren, H., Szpak, P., Ciucani, M. M., Sánchez Barreiro, F., Howse, L., Gotfredsen, A. B., Glykou, A., Jordan, P., Lidén, K., & Olsen, M. T. (2021). Predicting sample success for large-scale ancient DNA studies on marine mammals. Molecular Ecology Resources, 21(4), 1149–1166.
OpenUrl

[36] ↵
Kirubakaran, T. G., Grove, H., Kent, M. P., Sandve, S. R., Baranski, M., Nome, T., De Rosa, M. C., Righino, B., Johansen, T., Otterå, H., Sonesson, A., Lien, S., & Andersen, Ø. (2016). Two adjacent inversions maintain genomic differentiation between migratory and stationary ecotypes of Atlantic cod. Molecular Ecology, 25(10), 2130–2143.
OpenUrl CrossRef

[37] ↵
Lee, S., Zou, F., & Wright, F. A. (2010). Convergence and prediction of principal component scores in high-dimensional settings. Annals of Statistics, 38(6), 3605–3629.
OpenUrl

[38] ↵
Leitwein, M., Garza, J. C., & Pearse, D. E. (2017). Ancestry and adaptive evolution of anadromous, resident, and adfluvial rainbow trout (Oncorhynchus mykiss) in the San Francisco bay area: application of adaptive genomic variation to conservation in a highly impacted landscape. Evolutionary Applications, 10(1), 56–67.
OpenUrl

[39] ↵
Lemaitre, C., Braga, M. D. V., Gautier, C., Sagot, M.-F., Tannier, E., & Marais, G. A. B. (2009). Footprints of inversions at present and past pseudoautosomal boundaries in human sex chromosomes. Genome Biology and Evolution, 1, 56–66.
OpenUrl CrossRef PubMed

[40] ↵
Li, H. (2011). A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics, 27(21), 2987–2993.
OpenUrl CrossRef PubMed Web of Science

[41] ↵
Llamas, B., Valverde, G., Fehren-Schmitz, L., Weyrich, L. S., Cooper, A., & Haak, W. (2017). From the field to the laboratory: Controlling DNA contamination in human ancient DNA research in the high-throughput sequencing era. STAR: Science & Technology of Archaeological Research, 3(1), 1–14.
OpenUrl

[42] ↵
Lowry, D. B., & Willis, J. H. (2010). A Widespread Chromosomal Inversion Polymorphism Contributes to a Major Life-History Transition, Local Adaptation, and Reproductive Isolation. PLoS Biology, 8(9), e1000500.
OpenUrl CrossRef PubMed

[43] ↵
Ma, J., & Amos, C. I. (2012). Investigation of inversion polymorphisms in the human genome using principal components analysis. PloS One, 7(7), e40224.
OpenUrl CrossRef PubMed

[44] ↵
Makowiecki, D. (2003). Historia ryb i rybołówstwa w holocenie na Niżu Polskim w świetle badań archeoichtiologicznych. Poznań: Institute of Archaeology and Ethnology, Polish Academy of Sciences.

[45] ↵
J. H. Barrett &
D. Orton
Makowiecki, D., Orton, D. C., & Barrett, J. H. (2016). Cod and Herring in Medieval Poland. In J. H. Barrett & D. Orton (Eds.), Cod & Herring: The Archaeology & History of Medieval Sea Fishing (pp. 117–132). Oxbow Books: Oxford & Philadelphia.

[46] J. H. Barrett &

[47] D. Orton

[48] ↵
Mak, S. S. T., Gopalakrishnan, S., Carøe, C., Geng, C., Liu, S., Sinding, M.-H. S., Kuderna, L. F. K., Zhang, W., Fu, S., Vieira, F. G., Germonpré, M., Bocherens, H., Fedorov, S., Petersen, B., Sicheritz-Pontén, T., Marques-Bonet, T., Zhang, G., Jiang, H., & Gilbert, M. T. P. (2017). Comparative performance of the BGISEQ-500 vs Illumina HiSeq2500 sequencing platforms for palaeogenomic sequencing. GigaScience, 6(8), 1–13.
OpenUrl CrossRef PubMed

[49] ↵
Manichaikul, A., Mychaleckyj, J. C., Rich, S. S., Daly, K., Sale, M., & Chen, W.-M. (2010). Robust relationship inference in genome-wide association studies. Bioinformatics, 26(22), 2867–2873.
OpenUrl CrossRef PubMed Web of Science

[50] ↵
Marcus, J. M. (2021). Our love-hate relationship with DNA barcodes, the Y2K problem, and the search for next generation barcodes. AIMS Genetics, 05(01), 001–023.
OpenUrl

[51] ↵
Martínez-García, L., Ferrari, G., Oosting, T., Ballantyne, R., van der Jagt, I., Ystgaard, I., Harland, J., Nicholson, R., Hamilton-Dyer, S., Baalsrud, H. T., Brieuc, M. S. O., Atmore, L. M., Burns, F., Schmölcke, U., Jakobsen, K. S., Jentoft, S., Orton, D., Hufthammer, A. K., Barrett, J. H., & Star, B. (2021). Historical Demographic Processes Dominate Genetic Variation in Ancient Atlantic Cod Mitogenomes. Frontiers in Ecology and Evolution, 9, 342.
OpenUrl

[52] ↵
McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., Garimella, K., Altshuler, D., Gabriel, S., Daly, M., & DePristo, M. A. (2010). The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Research, 20(9), 1297–1303.
OpenUrl Abstract/FREE Full Text

[53] ↵
Morales, H. E., Faria, R., Johannesson, K., Larsson, T., Panova, M., Westram, A. M., & Butlin, R. K. (2019). Genomic architecture of parallel ecological divergence: Beyond a single environmental contrast. Science Advances, 5(12), eaav9963.
OpenUrl FREE Full Text

[54] ↵
Nadeau, N. J. (2016). Genes controlling mimetic colour pattern variation in butterflies. Current Opinion in Insect Science, 17, 24–31.
OpenUrl

[55] ↵
Nadeau, N. J., Pardo-Diaz, C., Whibley, A., Supple, M. A., Saenko, S. V., Wallbank, R. W. R., Wu, G. C., Maroja, L., Ferguson, L., Hanly, J. J., Hines, H., Salazar, C., Merrill, R. M., Dowling, A. J., ffrench-Constant, R. H., Llaurens, V., Joron, M., McMillan, W. O., & Jiggins, C. D. (2016). The gene cortex controls mimicry and crypsis in butterflies and moths. Nature, 534(7605), 106–110.
OpenUrl CrossRef PubMed

[56] ↵
Nevill, P. G., Zhong, X., Tonti-Filippini, J., Byrne, M., Hislop, M., Thiele, K., van Leeuwen, S., Boykin, L. M., & Small, I. (2020). Large scale genome skimming from herbarium material for accurate plant identification and phylogenomics. Plant Methods, 16, 1.
OpenUrl

[57] ↵
Nistelberger, H. M., Pálsdóttir, A. H., Star, B., Leifsson, R., Gondek, A. T., Orlando, L., Barrett, J. H., Hallsson, J. H., & Boessenkool, S. (2019). Sexing Viking Age horses from burial and non-burial sites in Iceland using ancient DNA. Journal of Archaeological Science, 101, 115–122.
OpenUrl

[58] ↵
Noor, M. A. F., Grams, K. L., Bertucci, L. A., & Reiland, J. (2001). Chromosomal inversions and the reproductive isolation of species. Proceedings of the National Academy of Sciences of the United States of America, 98(21), 12084–12088.
OpenUrl Abstract/FREE Full Text

[59] ↵
Ogden, R. (2011). Unlocking the potential of genomic technologies for wildlife forensics. Molecular Ecology Resources, 11 Suppl 1, 109–116.
OpenUrl

[60] ↵
Pálsdóttir, A. H., Bläuer, A., Rannamäe, E., Boessenkool, S., & Hallsson, J. H. (2019). Not a limitless resource: ethics and guidelines for destructive sampling of archaeofaunal remains. Royal Society Open Science, 6(10), 191059.
OpenUrl

[61] ↵
Patterson, N., Price, A. L., & Reich, D. (2006). Population structure and eigenanalysis. PLoS Genetics, 2(12), e190.
OpenUrl

[62] ↵
Pečnerová, P., Díez-Del-Molino, D., Dussex, N., Feuerborn, T., von Seth, J., van der Plicht, J., Nikolskiy, P., Tikhonov, A., Vartanyan, S., & Dalén, L. (2017). Genome-Based Sexing Provides Clues about Behavior and Social Structure in the Woolly Mammoth. Current Biology: CB, 27(22), 3505–3510.e3.
OpenUrl

[63] ↵
Peterson, B. K., Weber, J. N., Kay, E. H., Fisher, H. S., & Hoekstra, H. E. (2012). Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PloS One, 7(5), e37135.
OpenUrl CrossRef PubMed

[64] ↵
Pettersson, M. E., Rochus, C. M., Han, F., Chen, J., Hill, J., Wallerman, O., Fan, G., Hong, X., Xu, Q., Zhang, H., Liu, S., Liu, X., Haggerty, L., Hunt, T., Martin, F. J., Flicek, P., Bunikis, I., Folkvord, A., & Andersson, L. (2019). A chromosome-level assembly of the Atlantic herring genome—detection of a supergene and other signals of selection. Genome Research. https://doi.org/10.1101/gr.253435.119

[65] ↵
Pinhasi, R., Fernandes, D., Sirak, K., Novak, M., Connell, S., Alpaslan-Roodenberg, S., Gerritsen, F., Moiseyev, V., Gromov, A., Raczky, P., Anders, A., Pietrusewsky, M., Rollefson, G., Jovanovic, M., Trinhhoang, H., Bar-Oz, G., Oxenham, M., Matsumura, H., & Hofreiter, M. (2015). Optimal Ancient DNA Yields from the Inner Ear Part of the Human Petrous Bone. PloS One, 10(6), e0129102.
OpenUrl CrossRef PubMed

[66] ↵
Pinsky, M. L., Eikeset, A. M., Helmerson, C., Bradbury, I. R., Bentzen, P., Morris, C., Gondek-Wyrozemska, A. T., Baalsrud, H. T., Brieuc, M. S. O., Kjesbu, O. S., Godiksen, J. A., Barth, J. M. I., Matschiner, M., Stenseth, N. C., Jakobsen, K. S., Jentoft, S., & Star, B. (2021). Genomic stability through time despite decades of exploitation in cod on both sides of the Atlantic. Proceedings of the National Academy of Sciences of the United States of America, 118(15). https://doi.org/10.1073/pnas.2025453118

[67] ↵
Price, A. L., Patterson, N. J., Plenge, R. M., Weinblatt, M. E., Shadick, N. A., & Reich, D. (2006). Principal components analysis corrects for stratification in genome-wide association studies. Nature Genetics, 38(8), 904–909.
OpenUrl CrossRef PubMed Web of Science

[68] ↵
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A. R., Bender, D., Maller, J., Sklar, P., de Bakker, P. I. W., Daly, M. J., & Sham, P. C. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics, 81(3), 559–575.
OpenUrl CrossRef PubMed

[69] ↵
Ruiz-Arenas, C., Cáceres, A., López-Sánchez, M., Tolosana, I., Pérez-Jurado, L., & González, J. R. (2019). scoreInvHap: Inversion genotyping for genome-wide association studies. PLoS Genetics, 15(7), e1008203.
OpenUrl

[70] ↵
Runa, D., & Harbison, S. (2021). Sequencing Technology in Forensic Science: Next-Generation Sequencing. In Forensic DNA Analysis (pp. 149–199).

[71] ↵
Salm, M. P. A., Horswell, S. D., Hutchison, C. E., Speedy, H. E., Yang, X., Liang, L., Schadt, E. E., Cookson, W. O., Wierzbicki, A. S., Naoumova, R. P., & Shoulders, C. C. (2012). The origin, global distribution, and functional impact of the human 8p23 inversion polymorphism. Genome Research, 22(6), 1144–1153.
OpenUrl Abstract/FREE Full Text

[72] ↵
Schubert, M., Ermini, L., Der Sarkissian, C., Jónsson, H., Ginolhac, A., Schaefer, R., Martin, M. D., Fernández, R., Kircher, M., McCue, M., Willerslev, E., & Orlando, L. (2014). Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX. Nature Protocols, 9(5), 1056–1082.
OpenUrl

[73] ↵
Sindi, S. S., & Raphael, B. J. (2010). Identification and frequency estimation of inversion polymorphisms from haplotype data. Journal of Computational Biology: A Journal of Computational Molecular Cell Biology, 17(3), 517–531.
OpenUrl

[74] ↵
Skoglund, P., Sjödin, P., Skoglund, T., Lascoux, M., & Jakobsson, M. (2014). Investigating population history using temporal genetic differentiation. Molecular Biology and Evolution, 31(9), 2516–2527.
OpenUrl CrossRef PubMed

[75] ↵
Sodeland, M., Jorde, P. E., Lien, S., Jentoft, S., Berg, P. R., Grove, H., Kent, M. P., Arnyasi, M., Olsen, E. M., & Knutsen, H. (2016). “Islands of Divergence” in the Atlantic Cod Genome Represent Polymorphic Chromosomal Rearrangements. Genome Biology and Evolution, 8(4), 1012–1022.
OpenUrl CrossRef PubMed

[76] ↵
Star, B., Barrett, J. H., Gondek, A. T., & Boessenkool, S. (2018). Ancient DNA reveals the chronology of walrus ivory trade from Norse Greenland. Proceedings. Biological Sciences / The Royal Society, 285(1884), 20180978.
OpenUrl

[77] ↵
Star, B., Boessenkool, S., Gondek, A. T., Nikulina, E. A., Hufthammer, A. K., Pampoulie, C., Knutsen, H., André, C., Nistelberger, H. M., Dierking, J., Petereit, C., Heinrich, D., Jakobsen, K. S., Stenseth, N. C., Jentoft, S., & Barrett, J. H. (2017). Ancient DNA reveals the Arctic origin of Viking Age cod from Haithabu, Germany. Proceedings of the National Academy of Sciences of the United States of America, 114(34), 9152–9157.
OpenUrl Abstract/FREE Full Text

[78] ↵
Star, B., Nederbragt, A. J., Jentoft, S., Grimholt, U., Malmstrøm, M., Gregers, T. F., Rounge, T. B., Paulsen, J., Solbakken, M. H., Sharma, A., Wetten, O. F., Lanzén, A., Winer, R., Knight, J., Vogel, J.-H., Aken, B., Andersen, O., Lagesen, K., Tooming-Klunderud, A., … Jakobsen, K. S. (2011). The genome sequence of Atlantic cod reveals a unique immune system. Nature, 477(7363), 207–210.
OpenUrl CrossRef PubMed Web of Science

[79] ↵
Suchan, T., Pitteloud, C., Gerasimova, N. S., Kostikova, A., Schmid, S., Arrigo, N., Pajkovic, M., Ronikier, M., & Alvarez, N. (2016). Hybridization Capture Using RAD Probes (hyRAD), a New Tool for Performing Genomic Analyses on Collection Specimens. PloS One, 11(3), e0151651.
OpenUrl CrossRef PubMed

[80] ↵
Tin, M. M.-Y., Economo, E. P., & Mikheyev, A. S. (2014). Sequencing degraded DNA from non-destructively sampled museum specimens for RAD-tagging and low-coverage shotgun phylogenetics. PloS One, 9(5), e96793.
OpenUrl CrossRef PubMed

[81] ↵
Todesco, M., Owens, G. L., Bercovich, N., Légaré, J.-S., Soudi, S., Burge, D. O., Huang, K., Ostevik, K. L., Drummond, E. B. M., Imerovski, I., Lande, K., Pascual-Robles, M. A., Nanavati, M., Jahani, M., Cheung, W., Staton, S. E., Muños, S., Nielsen, R., Donovan, L. A., … Rieseberg, L. H. (2020). Massive haplotypes underlie ecotypic differentiation in sunflowers. Nature, 584(7822), 602–607.
OpenUrl

[82] ↵
Tørresen, O. K., Star, B., Jentoft, S., Reinar, W. B., Grove, H., Miller, J. R., Walenz, B. P., Knight, J., Ekholm, J. M., Peluso, P., Edvardsen, R. B., Tooming-Klunderud, A., Skage, M., Lien, S., Jakobsen, K. S., & Nederbragt, A. J. (2017). An improved genome assembly uncovers prolific tandem repeats in Atlantic cod. BMC Genomics, 18(1), 95.
OpenUrl

[83] ↵
Twyford, A. D., & Friedman, J. (2015). Adaptive divergence in the monkey flower Mimulus guttatus is maintained by a chromosomal inversion. Evolution; International Journal of Organic Evolution, 69(6), 1476–1486.
OpenUrl CrossRef PubMed

[84] ↵
Van der Auwera, G. A., & O’Connor, B. D. (2020). Genomics in the Cloud: Using Docker, GATK, and WDL. In Terra (1st Edition). O’Reilly Media.

[85] ↵
van der Valk, T., Pečnerová, P., Díez-Del-Molino, D., Bergström, A., Oppenheimer, J., Hartmann, S., Xenikoudakis, G., Thomas, J. A., Dehasque, M., Sağlıcan, E., Fidan, F. R., Barnes, I., Liu, S., Somel, M., Heintzman, P. D., Nikolskiy, P., Shapiro, B., Skoglund, P., Hofreiter, M., … Dalén, L. (2021). Million-year-old DNA sheds light on the genomic history of mammoths. Nature, 591(7849), 265–269.
OpenUrl CrossRef PubMed

[86] ↵
Wellenreuther, M., & Bernatchez, L. (2018). Eco-Evolutionary Genomics of Chromosomal Inversions. Trends in Ecology & Evolution, 33(6), 427–440.
OpenUrl

[87] ↵
Zeng, C.-X., Hollingsworth, P. M., Yang, J., He, Z.-S., Zhang, Z.-R., Li, D.-Z., & Yang, J.-B. (2018). Genome skimming herbarium specimens for DNA barcoding and phylogenomics. Plant Methods, 14, 43.
OpenUrl