Abstract
Ancient environmental DNA (aeDNA) is now commonly used in paleoecology and evolutionary ecology, yet due to difficulties in gaining sufficient genome coverage on individual species from metagenome data, its genetic perspectives remain largely uninvestigated. Hybridization capture has proven as an effective approach for enriching the DNA of target species, thus increasing the genome coverage of sequencing data and enabling population and evolutionary genetics analysis. However, to date there is no tool available for designing capture probe sets tailored for aeDNA based population genetics. Here we present eProbe, an efficient, flexible and easy-to-use program toolkit that provides a complete workflow for capture probe design, assessment and validation. By benchmarking a probe set for foxtail millet, an annual grass, made by the eProbe workflow, we demonstrate a remarkable increase of capturing efficiency, with the target taxa recovery rate improved by 577-fold, and the genome coverage achieved by soil capture-sequencing data even higher than data directly shotgun sequenced from the plant tissues. Probes that underwent our filtering panels show notably higher efficiency. The capture sequencing data enabled accurate population and evolutionary genetic analysis, by effectively inferring the fine-scale genetic structures and patterns, as well as the genotypes on functional genes.
Competing Interest Statement
The authors have declared no competing interest.