PT - JOURNAL ARTICLE AU - Marcio R. Pie AU - Marcos R. Bornschein AU - Luiz F. Ribeiro AU - Brant C. Faircloth AU - John E. McCormack TI - Phylogenomic species delimitation in microendemic frogs of the Brazilian Atlantic Forest AID - 10.1101/143735 DP - 2017 Jan 01 TA - bioRxiv PG - 143735 4099 - http://biorxiv.org/content/early/2017/05/30/143735.short 4100 - http://biorxiv.org/content/early/2017/05/30/143735.full AB - The advent of next-generation sequencing allows researchers to use large-scale datasets for species delimitation analyses, yet one can envision an inflection point where the added accuracy of including more loci does not offset the increased computational burden. One alternative to including all loci could be to prioritize the analysis of loci for which there is an expectation of high informativeness, such as those with higher numbers of parsimony-informative sites. Here, we explore the issue of species delimitation and locus selection with species from two anuran genera: Melanophryniscus (Bufonidae) and Brachycephalus (Brachycephalidae). Montane species in these genera have been isolated in sky islands across the southern Brazilian Atlantic Forest, which led to the formation of a number of microendemic species. To delimit species, we obtained genetic data using target enrichment of ultraconserved elements from 32 populations (13 for Melanophryniscus and 19 for Brachycephalus), and we were able to create datasets that included over 800 loci with no missing data. We ranked loci according to their corresponding number of parsimony-informative sites, and we performed species delimitation analyses using BPP in each genus based on the top 10, 20, 40, 80, 160, 320, and 640 loci. We also conducted several additional analyses using 10 randomly sampled datasets containing the same numbers of loci to discriminate the relative contribution of increasing the number of loci from prioritizing those with higher informativeness. We identified three types of node: nodes with either consistently high or low support regardless of the number of loci or their informativeness, and nodes that were initially poorly supported, but their support became stronger with more data. Adding more loci had a stronger impact on model support than prioritizing loci for their informativeness, but this effect was less apparent in datasets with more than 160 loci. When viewed across all sensitivity analyses, our results suggest that the current species richness in both genera might have been underestimated. In addition, our results provide useful guidelines to the use of different sampling strategies to carry out species delimitation with phylogenomic datasets.