PT - JOURNAL ARTICLE AU - James HR Farmery AU - Mike L Smith AU - NIHR BioResource - Rare Diseases AU - Andy G Lynch TI - Telomerecat: A ploidy-agnostic method for estimating telomere length from whole genome sequencing data AID - 10.1101/139972 DP - 2017 Jan 01 TA - bioRxiv PG - 139972 4099 - http://biorxiv.org/content/early/2017/05/19/139972.1.short 4100 - http://biorxiv.org/content/early/2017/05/19/139972.1.full AB - Telomere length is a risk factor in disease and the dynamics of telomere length are crucial to our understanding of cell replication and vitality. The proliferation of whole genome sequencing represents an unprecedented opportunity to glean new insights into telomere biology on a previously unimaginable scale. To this end, a number of approaches for estimating telomere length from whole-genome sequencing data have been proposed. Here we present Telomerecat, a novel approach to the estimation of telomere length. Previous methods have been dependent on the number of telomeres present in a cell being known, which may be problematic when analysing aneuploid cancer data and non-human samples. Telomerecat is designed to be agnostic to the number of telomeres present, making it suited for the purpose of estimating telomere length in cancer studies. Telomerecat also accounts for interstitial telomeric reads and presents a novel approach to dealing with sequencing errors. We show that Telomerecat performs well at telomere length estimation when compared to leading experimental and computational methods. Furthermore, we show that it detects expected patterns in longitudinal data, technical replicates, and cross-species comparisons. We also apply the method to a cancer cell data, uncovering an interesting relationship with the underlying telomerase genotype.