TY - JOUR T1 - find-tfbs: a tool to identify functional non-coding variants associated with complex human traits using open chromatin maps and phased whole-genome sequences JF - bioRxiv DO - 10.1101/2020.11.23.394296 SP - 2020.11.23.394296 AU - Sébastian Méric de Bellefon AU - Florian Thibord AU - Paul L. Auer AU - John Blangero AU - Zeynep H Coban-Akdemir AU - James S. Floyd AU - Myriam Fornage AU - Jill M. Johnsen AU - Leslie A. Lange AU - Joshua P. Lewis AU - Rasika A. Mathias AU - Caitlin P. McHugh AU - Jee-Young Moon AU - Alex P. Reiner AU - Adrienne M. Stilp AU - NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium AU - Guillaume Lettre Y1 - 2020/01/01 UR - http://biorxiv.org/content/early/2020/11/23/2020.11.23.394296.abstract N2 - Motivation Whole-genome DNA sequencing (WGS) enables the discovery of non-coding variants, but tools are lacking to prioritize the subset that functionally impacts human phenotypes. DNA sequence variants that disrupt or create transcription factor binding sites (TFBS) can modulate gene expression. find-tfbs efficiently scans phased WGS in large cohorts to identify and count TFBSs in regulatory sequences. This information can then be used in association testing to find putatively functional non-coding variants associated with complex human diseases or traits.Results We applied find-tfbs to discover functional non-coding variants associated with hematological traits in the NHLBI Trans-Omics for Precision Medicine (TOPMed) WGS dataset (Nmax=44,709). We identified >2000 associations at P<1×10−9, implicating specific blood cell-types, transcription factors and causal genes. The vast majority of these associations are captured by variants identified in large genome-wide association studies (GWAS) for blood-cell traits. find-tfbs is computationally efficient and robust, allowing for the rapid identification of non-coding variants associated with multiple human phenotypes in very large sample size.Availability https://github.com/Helkafen/find-tfbs and https://github.com/Helkafen/find-tfbs-demoContacts sebastian.meric.de.bellefon{at}umontreal.ca and guillaume.lettre{at}umontreal.caSupplementary information Supplementary data are available.Competing Interest StatementThe authors have declared no competing interest. ER -