TY - JOUR T1 - Improving the informativeness of Mendelian disease-derived pathogenicity scores for common disease JF - bioRxiv DO - 10.1101/2020.01.02.890657 SP - 2020.01.02.890657 AU - Samuel S. Kim AU - Kushal K. Dey AU - Omer Weissbrod AU - Carla Marquez-Luna AU - Steven Gazal AU - Alkes L. Price Y1 - 2020/01/01 UR - http://biorxiv.org/content/early/2020/07/09/2020.01.02.890657.abstract N2 - Despite considerable progress on pathogenicity scores prioritizing both coding and noncoding variants for Mendelian disease, little is known about the utility of these pathogenicity scores for common disease. Here, we sought to assess the informativeness of Mendelian diseasederived pathogenicity scores for common disease, and to improve upon existing scores. We first applied stratified LD score regression to assess the informativeness of annotations defined by top variants from published Mendelian disease-derived pathogenicity scores across 41 independent common diseases and complex traits (average N = 320K). Several of the resulting annotations were informative for common disease, even after conditioning on a broad set of coding, conserved, regulatory and LD-related annotations from the baseline-LD model. We then improved upon the published pathogenicity scores by developing AnnotBoost, a gradient boosting-based framework to impute and denoise pathogenicity scores using functional annotations from the baseline-LD model. AnnotBoost substantially increased the informativeness for common disease of both previously uninformative and previously informative pathogenicity scores, implying pervasive variant-level overlap between Mendelian disease and common disease. The boosted scores also produced significant improvements in heritability model fit and in classifying disease-associated, fine-mapped SNPs. Our boosted scores have high potential to improve candidate gene discovery and fine-mapping for common disease.Competing Interest StatementThe authors have declared no competing interest. ER -