RT Journal Article SR Electronic T1 Burden analysis of missense variants in 1,330 disease-associated genes on 3D provides insights into the mutation effects JF bioRxiv FD Cold Spring Harbor Laboratory SP 693259 DO 10.1101/693259 A1 Sumaiya Iqbal A1 Jakob B. Jespersen A1 Eduardo Perez-Palma A1 Patrick May A1 David Hoksza A1 Henrike O. Heyne A1 Shehab S. Ahmed A1 Zaara T. Rifat A1 M. Sohel Rahman A1 Kasper Lage A1 Aarno Palotie A1 Jeffrey R. Cottrell A1 Florence F. Wagner A1 Mark J. Daly A1 Arthur J. Campbell A1 Dennis Lal YR 2020 UL http://biorxiv.org/content/early/2020/03/05/693259.abstract AB Interpretation of the colossal number of genetic variants identified from sequencing applications is one of the major bottlenecks in clinical genetics, with the inference of the effect of amino acid-substituting missense variants on protein structure and function being especially challenging. Here we evaluated the burden of amino acids affected in pathogenic variants (n=32,923) compared to the variants (n=164,915) from the general population in 1,330 disease-associated genes on forty protein features using over 14,000 experimentally-solved 3D structures. By analyzing the whole gene/variant set jointly, we identified 18 features associated with 3D mutational hotspots that are generally important for protein fitness and stability. Individual analyses performed for twenty-four protein functional classes further revealed 240 characteristics of mutational hotspots in total, including new associations recapitulating the sheer diversity across proteins essential structural regions. We demonstrated that the function-specific features of variants correspond to the readouts of mutagenesis experiments and positively correlate with clinically-interpreted pathogenic and benign missense variants. Finally, we made our results available through a web server to foster accessibility and downstream research. Our findings represent a crucial step towards translational genetics, from highlighting the impact of mutations on protein structure to rationalizing the pathogenicity of variants in terms of the perturbed molecular mechanisms.