RT Journal Article SR Electronic T1 Sequence effects on size, shape, and structural heterogeneity in Intrinsically Disordered Proteins JF bioRxiv FD Cold Spring Harbor Laboratory SP 427476 DO 10.1101/427476 A1 Upayan Baul A1 Debayan Chakraborty A1 Mauro L. Mugnai A1 John E. Straub A1 D. Thirumalai YR 2019 UL http://biorxiv.org/content/early/2019/02/10/427476.abstract AB Intrinsically disordered proteins (IDPs) lack well-defined three-dimensional structures, thus challenging the archetypal notion of structure-function relationships. Determining the ensemble of conformations that IDPs explore under physiological conditions is the first step towards understanding their diverse cellular functions. Here, we quantitatively characterize the structural features of IDPs as a function of sequence and length using coarse-grained simulations. For diverse IDP sequences, with the number of residues (NT) ranging from 24 to 441, our simulations not only reproduce the radii of gyration (Rg) obtained from experiments, but also predict the full scattering intensity profiles in very good agreement with Small Angle X-ray Scattering experiments. The Rg values are well-described by the standard Flory scaling law, Rg ∼ NT^ν, with ν ≈ 0.588, making it tempting to assert that IDPs behave as polymers in a good solvent. However, clustering analysis reveals that the menagerie of structures explored by IDPs is diverse, with the extent of heterogeneity being highly sequence-dependent, even though ensemble-averaged properties, such as the dependence of Rg on chain length, may suggest synthetic polymer-like behavior in a good solvent. For example, we show that for the highly charged Prothymosin-α, a substantial fraction of conformations is highly compact. Even if the sequence compositions are similar, as is the case for α-Synuclein and a truncated construct from the Tau protein, there are substantial differences in the conformational heterogeneity. 
Taken together, these observations imply that metrics based on net charge or related quantities alone cannot be used to anticipate the phases of IDPs, either in isolation or in complex with partner IDPs or RNA. Our work sets the stage for probing the interactions of IDPs with each other, with folded protein domains, or with partner RNAs, which are critical for describing the structures of stress granules and biomolecular condensates with important cellular functions.