RT Journal Article SR Electronic T1 MultiPLIER: a transfer learning framework reveals systemic features of rare autoimmune disease JF bioRxiv FD Cold Spring Harbor Laboratory SP 395947 DO 10.1101/395947 A1 Jaclyn N. Taroni A1 Peter C. Grayson A1 Qiwen Hu A1 Sean Eddy A1 Matthias Kretzler A1 Peter A. Merkel A1 Casey S. Greene YR 2018 UL http://biorxiv.org/content/early/2018/08/20/395947.abstract AB Unsupervised machine learning methods provide a promising means to analyze and interpret large datasets. However, most datasets generated by individual researchers remain too small to fully benefit from these methods. In the case of rare diseases, there may be too few cases available, even when multiple studies are combined. We sought to determine whether or not machine learning models could be constructed from large public data compendia and then transferred to small datasets for subsequent analysis. We trained models using Pathway Level Information ExtractoR (PLIER) over datasets of different types and scales. Models constructed from large public datasets were i) more detailed than those constructed from individual datasets; ii) included features that aligned well to important biological factors; iii) transferrable to rare disease datasets where the models describe biological processes related to disease severity more effectively than models trained within those datasets.