TY - JOUR
T1 - Defining Major Depressive Disorder Cohorts Using the EHR: Multiple Phenotypes Based on ICD-9 Codes and Medication Orders
JF - bioRxiv
DO - 10.1101/227561
SP - 227561
AU - Wendy Marie Ingram
AU - Anna M. Baker
AU - Christopher R. Bauer
AU - Jason P. Brown
AU - Fernando S. Goes
AU - Sharon Larson
AU - Peter P. Zandi
Y1 - 2019/01/01
UR - http://biorxiv.org/content/early/2019/11/09/227561.abstract
N2 - Background Major Depressive Disorder (MDD) is one of the most common mental illnesses and a leading cause of disability worldwide. Electronic Health Records (EHR) allow researchers to conduct unprecedented large-scale observational studies investigating MDD, its disease development and its interaction with other health outcomes. While there exist methods to classify patients as clear cases or controls, given specific data requirements, there are presently no simple, generalizable, and validated methods to classify an entire patient population into varying groups of depression likelihood and severity. Methods We have tested a simple, pragmatic electronic phenotype algorithm that classifies patients into one of five mutually exclusive, ordinal groups, varying in depression phenotype.
Using data from an integrated health system on 278,026 patients from a 10-year study period, we have tested the convergent validity of these constructs using measures of external validation, including patterns of psychiatric prescriptions, symptom severity, indicators of suicidality, comorbidity, mortality, health care utilization, and polygenic risk scores for MDD. Results We found consistent patterns of increasing morbidity and/or adverse outcomes across the five groups, providing evidence for convergent validity. Limitations The study population is from a single rural integrated health system which is predominantly white, possibly limiting its generalizability. Conclusion Our study provides initial evidence that a simple algorithm, generalizable to most EHR data sets, provides categories with meaningful face and convergent validity that can be used for stratification of an entire patient population.
ER -