RT Journal Article SR Electronic T1 Population modeling with machine learning can enhance measures of mental health JF bioRxiv FD Cold Spring Harbor Laboratory SP 2020.08.25.266536 DO 10.1101/2020.08.25.266536 A1 Kamalaker Dadi A1 Gaël Varoquaux A1 Josselin Houenou A1 Danilo Bzdok A1 Bertrand Thirion A1 Denis Engemann YR 2021 UL http://biorxiv.org/content/early/2021/09/24/2020.08.25.266536.abstract AB Background Biological aging is revealed by physical measures, e.g., DNA probes or brain scans. Instead, individual differences in mental function are explained by psychological constructs, e.g., intelligence or neuroticism. These constructs are typically assessed by tailored neuropsychological tests that build on expert judgement and require careful interpretation. Could machine learning on large samples from the general population be used to build proxy measures of these constructs that do not require human intervention? Results Here, we built proxy measures by applying machine learning to multimodal MR images and rich sociodemographic information from the largest biomedical cohort to date: the UK Biobank. Objective model comparisons revealed that all proxies captured the target constructs and were as useful as, and sometimes more useful than, the original measures for characterizing real-world health behavior (sleep, exercise, tobacco, alcohol consumption).
We observed this complementarity of proxy measures and original measures when modeling from brain signals or sociodemographic data, capturing multiple health-related constructs. Conclusions Population modeling with machine learning can derive measures of mental health from brain signals and questionnaire data, which may complement or even substitute for psychometric assessments in clinical populations. Key Points We applied machine learning to more than 10,000 individuals from the general population to define empirical approximations of health-related psychological measures that do not require human judgment. We found that machine learning enriched the given psychological measures via approximation from brain and sociodemographic data: the resulting proxy measures related as well or better to real-world health behavior than the original measures. Model comparisons showed that sociodemographic information contributed most to characterizing psychological traits beyond aging. Competing Interest Statement The authors have declared no competing interest.