RT Journal Article SR Electronic T1 A scalable approach for genome-wide inference of ancestral recombination graphs JF bioRxiv FD Cold Spring Harbor Laboratory SP 2024.08.31.610248 DO 10.1101/2024.08.31.610248 A1 Gunnarsson, Árni Freyr A1 Zhu, Jiazheng A1 Zhang, Brian C. A1 Tsangalidou, Zoi A1 Allmont, Alex A1 Palamara, Pier Francesco YR 2024 UL http://biorxiv.org/content/early/2024/09/02/2024.08.31.610248.abstract AB The ancestral recombination graph (ARG) is a graph-like structure that encodes a detailed genealogical history of a set of individuals along the genome. ARGs that are accurately reconstructed from genomic data have several downstream applications, but inference from data sets comprising millions of samples and variants remains computationally challenging. We introduce Threads, a threading-based method that significantly reduces the computational costs of ARG inference while retaining high accuracy. We apply Threads to infer the ARG of 487,409 genomes from the UK Biobank using ∼10 million high-quality imputed variants, reconstructing a detailed genealogical history of the samples while compressing the input genotype data. Additionally, we develop ARG-based imputation strategies that increase genotype imputation accuracy for ultra-rare variants (MAC ≤10) from UK Biobank exome sequencing data by 5-10%. We leverage ARGs inferred by Threads to detect associations with 52 quantitative traits in non-European UK Biobank samples, identifying 22.5% more signals than ARG-Needle. These analyses underscore the value of using computationally efficient genealogical modeling to improve and complement genotype imputation in large-scale genomic studies.Competing Interest StatementA.F.G. is an employee of deCODE genetics/Amgen; B.C.Z. is an employee of Adaptive Biotechnologies.