Medical consequences of pathogenic CNVs in adults: analysis of the UK Biobank

Karen Crawford; Matthew Bracher-Smith; David Owen; Kimberley M Kendall; Elliott Rees; Antonio F Pardiñas; Mark Einon; Valentina Escott-Price; James T R Walters; Michael C O’Donovan; Michael J Owen; George Kirov

doi:10.1136/jmedgenet-2018-105477

Article Text

PDF

Copy-number variation

Original article

Medical consequences of pathogenic CNVs in adults: analysis of the UK Biobank

Free

Karen Crawford,
Matthew Bracher-Smith,
David Owen,
Kimberley M Kendall,
Elliott Rees,
Antonio F Pardiñas,
Mark Einon,
Valentina Escott-Price,
James T R Walters,
Michael C O’Donovan,
Michael J Owen,
http://orcid.org/0000-0002-3427-3950George Kirov

MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK

Correspondence to Dr. George Kirov, MRC Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Hadyn Ellis Building,Maindy Road, Cardiff CF14 4XN, UK; kirov{at}cardiff.ac.uk

Abstract

Background Genomic CNVs increase the risk for early-onset neurodevelopmental disorders, but their impact on medical outcomes in later life is still poorly understood. The UK Biobank allows us to study the medical consequences of CNVs in middle and old age in half a million well-phenotyped adults.

Methods We analysed all Biobank participants for the presence of 54 CNVs associated with genomic disorders or clinical phenotypes, including their reciprocal deletions or duplications. After array quality control and exclusion of first-degree relatives, we compared 381 452 participants of white British or Irish origin who carried no CNVs with carriers of each of the 54 CNVs (ranging from 5 to 2843 persons). We used logistic regression analysis to estimate the risk of developing 58 common medical phenotypes (3132 comparisons).

Results and conclusions Many of the CNVs have profound effects on medical health and mortality, even in people who have largely escaped early neurodevelopmental outcomes. Forty-six CNV–phenotype associations were significant at a false discovery rate threshold of 0.1, all in the direction of increased risk. Known medical consequences of CNVs were confirmed, but most identified associations are novel. Deletions at 16p11.2 and 16p12.1 had the largest numbers of significantly associated phenotypes (seven each). Diabetes, hypertension, obesity and renal failure were affected by the highest numbers of CNVs. Our work should inform clinicians in planning and managing the medical care of CNV carriers.

cnv
UK biobank
medical

https://doi.org/10.1136/jmedgenet-2018-105477

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Introduction

Genomic CNVs are structural alterations to chromosomes of >1000 bases in length that can intersect multiple genes.1 Specific CNVs have been shown to increase risk for autism spectrum disorders,2 developmental delay and other neurodevelopmental disorders,3 and schizophrenia.4 Apart from their association with neurodevelopmental and psychiatric outcomes, these CNVs can lead to medical disorders. Several CNVs, for example, deletions at 22q11.2,5 have been extensively studied on hundreds of carriers and their medical consequences are well established. However, for CNVs with lower penetrance, very rare CNVs or several reciprocal deletions/duplications of known genomic disorders, the associated medical phenotypes have not been identified. Moreover, most research has been performed on children and young people referred to genetic clinics,3 6 creating a strong referral bias towards recording high rates of developmental delay, early-onset medical conditions and more adverse outcomes. Most CNVs display incomplete penetrance,7 resulting in apparently unaffected adult carriers in the general population. The rate of medical outcomes in later life of CNV carriers, or in the general population as a whole, has not been addressed in adequately powered studies to date.

The establishment of the UK Biobank presents a unique opportunity to examine the spectrum of medical outcomes of CNVs in middle-aged and old-aged people, as all half a million participants have been assessed with identical methods and blindly to their CNV status. The Biobank collects longitudinal data from hospital admissions, self-report, death certificates, cancer registries and primary care (general practitioners’) records. Here, we report on the medical consequences of carrier status for 54 CNVs that are recognised as associated with clinical phenotypes or genomic disorders,3 6 8 including their reciprocal deletions/duplications.

Methods

Participants

The UK Biobank recruited just over half a million people from the general population of the UK, using National Health Service patient registers, with no exclusion criteria. Participants have consented to provide personal and health information, urine, saliva and blood samples, and to have their DNA tested. We obtained approval from the UK Biobank to analyse the CNVs in project 14421: ‘Identifying the spectrum of biomedical traits in adults with pathogenic copy number variants (CNVs)’.

Participants were between 40 and 69 years of age at the time of recruitment between 2006 and 2010. As the lifetime prevalence of disorders often varies by ancestry, we restricted the analysis to those participants who declared themselves as ‘white British or Irish’: 421 268 participants who passed our genotyping quality control (QC) filters (CNV calling). After exclusion of first-degree relatives, 396 725 subjects were retained for analysis, 53.8% of whom were female. The mean age at the end of the current follow-up interval for medical outcomes (in 2016) was 64.7 years, SD=8.0 years.

CNV calling

Samples were genotyped at the Affymetrix Research Services Laboratory, Santa Clara, California, USA, on two arrays with 95% common content between them: around 50 000 samples were genotyped on the UK BiLEVE Array (807 411 probes) and the remaining samples on the UK Biobank Axiom Array (820 967 probes).9 We downloaded the anonymised genotypic data from the UK Biobank as 488 415 raw (CEL) files and analysed them with the methods we reported previously.10 Briefly, we generated normalised signal intensity data, genotype calls and confidences, using ~750 000 biallelic markers. These were then processed with PennCNV-Affy software.11 Individual samples were excluded if they had >30 CNVs, a waviness factor >0.03 or <−0.03, a call rate <96% or log R ratio SD >0.35. A total of 25 069 files were excluded after this QC (5.1%). Individual CNVs were excluded if they were covered by <10 probes or had a density coverage of less than one probe per 20 000 base pairs.

Choice of CNVs

We compiled a list of 92 CNVs in 47 genomic locations from two widely accepted sources that proposed largely overlapping sets of CNVs (online supplementary table 1 in supplementary material).3 6 The authors of these studies used information from databases, reviews and publications to produce lists of CNV regions that lead to genomic disorders, congenital malformations, neurodevelopmental or other clinical phenotypes. We refer to this set of 92 CNVs as ‘pathogenic’, consistent with the criteria proposed by the American College of Medical Genetics standards which describe as pathogenic those CNVs that have been documented as clinically significant in multiple peer-reviewed publications, even if penetrance and expressivity of the CNV are known to be variable.12 Many (but not all) have been shown to statistically increase the risk for developmental delay.3 Online supplementary table 1 lists the sources for selection and our criteria for inclusion in analysis. Several overlapping or adjacent CNVs listed as separate loci in the original publications were grouped together (eg, the ‘small’ and the ‘common’ 22q11.2 or the ‘small’ and the ‘large’ 16p13.11 deletions/duplications). As a rule, the reciprocal deletions/duplications of known genomic disorders were also included by the above authors and by us, in order to examine their medical consequences, even if the evidence for their pathogenicity has not been established.

Supplementary file 1

[jmedgenet-2018-105477-supp001.docx]

The criteria for calling CNVs that do not span the full critical region are given in online supplementary table 2. As a rule, a CNV had to intersect at least 50% of the critical region, marked as ‘Location (hg19)’, and intersect the relevant candidate genes, if known. For single gene CNVs, we required deletions to intersect at least one exon, and duplications to span the whole coding region, as the functional consequences of partial gene duplications can be unpredictable, while deletions of any part of the coding sequence of a gene are likely to act as loss-of-function mutations. We observed several loci, mostly telomeric, where a number of small CNVs were preferentially called on arrays that failed QC (marked ‘Unreliable’ in online supplementary table 1). We excluded these loci from analysis in order to avoid potential false-positives on this genotyping platform. We also excluded from analysis CNVs with fewer than five observations in the full sample, as being too rare for statistical analysis (marked ‘Rare’ in online supplementary table 1). The above filtering left 54 CNVs for analysis (table 1).

View this table:

Table 1

List of 54 CNVs analysed in this study

Choice of medical phenotypes

Data on health outcomes were collected from several sources. Self-declared illnesses were disclosed by participants at their initial assessments and coded into 445 distinct categories. Hospital discharge diagnoses (primary and secondary) and death certificates contain over 11 000 International Statistical Classification of Diseases and Related Health Problems, 10th revision (ICD-10) codes assigned to at least one participant. Analysing each individual code separately against 54 CNV loci would result in small numbers of participants with each code and fail to provide the statistical power needed to detect true associations. To reduce the dimensionality of the data and therefore increase power and provide more meaningful results, we grouped together discrete disease entities into broader disease groups. A participant was coded as a ‘case’ if he/she had a relevant diagnosis on at least one occasion, in any of the above sources of information. We gave preference to common conditions and grouped disorders into recognised categories, based on organ, system or aetiology, while excluding from the current analysis infectious diseases, injuries and neuropsychiatric disorders (the latter being analysed separately). The disease codes used to construct each phenotype group are listed in online supplementary table 3. For myocardial infarction and stroke, we used the ‘adjudicated’ data provided by the UK Biobank (data fields 42 000 to 42 013). Phenotype groups found in fewer than 2000 participants were not included. The final list of disease groups contains 58 entities, including ‘death during follow-up’ obtained from the death registries. Data on cancer were taken only from the UK cancer registries, as collected and supplied by the UK Biobank, as this is the most reliable and complete resource for cancers in the UK. For the current work we considered all malignant cancers as a single phenotype. As risk for cancer was not significantly affected by CNVs as a group, and because most individual cancers affected relatively small numbers of patients, we did not analyse the cancers further by subtype.

Supplementary file 2

[jmedgenet-2018-105477-supp002.pdf]

Statistical analysis

Analyses were performed in the statistical package R (version 3.3.2) using a Linux server. We examined the effect of the presence of a CNV on each medical phenotype with logistic regression analysis. As covariates, we used age, gender, array type (Axiom/BiLEVE), Townsend deprivation index (as a measure of the socioeconomic status) and the first 15 principal components from the genetic analysis, as provided by the UK Biobank. We used Firth’s bias-reduced logistic regression method,13 with the R library ‘logistf’, as it better handles cells with small numbers. We report the resulting p-values, ORs and 95% CIs for the ORs. We also report the uncorrected relative risk (RR), for having the phenotype in carriers of a specific CNV and non-carriers of any of the 54 CNVs. (RR is used for the additional images on our website (http://kirov.psycm.cf.ac.uk/), as it returns the more intuitive value of zero for associations with zero CNVs in cases.) Conservative Bonferroni correction for the testing of 54 CNVs×58 phenotypes gives a p<1.6×10⁻⁵ as a project-wide significance level. As many true-positive associations were expected, it is more appropriate to use the Benjamini-Hochberg false discovery rate (B-H FDR) for correction of p-values.14 Our preferred B-H FDR is 0.1.

Results and discussion

Quality control

The Affymetrix arrays produced reliable calls for the 54 CNVs. This is not surprising, given the large size and good probe coverage of these CNVs. This impression is confirmed by the remarkably similar CNVs frequencies, compared with those reported by us in previous control populations (online supplementary table 4 and supplementary figure 1). There were no apparent batch effects affecting the calls: the distribution of each CNV in the 106 batches produced no outliers from the expected Poisson distribution, after taking into account the multiple testing for 54 CNVs (online supplementary table 5). The best confirmation of the data quality would be the identification of well-known phenotypes associated with specific CNVs. This was indeed the case (table 2), as we identified, for example, the known associations of neuropathies and 17p12 deletions/duplications,15 obesity and deletions at 16p11.2 and 16p11.2 distal,16 17 diabetes and 17q12 deletions (also called ‘renal cysts and diabetes syndrome’).18 This increases our confidence that the newly identified associations are also real.

Supplementary file 3

[jmedgenet-2018-105477-supp003.pdf]

View this table:

Table 2

CNV/Phenotype associations significant at FDR=0.1

Effects of CNVs on medical phenotypes

Each of the 54 CNVs was tested for association with each of the 58 medical phenotypes (a total of 3132 tests). Results are presented as ORs for risk of developing the phenotype, corrected for age, sex and the other covariates detailed in the Methods section. All results are presented in online supplementary table 6 (grouped by CNV) and in online supplementary table 7 (grouped by phenotype).

Supplementary file 4

[jmedgenet-2018-105477-supp004.pdf]

Supplementary file 5

[jmedgenet-2018-105477-supp005.pdf]

The top 14 significant phenotype/CNV associations (table 2) survive a Bonferroni correction for 3132 tests (a project-wide significant p-value threshold of 1.6×10⁻⁵). This correction is overconservative, due to medical comorbidities (eg, people with diabetes also have increased rates of heart attacks, stroke and others). A more appropriate correction of statistical significance for this analysis is the B-H FDR.14 There are 46 CNV/phenotype comparisons that were significant at an FDR=0.1 (table 2). Most of these are novel associations and none are protective for the tested phenotypes (all have OR >1).

A total of 330 tests were nominally significant (at p<0.05), instead of the expected 157. Figure 1 shows the distribution of p-values, with a clear trend for over-representation below the p<0.1 level. This suggests that there are many more real associations, than those presented in table 2, but they cannot be identified with sufficient statistical significance in a sample of this size. Clinicians might therefore decide to also consider consequences of CNVs that do not survive our corrections.

Figure 1

Distribution of all 3132 p-values from CNV/phenotype associations. There are 330 nominally significant CNV/phenotype associations (p<0.05), instead of the 157 expected by chance.

Deletions at 16p11.2 and 16p12.1 had the largest numbers of significantly associated phenotypes (seven each). Deletions at 16p11.2 are a known risk factor for obesity.16 We now provide data showing that adult carriers also have a high incidence of diabetes, osteoarthritis and hypertension, possibly as expected consequences/comorbidities of obesity. Other associated phenotypes are not necessarily linked to a high body mass index (BMI), such as asthma, anaemia and renal problems, suggesting that this and other CNVs have pleiotropic effects (see conditional analysis below). This should be expected from CNVs intersecting multiple genes. This has already been shown for some large CNVs, for example, 22q11.2 deletions, where highly variable phenotypic presentations are the norm.5

We should point out that CNVs with higher numbers of significant results are not necessarily the most pathogenic ones, as significance depends also on CNV frequency, which is low for the most pathogenic CNVs in this population. Such CNVs are under-represented in the UK Biobank, as the participants are middle-aged and participation is subject to ‘healthy volunteer’ selection bias.19 For example, 22q11.2 deletions are highly pathogenic,5 but there were only 10 such carriers in the Biobank, instead of the expected ~100 (the rate of this deletion among newborns is ~1:4000).7 These 10 carriers were not sufficient to produce significant results at FDR=0.1, even for ORs>10 (online supplementary table 6). The more informative data from our research is on CNVs with lower penetrance, as they are more common.

The increased risk for medical morbidities or mortality observed in CNV carriers is unlikely to be due to the presence of early neurodevelopmental disorders or schizophrenia in carriers, as the UK Biobank population has largely escaped such conditions: only 34 of the 14 791 people who had one of the tested CNVs had schizophrenia, 17 had developmental delay and 4 had autism. Accidental death or death in epilepsy cannot account for the increased death rate in CNV carriers: out of the 504 CNV carriers who had died during follow-up, only 1 had ‘sudden unexpected death in epilepsy’ and another 4 had accidental deaths (motor/pedal cyclist acidents and falls from a high place). All death causes in CNV carriers, according to the death registries, are listed in online supplementary table 8.

Supplementary file 6

[jmedgenet-2018-105477-supp006.pdf]

Phenotypes most likely to be affected by CNVs

Diabetes, hypertension, obesity and renal failure were the phenotypes affected by the highest number of CNVs (table 2). The real number of affected phenotypes by the CNVs is probably much higher, as suggested in figure 1. We can provide further evidence for this, by testing the effect on the phenotypes in the group of pathogenic CNV carriers as a whole, thus substantially increasing the statistical power. After excluding the five relatively common CNVs : deletions and duplications at 15q11.2 and 2q13(NPHP1) and duplications at 15q13.3(CHRNA7) (as they would determine the results due to their high frequencies), the remaining 4782 carriers of 49 rare CNVs had significantly increased risk for developing 26 of the 58 tested phenotypes (figure 2). Hypertension, diabetes, cardiac, respiratory and renal disorders dominate the top results. These are common phenotypes that increase mortality. We do indeed observe an increased death rate among CNV carriers during the follow-up period of Biobank participants (death was the second most-significant phenotype, figure 2). The RR of death from each CNV is presented in figure 3, where the RRs are ordered by the statistical strength of the association (strongest p-value on the left). The vertical line demarcates the 12 CNVs that are nominally significantly associated with increased mortality (p<0.05). Not surprisingly, the more pathogenic CNVs were also associated with increased mortality. The top significant CNV was, unexpectedly, the relatively common duplication at 16p13.11, found in ~0.2% of the general population, an association that has not been outlined before.

Figure 2

ORs and 95% CI for the ORs for developing the 58 tested phenotypes in carriers of any one of the 49 rare pathogenic CNVs. The phenotypes are ordered by the strength of the p-value. COPD, chronic obstructive pulmonary disease; MI, myocardial infarction, WBC, white blood cell count.

Figure 3

Relative risk (RR) for dying during the follow-up to 2016 for carriers of the 54 CNVs. The CNVs are ordered by the strength of the significance (strongest result on the left, for 16p13.11 duplications). The vertical line demarcates the nominally significant results (p<0.05). Due to zero observations in cases for some CNVs, RRs are shown, instead of ORs.

Most of the reported associations are novel, although some of them can be explained as logical adult medical consequences of known, early-onset phenotypes, for example, obesity leading to diabetes, hypertension and increased cardiovascular mortality. In order to test this possibility, we performed a conditional analysis of three CNVs and two phenotypes, where obesity is most likely to account for some or all of the associations, by adding the BMI as a new covariate to the original analysis. This analysis amounted to 276 independent tests, to which we applied again the Benjamini-Hochberg FDR method to establish which associations remained significant at FDR=0.1, after controlling for BMI. Obesity is a well-established phenotype of 16p11.2 classic and distal deletions. The results and comparisons with the original analysis for all phenotypes and these two CNVs are shown in online supplementary tables 9 and 10 and supplementary figures 2 and 3. For 16p11.2 classic deletion, four of the six originally significant associations at FDR=0.1 remained significant (excluding obesity from these numbers). The changes in the ORs give a better global impression of the changes (online supplementary figure 2) and indicate that several associations are much reduced: diabetes type 1 and 2, hypertension, high cholesterol, gout and ostheoarthritis. This indicates that these disorders are, to a large extent, consequences of obesity. However, the ORs for anaemia and asthma did not change substantially. 16p11.2 distal deletions showed smaller reductions in the ORs (online supplementary figure 3) and four phenotypes (excluding obesity) remain significant at FDR=0.1. This pattern suggests that other factors also play a role in the causation of phenotypes in carriers of this CNV. Although deletions at 16p12.1 have not been an established cause for obesity, the pattern of results (table 2) also raised the question as to whether the multiple associated phenotypes could be explained by obesity. Therefore, we included this CNV in the conditional analysis (online supplementary table 11 and supplementary figure 4). Increased BMI appeared to play a smaller role in the causation of disease phenotypes for this CNV, with small changes in the ORs and the number of significant results.

Somewhat counterintuitively, the association with obesity does not get fully abolished when the analysis is corrected for BMI. There are, however, several factors that can explain this apparent anomaly. Most relevantly, the phenotype ‘obesity’ is not equivalent to high BMI. It is a hospital ICD-10 diagnosis, made on a small proportion of people who have a BMI>30. In fact, 24.3% of the Biobank population has a BMI>30, qualifying them for a diagnosis of obesity, but only 9.2% of them received this diagnosis. Furthermore, obesity is a categorical variable, while BMI is a continuous one, making them not equivalent from a statistical point of view, and therefore adjusting an analysis of one for another does not necessarily remove all evidence for association. The distribution of BMI values is very different in the three CNVs tested: 71.6% of 16p11.2 deletion carriers had a BMI>30, compared with 55.6% of 16p11.2 distal deletion carriers and 37% of 16p12.1 deletion carriers (online supplementary figure 5a–c). ICD-10 diagnosis of ‘obesity’ was given to correspondingly smaller proportions of carriers: 18.6%, 16.7% and 9.8%. These differences could explain why correcting for BMI does not lead to identical changes to the associations of the three CNVs.

We also tested whether increased BMI accounted for associations of diabetes type 2 or mortality with any of the 54 CNVs (online supplementary tables 12 and 13 and supplementary figures 6 and 7). As already reported above, this was the case for diabetes and the ‘classic’ and ‘distal’ 16p11.2 deletions. However, for 1q21.1and 2q13 duplications, 22q11.2 distal deletions and 17q12 deletions (also known as ‘renal cysts and diabetes syndrome’), the ORs for diabetes increased, suggesting that these CNVs have a more direct effect on the development of diabetes. In total, six CNVs were significantly associated with diabetes, after controlling for BMI (online supplementary table 12). The associations with mortality remained essentially unchanged after correction with BMI, with four significantly associated CNVs (online supplementary table 13) and very similar ORs (online supplementary figure 7), indicating that obesity is only one of many consequences that shortens the lives of CNV carriers.

Homozygous deletions and more than one CNV per person

Only four carriers of homozygous deletions were found, perhaps not surprisingly for this relatively healthy population. Three of these clustered in a single locus, 2q13 (11 086–11 098 kb), affecting the gene NPHP1. Homozygous deletions at this locus are known to cause the kidney disorder juvenile nephronophthisis. All three Biobank individuals with homozygous deletions at NPHP1 had renal failure (Fisher’s exact test p=9×10⁻⁶). We also examined the data for the occurrence of two CNVs in the same person. 264 people carried two of these CNVs, not significantly different from the 249 expected by chance. All combinations of two CNVs observed in the same person are presented in online supplementary table 14.

Monitoring of CNV carriers

Our results indicate a need for regular medical monitoring of apparently healthy carriers of specific pathogenic CNVs. Examples include monitoring for blood pressure, kidney function and glucose levels for carriers of 16p12.1 and 16p11.2 deletions, and for cancer in 3q29 duplication carriers. Apart from specific medical phenotypes, it appears that such carriers require enhanced medical monitoring in general, as their health can be affected in multiple ways. Our results should enable clinicians to better plan the medical management of CNV carriers.

Finally, the reported CNV morbidity map can provide researchers with another avenue for the elucidation of pathophysiological disease mechanisms.

Acknowledgments

This research has been conducted using the UK Biobank Resource under Application no: 14421.

References

1.↵
2. Lee C ,
3. Scherer SW
. The clinical context of copy number variation in the human genome. Expert Rev Mol Med 2010;12:e8.doi:10.1017/S1462399410001390
OpenUrl CrossRef PubMed
2.↵
2. Sanders SJ ,
3. He X ,
4. Willsey AJ ,
5. Ercan-Sencicek AG ,
6. Samocha KE ,
7. Cicek AE ,
8. Murtha MT ,
9. Bal VH ,
10. Bishop SL ,
11. Dong S ,
12. Goldberg AP ,
13. Jinlu C ,
14. Keaney JF ,
15. Klei L ,
16. Mandell JD ,
17. Moreno-De-Luca D ,
18. Poultney CS ,
19. Robinson EB ,
20. Smith L ,
21. Solli-Nowlan T ,
22. Su MY ,
23. Teran NA ,
24. Walker MF ,
25. Werling DM ,
26. Beaudet AL ,
27. Cantor RM ,
28. Fombonne E ,
29. Geschwind DH ,
30. Grice DE ,
31. Lord C ,
32. Lowe JK ,
33. Mane SM ,
34. Martin DM ,
35. Morrow EM ,
36. Talkowski ME ,
37. Sutcliffe JS ,
38. Walsh CA ,
39. Yu TW ,
40. Ledbetter DH ,
41. Martin CL ,
42. Cook EH ,
43. Buxbaum JD ,
44. Daly MJ ,
45. Devlin B ,
46. Roeder K ,
47. State MW
. Autism Sequencing Consortium. Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci. Neuron 2015;87:1215–33.doi:10.1016/j.neuron.2015.09.016
OpenUrl CrossRef PubMed
3.↵
2. Coe BP ,
3. Witherspoon K ,
4. Rosenfeld JA ,
5. van Bon BW ,
6. Vulto-van Silfhout AT ,
7. Bosco P ,
8. Friend KL ,
9. Baker C ,
10. Buono S ,
11. Vissers LE ,
12. Schuurs-Hoeijmakers JH ,
13. Hoischen A ,
14. Pfundt R ,
15. Krumm N ,
16. Carvill GL ,
17. Li D ,
18. Amaral D ,
19. Brown N ,
20. Lockhart PJ ,
21. Scheffer IE ,
22. Alberti A ,
23. Shaw M ,
24. Pettinato R ,
25. Tervo R ,
26. de Leeuw N ,
27. Reijnders MR ,
28. Torchia BS ,
29. Peeters H ,
30. O’Roak BJ ,
31. Fichera M ,
32. Hehir-Kwa JY ,
33. Shendure J ,
34. Mefford HC ,
35. Haan E ,
36. Gécz J ,
37. de Vries BB ,
38. Romano C ,
39. Eichler EE
. Refining analyses of copy number variation identifies specific genes associated with developmental delay. Nat Genet 2014;46:1063–71.doi:10.1038/ng.3092
OpenUrl CrossRef PubMed
4.↵
2. Rees E ,
3. Walters JT ,
4. Georgieva L ,
5. Isles AR ,
6. Chambert KD ,
7. Richards AL ,
8. Mahoney-Davies G ,
9. Legge SE ,
10. Moran JL ,
11. McCarroll SA ,
12. O’Donovan MC ,
13. Owen MJ ,
14. Kirov G
. Analysis of copy number variations at 15 schizophrenia-associated loci. Br J Psychiatry 2014;204:108–14.doi:10.1192/bjp.bp.113.131052
OpenUrl Abstract/FREE Full Text
5.↵
2. McDonald-McGinn DM ,
3. Sullivan KE ,
4. Marino B ,
5. Philip N ,
6. Swillen A ,
7. Vorstman JA ,
8. Zackai EH ,
9. Emanuel BS ,
10. Vermeesch JR ,
11. Morrow BE ,
12. Scambler PJ ,
13. Bassett AS
. 22q11.2 deletion syndrome. Nat Rev Dis Primers 2015;1:15071.doi:10.1038/nrdp.2015.71
OpenUrl CrossRef PubMed
6.↵
2. Dittwald P ,
3. Gambin T ,
4. Szafranski P ,
5. Li J ,
6. Amato S ,
7. Divon MY ,
8. Rodríguez Rojas LX ,
9. Elton LE ,
10. Scott DA ,
11. Schaaf CP ,
12. Torres-Martinez W ,
13. Stevens AK ,
14. Rosenfeld JA ,
15. Agadi S ,
16. Francis D ,
17. Kang SH ,
18. Breman A ,
19. Lalani SR ,
20. Bacino CA ,
21. Bi W ,
22. Milosavljevic A ,
23. Beaudet AL ,
24. Patel A ,
25. Shaw CA ,
26. Lupski JR ,
27. Gambin A ,
28. Cheung SW ,
29. Stankiewicz P
. NAHR-mediated copy-number variants in a clinical population: mechanistic insights into both genomic disorders and Mendelizing traits. Genome Res 2013;23:1395–409.doi:10.1101/gr.152454.112
OpenUrl Abstract/FREE Full Text
7.↵
2. Kirov G ,
3. Rees E ,
4. Walters JT ,
5. Escott-Price V ,
6. Georgieva L ,
7. Richards AL ,
8. Chambert KD ,
9. Davies G ,
10. Legge SE ,
11. Moran JL ,
12. McCarroll SA ,
13. O’Donovan MC ,
14. Owen MJ
. The penetrance of copy number variations for schizophrenia and developmental delay. Biol Psychiatry 2014;75:378–85.doi:10.1016/j.biopsych.2013.07.022
OpenUrl CrossRef PubMed Web of Science
8.↵
2. Cooper GM ,
3. Coe BP ,
4. Girirajan S ,
5. Rosenfeld JA ,
6. Vu TH ,
7. Baker C ,
8. Williams C ,
9. Stalker H ,
10. Hamid R ,
11. Hannig V ,
12. Abdel-Hamid H ,
13. Bader P ,
14. McCracken E ,
15. Niyazov D ,
16. Leppig K ,
17. Thiese H ,
18. Hummel M ,
19. Alexander N ,
20. Gorski J ,
21. Kussmann J ,
22. Shashi V ,
23. Johnson K ,
24. Rehder C ,
25. Ballif BC ,
26. Shaffer LG ,
27. Eichler EE
. A copy number variation morbidity map of developmental delay. Nat Genet 2011;43:838–46.doi:10.1038/ng.909
OpenUrl CrossRef PubMed
9.↵
2. Wain LV ,
3. Shrine N ,
4. Miller S ,
5. Jackson VE ,
6. Ntalla I ,
7. Soler Artigas M ,
8. Billington CK ,
9. Kheirallah AK ,
10. Allen R ,
11. Cook JP ,
12. Probert K ,
13. Obeidat M ,
14. Bossé Y ,
15. Hao K ,
16. Postma DS ,
17. Paré PD ,
18. Ramasamy A ,
19. Mägi R ,
20. Mihailov E ,
21. Reinmaa E ,
22. Melén E ,
23. O’Connell J ,
24. Frangou E ,
25. Delaneau O ,
26. Freeman C ,
27. Petkova D ,
28. McCarthy M ,
29. Sayers I ,
30. Deloukas P ,
31. Hubbard R ,
32. Pavord I ,
33. Hansell AL ,
34. Thomson NC ,
35. Zeggini E ,
36. Morris AP ,
37. Marchini J ,
38. Strachan DP ,
39. Tobin MD ,
40. Hall IP
. UK Brain Expression Consortium (UKBEC) OxGSK Consortium. Novel insights into the genetics of smoking behaviour, lung function, and chronic obstructive pulmonary disease (UK BiLEVE): a genetic association study in UK Biobank. Lancet Respir Med 2015;3:769–81.doi:10.1016/S2213-2600(15)00283-0
OpenUrl CrossRef PubMed
10.↵
2. Kendall KM ,
3. Rees E ,
4. Escott-Price V ,
5. Einon M ,
6. Thomas R ,
7. Hewitt J ,
8. O’Donovan MC ,
9. Owen MJ ,
10. Walters JTR ,
11. Kirov G
. Cognitive performance among carriers of pathogenic copy number variants: Analysis of 152,000 UK Biobank subjects. Biol Psychiatry 2017;82:103–10.doi:10.1016/j.biopsych.2016.08.014
OpenUrl
11.↵
2. Wang K ,
3. Li M ,
4. Hadley D ,
5. Liu R ,
6. Glessner J ,
7. Grant SF ,
8. Hakonarson H ,
9. Bucan M
. PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res 2007;17:1665–74.doi:10.1101/gr.6861907
OpenUrl Abstract/FREE Full Text
12.↵
2. Kearney HM ,
3. Thorland EC ,
4. Brown KK ,
5. Quintero-Rivera F ,
6. South ST
. Working group of the American college of medical genetics laboratory quality assurance committee. American college of medical genetics standards and guidelines for interpretation and reporting of postnatal constitutional copy number variants. Genet Med 2011;13:680–5.
OpenUrl CrossRef PubMed
13.↵
2. Firth D
. Bias reduction of maximum likelihood estimates. Biometrika 1993;80:27–38.doi:10.1093/biomet/80.1.27
OpenUrl CrossRef Web of Science
14.↵
2. Benjamini Y ,
3. Hochberg Y
. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc B 1995;57:289–300.
OpenUrl
15.↵
2. Lupski JR ,
3. Wise CA ,
4. Kuwano A ,
5. Pentao L ,
6. Parke JT ,
7. Glaze DG ,
8. Ledbetter DH ,
9. Greenberg F ,
10. Patel PI
. Gene dosage is a mechanism for Charcot-Marie-Tooth disease type 1A. Nat Genet 1992;1:29–33.doi:10.1038/ng0492-29
OpenUrl CrossRef PubMed Web of Science
16.↵
2. Jacquemont S ,
3. Reymond A ,
4. Zufferey F ,
5. Harewood L ,
6. Walters RG ,
7. Kutalik Z ,
8. Martinet D ,
9. Shen Y ,
10. Valsesia A ,
11. Beckmann ND ,
12. Thorleifsson G ,
13. Belfiore M ,
14. Bouquillon S ,
15. Campion D ,
16. de Leeuw N ,
17. de Vries BB ,
18. Esko T ,
19. Fernandez BA ,
20. Fernández-Aranda F ,
21. Fernández-Real JM ,
22. Gratacòs M ,
23. Guilmatre A ,
24. Hoyer J ,
25. Jarvelin MR ,
26. Kooy RF ,
27. Kurg A ,
28. Le Caignec C ,
29. Männik K ,
30. Platt OS ,
31. Sanlaville D ,
32. Van Haelst MM ,
33. Villatoro Gomez S ,
34. Walha F ,
35. Wu BL ,
36. Yu Y ,
37. Aboura A ,
38. Addor MC ,
39. Alembik Y ,
40. Antonarakis SE ,
41. Arveiler B ,
42. Barth M ,
43. Bednarek N ,
44. Béna F ,
45. Bergmann S ,
46. Beri M ,
47. Bernardini L ,
48. Blaumeiser B ,
49. Bonneau D ,
50. Bottani A ,
51. Boute O ,
52. Brunner HG ,
53. Cailley D ,
54. Callier P ,
55. Chiesa J ,
56. Chrast J ,
57. Coin L ,
58. Coutton C ,
59. Cuisset JM ,
60. Cuvellier JC ,
61. David A ,
62. de Freminville B ,
63. Delobel B ,
64. Delrue MA ,
65. Demeer B ,
66. Descamps D ,
67. Didelot G ,
68. Dieterich K ,
69. Disciglio V ,
70. Doco-Fenzy M ,
71. Drunat S ,
72. Duban-Bedu B ,
73. Dubourg C ,
74. El-Sayed Moustafa JS ,
75. Elliott P ,
76. Faas BH ,
77. Faivre L ,
78. Faudet A ,
79. Fellmann F ,
80. Ferrarini A ,
81. Fisher R ,
82. Flori E ,
83. Forer L ,
84. Gaillard D ,
85. Gerard M ,
86. Gieger C ,
87. Gimelli S ,
88. Gimelli G ,
89. Grabe HJ ,
90. Guichet A ,
91. Guillin O ,
92. Hartikainen AL ,
93. Heron D ,
94. Hippolyte L ,
95. Holder M ,
96. Homuth G ,
97. Isidor B ,
98. Jaillard S ,
99. Jaros Z ,
100. Jiménez-Murcia S ,
101. Helas GJ ,
102. Jonveaux P ,
103. Kaksonen S ,
104. Keren B ,
105. Kloss-Brandstätter A ,
106. Knoers NV ,
107. Koolen DA ,
108. Kroisel PM ,
109. Kronenberg F ,
110. Labalme A ,
111. Landais E ,
112. Lapi E ,
113. Layet V ,
114. Legallic S ,
115. Leheup B ,
116. Leube B ,
117. Lewis S ,
118. Lucas J ,
119. MacDermot KD ,
120. Magnusson P ,
121. Marshall C ,
122. Mathieu-Dramard M ,
123. McCarthy MI ,
124. Meitinger T ,
125. Mencarelli MA ,
126. Merla G ,
127. Moerman A ,
128. Mooser V ,
129. Morice-Picard F ,
130. Mucciolo M ,
131. Nauck M ,
132. Ndiaye NC ,
133. Nordgren A ,
134. Pasquier L ,
135. Petit F ,
136. Pfundt R ,
137. Plessis G ,
138. Rajcan-Separovic E ,
139. Ramelli GP ,
140. Rauch A ,
141. Ravazzolo R ,
142. Reis A ,
143. Renieri A ,
144. Richart C ,
145. Ried JS ,
146. Rieubland C ,
147. Roberts W ,
148. Roetzer KM ,
149. Rooryck C ,
150. Rossi M ,
151. Saemundsen E ,
152. Satre V ,
153. Schurmann C ,
154. Sigurdsson E ,
155. Stavropoulos DJ ,
156. Stefansson H ,
157. Tengström C ,
158. Thorsteinsdóttir U ,
159. Tinahones FJ ,
160. Touraine R ,
161. Vallée L ,
162. van Binsbergen E ,
163. Van der Aa N ,
164. Vincent-Delorme C ,
165. Visvikis-Siest S ,
166. Vollenweider P ,
167. Völzke H ,
168. Vulto-van Silfhout AT ,
169. Waeber G ,
170. Wallgren-Pettersson C ,
171. Witwicki RM ,
172. Zwolinksi S ,
173. Andrieux J ,
174. Estivill X ,
175. Gusella JF ,
176. Gustafsson O ,
177. Metspalu A ,
178. Scherer SW ,
179. Stefansson K ,
180. Blakemore AI ,
181. Beckmann JS ,
182. Froguel P
. Mirror extreme BMI phenotypes associated with gene dosage at the chromosome 16p11.2 locus. Nature 2011;478:97–102.doi:10.1038/nature10406
OpenUrl CrossRef PubMed Web of Science
17.↵
2. Bachmann-Gagescu R ,
3. Mefford HC ,
4. Cowan C ,
5. Glew GM ,
6. Hing AV ,
7. Wallace S ,
8. Bader PI ,
9. Hamati A ,
10. Reitnauer PJ ,
11. Smith R ,
12. Stockton DW ,
13. Muhle H ,
14. Helbig I ,
15. Eichler EE ,
16. Ballif BC ,
17. Rosenfeld J ,
18. Tsuchiya KD
. Recurrent 200-kb deletions of 16p11.2 that include the SH2B1 gene are associated with developmental delay and obesity. Genet Med 2010;12:641–7.doi:10.1097/GIM.0b013e3181ef4286
OpenUrl CrossRef PubMed
18.↵
2. Mefford HC ,
3. Clauin S ,
4. Sharp AJ ,
5. Moller RS ,
6. Ullmann R ,
7. Kapur R ,
8. Pinkel D ,
9. Cooper GM ,
10. Ventura M ,
11. Ropers HH ,
12. Tommerup N ,
13. Eichler EE ,
14. Bellanne-Chantelot C
. Recurrent reciprocal genomic rearrangements of 17q12 are associated with renal disease, diabetes, and epilepsy. Am J Hum Genet 2007;81:1057–69.doi:10.1086/522591
OpenUrl CrossRef PubMed Web of Science
19.↵
2. Fry A ,
3. Littlejohns TJ ,
4. Sudlow C ,
5. Doherty N ,
6. Adamska L ,
7. Sprosen T ,
8. Collins R ,
9. Allen NE
. Comparison of sociodemographic and health-related characteristics of uk biobank participants with those of the general population. Am J Epidemiol 2017;186:1026–34.doi:10.1093/aje/kwx246
OpenUrl CrossRef PubMed

Footnotes

Contributors KC, MB-S and DO analysed the data; KMK, ER and MB-S called the CNVs; AFP and ME contributed to the bioinformatics and website design; VE-P, JTRW, MCO’D and MJO contributed to the statistical analysis; JTRW, MCO’D and MJO edited the paper; GK conceived the project, drafted the paper and took part in all analysis steps.
Funding The work at Cardiff University was funded by the Medical Research Council (MRC) Centre Grant (MR/L010305/1) and Programme Grant (G0800509).
Competing interests None declared.
Patient consent Not required.
Ethics approval Ethical approval for the study was granted by the North West multi-centre ethics committee.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement All CNV calls will be made available to the UK Biobank, in accordance with their requirements, within 6 months of the first publication of results.

[1] 1.↵

Lee C ,
Scherer SW
. The clinical context of copy number variation in the human genome. Expert Rev Mol Med 2010;12:e8.doi:10.1017/S1462399410001390
OpenUrl CrossRef PubMed

[3] Lee C ,

[4] Scherer SW

[5] 2.↵

Sanders SJ ,
He X ,
Willsey AJ ,
Ercan-Sencicek AG ,
Samocha KE ,
Cicek AE ,
Murtha MT ,
Bal VH ,
Bishop SL ,
Dong S ,
Goldberg AP ,
Jinlu C ,
Keaney JF ,
Klei L ,
Mandell JD ,
Moreno-De-Luca D ,
Poultney CS ,
Robinson EB ,
Smith L ,
Solli-Nowlan T ,
Su MY ,
Teran NA ,
Walker MF ,
Werling DM ,
Beaudet AL ,
Cantor RM ,
Fombonne E ,
Geschwind DH ,
Grice DE ,
Lord C ,
Lowe JK ,
Mane SM ,
Martin DM ,
Morrow EM ,
Talkowski ME ,
Sutcliffe JS ,
Walsh CA ,
Yu TW ,
Ledbetter DH ,
Martin CL ,
Cook EH ,
Buxbaum JD ,
Daly MJ ,
Devlin B ,
Roeder K ,
State MW
. Autism Sequencing Consortium. Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci. Neuron 2015;87:1215–33.doi:10.1016/j.neuron.2015.09.016
OpenUrl CrossRef PubMed

[7] Sanders SJ ,

[8] He X ,

[9] Willsey AJ ,

[10] Ercan-Sencicek AG ,

[11] Samocha KE ,

[12] Cicek AE ,

[13] Murtha MT ,

[14] Bal VH ,

[15] Bishop SL ,

[16] Dong S ,

[17] Goldberg AP ,

[18] Jinlu C ,

[19] Keaney JF ,

[20] Klei L ,

[21] Mandell JD ,

[22] Moreno-De-Luca D ,

[23] Poultney CS ,

[24] Robinson EB ,

[25] Smith L ,

[26] Solli-Nowlan T ,

[27] Su MY ,

[28] Teran NA ,

[29] Walker MF ,

[30] Werling DM ,

[31] Beaudet AL ,

[32] Cantor RM ,

[33] Fombonne E ,

[34] Geschwind DH ,

[35] Grice DE ,

[36] Lord C ,

[37] Lowe JK ,

[38] Mane SM ,

[39] Martin DM ,

[40] Morrow EM ,

[41] Talkowski ME ,

[42] Sutcliffe JS ,

[43] Walsh CA ,

[44] Yu TW ,

[45] Ledbetter DH ,

[46] Martin CL ,

[47] Cook EH ,

[48] Buxbaum JD ,

[49] Daly MJ ,

[50] Devlin B ,

[51] Roeder K ,

[52] State MW

[53] 3.↵

Coe BP ,
Witherspoon K ,
Rosenfeld JA ,
van Bon BW ,
Vulto-van Silfhout AT ,
Bosco P ,
Friend KL ,
Baker C ,
Buono S ,
Vissers LE ,
Schuurs-Hoeijmakers JH ,
Hoischen A ,
Pfundt R ,
Krumm N ,
Carvill GL ,
Li D ,
Amaral D ,
Brown N ,
Lockhart PJ ,
Scheffer IE ,
Alberti A ,
Shaw M ,
Pettinato R ,
Tervo R ,
de Leeuw N ,
Reijnders MR ,
Torchia BS ,
Peeters H ,
O’Roak BJ ,
Fichera M ,
Hehir-Kwa JY ,
Shendure J ,
Mefford HC ,
Haan E ,
Gécz J ,
de Vries BB ,
Romano C ,
Eichler EE
. Refining analyses of copy number variation identifies specific genes associated with developmental delay. Nat Genet 2014;46:1063–71.doi:10.1038/ng.3092
OpenUrl CrossRef PubMed

[55] Coe BP ,

[56] Witherspoon K ,

[57] Rosenfeld JA ,

[58] van Bon BW ,

[59] Vulto-van Silfhout AT ,

[60] Bosco P ,

[61] Friend KL ,

[62] Baker C ,

[63] Buono S ,

[64] Vissers LE ,

[65] Schuurs-Hoeijmakers JH ,

[66] Hoischen A ,

[67] Pfundt R ,

[68] Krumm N ,

[69] Carvill GL ,

[70] Li D ,

[71] Amaral D ,

[72] Brown N ,

[73] Lockhart PJ ,

[74] Scheffer IE ,

[75] Alberti A ,

[76] Shaw M ,

[77] Pettinato R ,

[78] Tervo R ,

[79] de Leeuw N ,

[80] Reijnders MR ,

[81] Torchia BS ,

[82] Peeters H ,

[83] O’Roak BJ ,

[84] Fichera M ,

[85] Hehir-Kwa JY ,

[86] Shendure J ,

[87] Mefford HC ,

[88] Haan E ,

[89] Gécz J ,

[90] de Vries BB ,

[91] Romano C ,

[92] Eichler EE

[93] 4.↵

Rees E ,
Walters JT ,
Georgieva L ,
Isles AR ,
Chambert KD ,
Richards AL ,
Mahoney-Davies G ,
Legge SE ,
Moran JL ,
McCarroll SA ,
O’Donovan MC ,
Owen MJ ,
Kirov G
. Analysis of copy number variations at 15 schizophrenia-associated loci. Br J Psychiatry 2014;204:108–14.doi:10.1192/bjp.bp.113.131052
OpenUrl Abstract/FREE Full Text

[95] Rees E ,

[96] Walters JT ,

[97] Georgieva L ,

[98] Isles AR ,

[99] Chambert KD ,

[100] Richards AL ,

[101] Mahoney-Davies G ,

[102] Legge SE ,

[103] Moran JL ,

[104] McCarroll SA ,

[105] O’Donovan MC ,

[106] Owen MJ ,

[107] Kirov G

[108] 5.↵

McDonald-McGinn DM ,
Sullivan KE ,
Marino B ,
Philip N ,
Swillen A ,
Vorstman JA ,
Zackai EH ,
Emanuel BS ,
Vermeesch JR ,
Morrow BE ,
Scambler PJ ,
Bassett AS
. 22q11.2 deletion syndrome. Nat Rev Dis Primers 2015;1:15071.doi:10.1038/nrdp.2015.71
OpenUrl CrossRef PubMed

[110] McDonald-McGinn DM ,

[111] Sullivan KE ,

[112] Marino B ,

[113] Philip N ,

[114] Swillen A ,

[115] Vorstman JA ,

[116] Zackai EH ,

[117] Emanuel BS ,

[118] Vermeesch JR ,

[119] Morrow BE ,

[120] Scambler PJ ,

[121] Bassett AS

[122] 6.↵

Dittwald P ,
Gambin T ,
Szafranski P ,
Li J ,
Amato S ,
Divon MY ,
Rodríguez Rojas LX ,
Elton LE ,
Scott DA ,
Schaaf CP ,
Torres-Martinez W ,
Stevens AK ,
Rosenfeld JA ,
Agadi S ,
Francis D ,
Kang SH ,
Breman A ,
Lalani SR ,
Bacino CA ,
Bi W ,
Milosavljevic A ,
Beaudet AL ,
Patel A ,
Shaw CA ,
Lupski JR ,
Gambin A ,
Cheung SW ,
Stankiewicz P
. NAHR-mediated copy-number variants in a clinical population: mechanistic insights into both genomic disorders and Mendelizing traits. Genome Res 2013;23:1395–409.doi:10.1101/gr.152454.112
OpenUrl Abstract/FREE Full Text

[124] Dittwald P ,

[125] Gambin T ,

[126] Szafranski P ,

[127] Li J ,

[128] Amato S ,

[129] Divon MY ,

[130] Rodríguez Rojas LX ,

[131] Elton LE ,

[132] Scott DA ,

[133] Schaaf CP ,

[134] Torres-Martinez W ,

[135] Stevens AK ,

[136] Rosenfeld JA ,

[137] Agadi S ,

[138] Francis D ,

[139] Kang SH ,

[140] Breman A ,

[141] Lalani SR ,

[142] Bacino CA ,

[143] Bi W ,

[144] Milosavljevic A ,

[145] Beaudet AL ,

[146] Patel A ,

[147] Shaw CA ,

[148] Lupski JR ,

[149] Gambin A ,

[150] Cheung SW ,

[151] Stankiewicz P

[152] 7.↵

Kirov G ,
Rees E ,
Walters JT ,
Escott-Price V ,
Georgieva L ,
Richards AL ,
Chambert KD ,
Davies G ,
Legge SE ,
Moran JL ,
McCarroll SA ,
O’Donovan MC ,
Owen MJ
. The penetrance of copy number variations for schizophrenia and developmental delay. Biol Psychiatry 2014;75:378–85.doi:10.1016/j.biopsych.2013.07.022
OpenUrl CrossRef PubMed Web of Science

[154] Kirov G ,

[155] Rees E ,

[156] Walters JT ,

[157] Escott-Price V ,

[158] Georgieva L ,

[159] Richards AL ,

[160] Chambert KD ,

[161] Davies G ,

[162] Legge SE ,

[163] Moran JL ,

[164] McCarroll SA ,

[165] O’Donovan MC ,

[166] Owen MJ

[167] 8.↵

Cooper GM ,
Coe BP ,
Girirajan S ,
Rosenfeld JA ,
Vu TH ,
Baker C ,
Williams C ,
Stalker H ,
Hamid R ,
Hannig V ,
Abdel-Hamid H ,
Bader P ,
McCracken E ,
Niyazov D ,
Leppig K ,
Thiese H ,
Hummel M ,
Alexander N ,
Gorski J ,
Kussmann J ,
Shashi V ,
Johnson K ,
Rehder C ,
Ballif BC ,
Shaffer LG ,
Eichler EE
. A copy number variation morbidity map of developmental delay. Nat Genet 2011;43:838–46.doi:10.1038/ng.909
OpenUrl CrossRef PubMed

[169] Cooper GM ,

[170] Coe BP ,

[171] Girirajan S ,

[172] Rosenfeld JA ,

[173] Vu TH ,

[174] Baker C ,

[175] Williams C ,

[176] Stalker H ,

[177] Hamid R ,

[178] Hannig V ,

[179] Abdel-Hamid H ,

[180] Bader P ,

[181] McCracken E ,

[182] Niyazov D ,

[183] Leppig K ,

[184] Thiese H ,

[185] Hummel M ,

[186] Alexander N ,

[187] Gorski J ,

[188] Kussmann J ,

[189] Shashi V ,

[190] Johnson K ,

[191] Rehder C ,

[192] Ballif BC ,

[193] Shaffer LG ,

[194] Eichler EE

[195] 9.↵

Wain LV ,
Shrine N ,
Miller S ,
Jackson VE ,
Ntalla I ,
Soler Artigas M ,
Billington CK ,
Kheirallah AK ,
Allen R ,
Cook JP ,
Probert K ,
Obeidat M ,
Bossé Y ,
Hao K ,
Postma DS ,
Paré PD ,
Ramasamy A ,
Mägi R ,
Mihailov E ,
Reinmaa E ,
Melén E ,
O’Connell J ,
Frangou E ,
Delaneau O ,
Freeman C ,
Petkova D ,
McCarthy M ,
Sayers I ,
Deloukas P ,
Hubbard R ,
Pavord I ,
Hansell AL ,
Thomson NC ,
Zeggini E ,
Morris AP ,
Marchini J ,
Strachan DP ,
Tobin MD ,
Hall IP
. UK Brain Expression Consortium (UKBEC) OxGSK Consortium. Novel insights into the genetics of smoking behaviour, lung function, and chronic obstructive pulmonary disease (UK BiLEVE): a genetic association study in UK Biobank. Lancet Respir Med 2015;3:769–81.doi:10.1016/S2213-2600(15)00283-0
OpenUrl CrossRef PubMed

[197] Wain LV ,

[198] Shrine N ,

[199] Miller S ,

[200] Jackson VE ,

[201] Ntalla I ,

[202] Soler Artigas M ,

[203] Billington CK ,

[204] Kheirallah AK ,

[205] Allen R ,

[206] Cook JP ,

[207] Probert K ,

[208] Obeidat M ,

[209] Bossé Y ,

[210] Hao K ,

[211] Postma DS ,

[212] Paré PD ,

[213] Ramasamy A ,

[214] Mägi R ,

[215] Mihailov E ,

[216] Reinmaa E ,

[217] Melén E ,

[218] O’Connell J ,

[219] Frangou E ,

[220] Delaneau O ,

[221] Freeman C ,

[222] Petkova D ,

[223] McCarthy M ,

[224] Sayers I ,

[225] Deloukas P ,

[226] Hubbard R ,

[227] Pavord I ,

[228] Hansell AL ,

[229] Thomson NC ,

[230] Zeggini E ,

[231] Morris AP ,

[232] Marchini J ,

[233] Strachan DP ,

[234] Tobin MD ,

[235] Hall IP

[236] 10.↵

Kendall KM ,
Rees E ,
Escott-Price V ,
Einon M ,
Thomas R ,
Hewitt J ,
O’Donovan MC ,
Owen MJ ,
Walters JTR ,
Kirov G
. Cognitive performance among carriers of pathogenic copy number variants: Analysis of 152,000 UK Biobank subjects. Biol Psychiatry 2017;82:103–10.doi:10.1016/j.biopsych.2016.08.014
OpenUrl

[238] Kendall KM ,

[239] Rees E ,

[240] Escott-Price V ,

[241] Einon M ,

[242] Thomas R ,

[243] Hewitt J ,

[244] O’Donovan MC ,

[245] Owen MJ ,

[246] Walters JTR ,

[247] Kirov G

[248] 11.↵

Wang K ,
Li M ,
Hadley D ,
Liu R ,
Glessner J ,
Grant SF ,
Hakonarson H ,
Bucan M
. PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res 2007;17:1665–74.doi:10.1101/gr.6861907
OpenUrl Abstract/FREE Full Text

[250] Wang K ,

[251] Li M ,

[252] Hadley D ,

[253] Liu R ,

[254] Glessner J ,

[255] Grant SF ,

[256] Hakonarson H ,

[257] Bucan M

[258] 12.↵

Kearney HM ,
Thorland EC ,
Brown KK ,
Quintero-Rivera F ,
South ST
. Working group of the American college of medical genetics laboratory quality assurance committee. American college of medical genetics standards and guidelines for interpretation and reporting of postnatal constitutional copy number variants. Genet Med 2011;13:680–5.
OpenUrl CrossRef PubMed

[260] Kearney HM ,

[261] Thorland EC ,

[262] Brown KK ,

[263] Quintero-Rivera F ,

[264] South ST

[265] 13.↵

Firth D
. Bias reduction of maximum likelihood estimates. Biometrika 1993;80:27–38.doi:10.1093/biomet/80.1.27
OpenUrl CrossRef Web of Science

[267] Firth D

[268] 14.↵

Benjamini Y ,
Hochberg Y
. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc B 1995;57:289–300.
OpenUrl

[270] Benjamini Y ,

[271] Hochberg Y

[272] 15.↵

Lupski JR ,
Wise CA ,
Kuwano A ,
Pentao L ,
Parke JT ,
Glaze DG ,
Ledbetter DH ,
Greenberg F ,
Patel PI
. Gene dosage is a mechanism for Charcot-Marie-Tooth disease type 1A. Nat Genet 1992;1:29–33.doi:10.1038/ng0492-29
OpenUrl CrossRef PubMed Web of Science

[274] Lupski JR ,

[275] Wise CA ,

[276] Kuwano A ,

[277] Pentao L ,

[278] Parke JT ,

[279] Glaze DG ,

[280] Ledbetter DH ,

[281] Greenberg F ,

[282] Patel PI

[285] Jacquemont S ,

[286] Reymond A ,

[287] Zufferey F ,

[288] Harewood L ,

[289] Walters RG ,

[290] Kutalik Z ,

[291] Martinet D ,

[292] Shen Y ,

[293] Valsesia A ,

[294] Beckmann ND ,

[295] Thorleifsson G ,

[296] Belfiore M ,

[297] Bouquillon S ,

[298] Campion D ,

[299] de Leeuw N ,

[300] de Vries BB ,

[301] Esko T ,

[302] Fernandez BA ,

[303] Fernández-Aranda F ,

[304] Fernández-Real JM ,

[305] Gratacòs M ,

[306] Guilmatre A ,

[307] Hoyer J ,

[308] Jarvelin MR ,

[309] Kooy RF ,

[310] Kurg A ,

[311] Le Caignec C ,

[312] Männik K ,

[313] Platt OS ,

[314] Sanlaville D ,

[315] Van Haelst MM ,

[316] Villatoro Gomez S ,

[317] Walha F ,

[318] Wu BL ,

[319] Yu Y ,

[320] Aboura A ,

[321] Addor MC ,

[322] Alembik Y ,

[323] Antonarakis SE ,

[324] Arveiler B ,

[325] Barth M ,

[326] Bednarek N ,

[327] Béna F ,

[328] Bergmann S ,

[329] Beri M ,

[330] Bernardini L ,

[331] Blaumeiser B ,

[332] Bonneau D ,

[333] Bottani A ,

[334] Boute O ,

[335] Brunner HG ,

[336] Cailley D ,

[337] Callier P ,

[338] Chiesa J ,

[339] Chrast J ,

[340] Coin L ,

[341] Coutton C ,

[342] Cuisset JM ,

[343] Cuvellier JC ,

[344] David A ,

[345] de Freminville B ,

[346] Delobel B ,

[347] Delrue MA ,

[348] Demeer B ,

[349] Descamps D ,

[350] Didelot G ,

[351] Dieterich K ,

[352] Disciglio V ,

[353] Doco-Fenzy M ,

[354] Drunat S ,

[355] Duban-Bedu B ,

[356] Dubourg C ,

[357] El-Sayed Moustafa JS ,

[358] Elliott P ,

[359] Faas BH ,

[360] Faivre L ,

[361] Faudet A ,

[362] Fellmann F ,

[363] Ferrarini A ,

[364] Fisher R ,

[365] Flori E ,

[366] Forer L ,

[367] Gaillard D ,

[368] Gerard M ,

[369] Gieger C ,

[370] Gimelli S ,

[371] Gimelli G ,

[372] Grabe HJ ,

[373] Guichet A ,

[374] Guillin O ,

[375] Hartikainen AL ,

[376] Heron D ,

[377] Hippolyte L ,

[378] Holder M ,

[379] Homuth G ,

[380] Isidor B ,

[381] Jaillard S ,

[382] Jaros Z ,

[383] Jiménez-Murcia S ,

[384] Helas GJ ,

[385] Jonveaux P ,

[386] Kaksonen S ,

[387] Keren B ,

[388] Kloss-Brandstätter A ,

[389] Knoers NV ,

[390] Koolen DA ,

[391] Kroisel PM ,

[392] Kronenberg F ,

[393] Labalme A ,

[394] Landais E ,

[395] Lapi E ,

[396] Layet V ,

[397] Legallic S ,

[398] Leheup B ,

[399] Leube B ,

[400] Lewis S ,

[401] Lucas J ,

[402] MacDermot KD ,

[403] Magnusson P ,

[404] Marshall C ,

[405] Mathieu-Dramard M ,

[406] McCarthy MI ,

[407] Meitinger T ,

[408] Mencarelli MA ,

[409] Merla G ,

[410] Moerman A ,

[411] Mooser V ,

[412] Morice-Picard F ,

[413] Mucciolo M ,

[414] Nauck M ,

[415] Ndiaye NC ,

[416] Nordgren A ,

[417] Pasquier L ,

[418] Petit F ,

[419] Pfundt R ,

[420] Plessis G ,

[421] Rajcan-Separovic E ,

[422] Ramelli GP ,

[423] Rauch A ,

[424] Ravazzolo R ,

[425] Reis A ,

[426] Renieri A ,

[427] Richart C ,

[428] Ried JS ,

[429] Rieubland C ,

[430] Roberts W ,

[431] Roetzer KM ,

[432] Rooryck C ,

[433] Rossi M ,

[434] Saemundsen E ,

[435] Satre V ,

[436] Schurmann C ,

[437] Sigurdsson E ,

[438] Stavropoulos DJ ,

[439] Stefansson H ,

[440] Tengström C ,

[441] Thorsteinsdóttir U ,

[442] Tinahones FJ ,

[443] Touraine R ,

[444] Vallée L ,

[445] van Binsbergen E ,

[446] Van der Aa N ,

[447] Vincent-Delorme C ,

[448] Visvikis-Siest S ,

[449] Vollenweider P ,

[450] Völzke H ,

[451] Vulto-van Silfhout AT ,

[452] Waeber G ,

[453] Wallgren-Pettersson C ,

[454] Witwicki RM ,

[455] Zwolinksi S ,

[456] Andrieux J ,

[457] Estivill X ,

[458] Gusella JF ,

[459] Gustafsson O ,

[460] Metspalu A ,

[461] Scherer SW ,

[462] Stefansson K ,

[463] Blakemore AI ,

[464] Beckmann JS ,

[465] Froguel P

[466] 17.↵

Bachmann-Gagescu R ,
Mefford HC ,
Cowan C ,
Glew GM ,
Hing AV ,
Wallace S ,
Bader PI ,
Hamati A ,
Reitnauer PJ ,
Smith R ,
Stockton DW ,
Muhle H ,
Helbig I ,
Eichler EE ,
Ballif BC ,
Rosenfeld J ,
Tsuchiya KD
. Recurrent 200-kb deletions of 16p11.2 that include the SH2B1 gene are associated with developmental delay and obesity. Genet Med 2010;12:641–7.doi:10.1097/GIM.0b013e3181ef4286
OpenUrl CrossRef PubMed

[468] Bachmann-Gagescu R ,

[469] Mefford HC ,

[470] Cowan C ,

[471] Glew GM ,

[472] Hing AV ,

[473] Wallace S ,

[474] Bader PI ,

[475] Hamati A ,

[476] Reitnauer PJ ,

[477] Smith R ,

[478] Stockton DW ,

[479] Muhle H ,

[480] Helbig I ,

[481] Eichler EE ,

[482] Ballif BC ,

[483] Rosenfeld J ,

[484] Tsuchiya KD

[485] 18.↵

Mefford HC ,
Clauin S ,
Sharp AJ ,
Moller RS ,
Ullmann R ,
Kapur R ,
Pinkel D ,
Cooper GM ,
Ventura M ,
Ropers HH ,
Tommerup N ,
Eichler EE ,
Bellanne-Chantelot C
. Recurrent reciprocal genomic rearrangements of 17q12 are associated with renal disease, diabetes, and epilepsy. Am J Hum Genet 2007;81:1057–69.doi:10.1086/522591
OpenUrl CrossRef PubMed Web of Science

[487] Mefford HC ,

[488] Clauin S ,

[489] Sharp AJ ,

[490] Moller RS ,

[491] Ullmann R ,

[492] Kapur R ,

[493] Pinkel D ,

[494] Cooper GM ,

[495] Ventura M ,

[496] Ropers HH ,

[497] Tommerup N ,

[498] Eichler EE ,

[499] Bellanne-Chantelot C

[500] 19.↵

Fry A ,
Littlejohns TJ ,
Sudlow C ,
Doherty N ,
Adamska L ,
Sprosen T ,
Collins R ,
Allen NE
. Comparison of sociodemographic and health-related characteristics of uk biobank participants with those of the general population. Am J Epidemiol 2017;186:1026–34.doi:10.1093/aje/kwx246
OpenUrl CrossRef PubMed

Log in using your username and password

Main menu

Log in using your username and password

You are here

Abstract

Statistics from Altmetric.com

Request Permissions

Introduction

Methods

Participants

CNV calling

Choice of CNVs

Supplementary file 1

Choice of medical phenotypes

Supplementary file 2

Statistical analysis

Results and discussion

Quality control

Supplementary file 3

Effects of CNVs on medical phenotypes

Supplementary file 4

Supplementary file 5

Supplementary file 6

Phenotypes most likely to be affected by CNVs

Homozygous deletions and more than one CNV per person

Monitoring of CNV carriers

Acknowledgments

References

Footnotes

Read the full text or download the PDF:

Log in using your username and password