Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

An introduction to LifeLines DEEP: study design and baseline characteristics

Ettje F. Tigchelaar, Alexandra Zhernakova, Jackie A.M. Dekens, Gerben Hermes, Agnieszka Baranska, Zlatan Mujagic, Morris A. Swertz, Angélica M. Muñoz, Patrick Deelen, Maria C. Cénit, Lude Franke, Salome Scholtens, Ronald P. Stolk, Cisca Wijmenga, Edith J.M. Feskens
doi: https://doi.org/10.1101/009217
Ettje F. Tigchelaar
1University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
2Top Institute Food and Nutrition, Wageningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexandra Zhernakova
1University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
2Top Institute Food and Nutrition, Wageningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jackie A.M. Dekens
1University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
2Top Institute Food and Nutrition, Wageningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gerben Hermes
2Top Institute Food and Nutrition, Wageningen, the Netherlands
3Laboratory of Microbiology, Wageningen University, Wageningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Agnieszka Baranska
2Top Institute Food and Nutrition, Wageningen, the Netherlands
4Department of Toxicology, Nutrition and Toxicology Research (NUTRIM), Maastricht University Medical Center+, Maastricht, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zlatan Mujagic
2Top Institute Food and Nutrition, Wageningen, the Netherlands
5Division of Gastroenterology-Hepatology, Maastricht University Medical Center+, Maastricht, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Morris A. Swertz
1University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
2Top Institute Food and Nutrition, Wageningen, the Netherlands
6University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Angélica M. Muñoz
1University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
7University of Antioquia, Research Group in Food and Human Nutrition, Medellín, Colombia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Patrick Deelen
1University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
6University of Groningen, University Medical Center Groningen, Genomics Coordination Center, Groningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Maria C. Cénit
1University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lude Franke
1University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Salome Scholtens
8LifeLines Cohort Study, Groningen, the Netherlands
9University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ronald P. Stolk
8LifeLines Cohort Study, Groningen, the Netherlands
9University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Cisca Wijmenga
1University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, the Netherlands
2Top Institute Food and Nutrition, Wageningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Edith J.M. Feskens
2Top Institute Food and Nutrition, Wageningen, the Netherlands
10Division of Human Nutrition, Section Nutrition and Epidemiology, Wageningen University, Wageningen, the Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

ABSTRACT

There is a critical need for population-based prospective cohort studies because they follow individuals before the onset of disease, allowing for studies that can identify biomarkers and disease-modifying effects and thereby contributing to systems epidemiology. This paper describes the design and baseline characteristics of an intensively examined subpopulation of the LifeLines cohort in the Netherlands. For this unique sub-cohort, LifeLines DEEP, additional blood (n=1387), exhaled air (n=1425), fecal samples (n=1248) and gastrointestinal health questionnaires (n=1176) were collected for analysis of the genome, epigenome, transcriptome, microbiome, metabolome and other biological levels. Here, we provide an overview of the different data layers in LifeLines DEEP and present baseline characteristics of the study population including food intake and quality of life. We also describe how the LifeLines DEEP cohort allows for the detailed investigation of genetic, genomic and metabolic variation on a wealth of phenotypic outcomes. Finally, we examine the determinants of gastrointestinal health, an area of particular interest to us that can be addressed by LifeLines DEEP.

INTRODUCTION

Many diseases are multifactorial in origin, meaning that they are caused by a combination of genetic and environmental components. To date, a considerable number of genetic variants have been identified that are associated with almost every multifactorial disease or trait.[1] These independent genetic factors are often common, occurring frequently in the absence of disease, and therefore cannot yet be used to predict disease. For example, 40 risk loci have been identified for celiac disease that explain about 54% of disease risk [2], yet there is no clear correlation between carrying these risk alleles and actually developing celiac disease [3]. Thus the question remains: why do some people develop the disease while others are resilient despite carrying many genetic risk alleles? These resilient individuals may provide important clues to disease prevention, but they can only be identified when apparently healthy individuals are followed over time. This highlights the need for prospective cohort studies where life course processes are investigated and determinants of health and disease are identified. An advantage of population-based prospective cohort studies is that they are not specifically targeted to a diseased population and they follow individuals before disease onset, allowing for studies that can identify biomarkers and disease-modifying effects.[4] Furthermore, age-related processes that correlate to health and disease can be studied in these cohorts.

LifeLines is a population cohort of over 165,000 participants which covers multiple generations and focuses on determinants for multifactorial diseases. This large-scale population cohort includes detailed information on phenotypic and environmental factors, change of risk factors, quality of life, and disease incidence.[5] Genetic information is also available for about 10% of the population. A subset of approximately 1500 LifeLines participants also take part in LifeLines DEEP. These participants are examined more thoroughly, specifically with respect to molecular data, which allow for a more thorough investigation of the association between genetic and phenotypic variation. For these participants, additional biological materials are collected, as well as additional information on environmental factors. Subsequently, genome-wide transcriptomics and methylation data are generated, metabolites and biomarkers are measured, and the gut microbiome is assessed.

LifeLines DEEP specifically allows for in-depth analysis of gastrointestinal-health-related problems like irritable bowel syndrome (IBS). This is an important direction for research since gastrointestinal complaints are highly prevalent in the general population and have a high impact on quality of life.[6–8] IBS is a functional bowel disorder which involves abdominal pain or discomfort and related change in bowel habits.[9] Prevalence of IBS in Western countries varies widely among different studies ranging from 4% up to 22%.[10] There are, however, no specific tests available to diagnose IBS. The current diagnosis is based on excluding gastrointestinal diseases and on symptoms using diagnostic criteria such as the Rome III criteria.[9]

In this paper we describe the study design and baseline characteristics of the LifeLines DEEP cohort and explain how the collected data can be applied to multiple fields of interest.

METHODS

LifeLines

Individuals aged 25-50 years old were invited by their general practitioner to participate in the LifeLines study. Upon inclusion, participants were asked about their family. Family members were then also invited to participate in LifeLines in order to obtain inclusion over three generations. At baseline, all participants visited one of the LifeLines Research Sites twice for physical examinations. Prior to these visits, two extensive baseline questionnaires were completed at home. At the first visit, anthropometry, blood pressure, cognitive functioning and pulmonary function as well as other factors were measured (Table 1a). At the second visit, approximately two weeks later, a fasting blood sample was collected. In total 167,729 participants have been included who will be followed for 30 years. Every eighteen months, each participant receives a follow-up questionnaire. Additionally, once every five years, each participant is invited to one of the LifeLines Research Sites for follow-up measurement of their health parameters.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1a.

Overview of collected data in LifeLines and source it originates from. More detailed information is available through the LifeLines data catalog at www.lifelines.net

LifeLines DEEP

From April to August 2013, all participants registered at the LifeLines Research Site in Groningen were invited to participate in LifeLines DEEP in addition to the regular LifeLines program. After participants signed informed consent, three additional tubes of blood were drawn by one of the LifeLines physician assistants during the participant's second visit to site. Exhaled air was also collected during this visit and participants were given instructions for feces collection at home by one of the LifeLines DEEP assistants. The participants who agreed to collect a fecal sample also received the questionnaire on gastrointestinal complaints. Immediately after fecal sample collection, the sample was frozen at −20°C. Fecal samples were collected on dry-ice from the participants’ homes within 1 to 2 weeks after the second site visit. Upon arrival at the research location, fecal samples were immediately stored at −80°C.

Inclusion

Initially, 1539 participants were included in the LifeLines DEEP study. Of these participants, 78 dropped-out: 51 did not complete the second visit to the LifeLines location in time and 27 withdrew from participation. In total, 1461 individuals completed the LifeLines DEEP study. From those participants, we collected additional blood for genetics, methylation, and transcriptomics analyses (n=1387); exhaled air for analysis of volatile organic compounds (n=1425); and fecal samples for microbiome and biomarker assessment (n=1248) (Table 1b). Moreover, 1176 gastrointestinal complaints questionnaires were returned. For 1183 participants, we collected all three biomaterials: additional blood, exhaled air and feces (Fig 1). For 168 participants, we collected additional blood and exhaled air and for 65 participants we collected exhaled air and feces. For 45 participants, we only have additional blood or exhaled air.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1b.

Overview of additional data collected in LifeLines DEEP including the number of samples, the source biomaterial it originates from and the method of analysis used

Fig 1
  • Download figure
  • Open in new tab
Fig 1

Overview of additional biomaterials collected in LifeLines DEEP

Additional data types

Genome wide transcriptomics were assessed as a measure of gene expression. We isolated RNA from whole blood collected in a PAXgene tube using PAXgene Blood miRNA Kit (Qiagen, CA, USA). The RNA samples were quantified and assessed for integrity before sequencing. Total RNA from whole blood was deprived of globin using GLOBINclear kit (Ambion, Austin, TX, USA) and subsequently processed for sequencing using Truseq version 2 library preparation kit (Illumina Inc., San Diego, CA, USA). Paired-end sequencing of 2 x 50bp was performed using Illumina's Hiseq2000, pooling 10 samples per lane. Finally, read sets per sample were generated using CASAVA, retaining only reads passing Illumina's Chastity Filter for further processing. On average, the number of raw reads per individual after QC was 44.3 million. After adapter trimming, the reads were mapped to human genome build 37 using STAR (https://code.google.com/p/rna-star/). Of these, 96% of reads were successfully mapped to the genome. Transcription was quantified on the gene and meta-exon level using BEDTools (https://code.google.com/p/bedtools/) and custom scripts, and on the transcript level using FluxCapacitor (http://sammeth.net/confluence/display/FLUX/Home).

We isolated total DNA from EDTA tubes and profiled genome wide methylation using the Infinium HumanMethylation450 BeadChip, as previously described.[11] In short, 500 ng of genomic DNA was bisulfite modified and used for hybridization on Infinium HumanMethylation450 BeadChips, according to the Illumina Infinium HD Methylation protocol.

We determined metabolites in both exhaled air and blood. Metabolites from exhaled air were measured by a combination of gas chromatography and time-of-flight mass spectrometry (GC-tof-MS) as described previously[12,13]. In short, the exhaled air sample was introduced in a GC that separates the different compounds in the mixture. Subsequently, the compounds were introduced into the MS to detect and also to identify the separated volatile organic compounds. The metabolites in plasma were measured using the nuclear magnetic resonance (NMR) method, as described by Kettunen et al.[14]

Genotyping of genomic DNA was performed using both the HumanCytoSNP-12 BeadChip [15] and the ImmunoChip, a customized Illumina Infinium array [16]. Genotyping was successful for 1,385 samples (CytoSNP) and 1,374 samples (IChip), respectively. First, SNP quality control was applied independently for both platforms. SNPs were filtered on MAF above 0.001, a HWE p-value >1e−4 and call rate of 0.98 using Plink.[17] The genotypes from both platforms were merged into one dataset. For genotypes present on both platforms the genotypes were put on missing in the case of non-concordant calls. After merging, SNPs were filtered again on MAF 0.05 and call rate of 0.98 resulting in a total of 379,885 genotyped SNPs. Next, this data was imputed based on the Genome of the Netherlands (GoNL) reference panel.[18–20] The merged genotypes were pre-phased using SHAPEIT24[21] and aligned to the GoNL reference panel using Genotype Harmonizer (http://www.molgenis.org/systemsgenetics/) in order to resolve strand issues. The imputation was performed using IMPUTE2[22] version 2.3.0 against the GoNL reference panel. We used MOLGENIS compute [23] imputation pipeline to generate our scripts and monitor the imputation. Imputation yielded 8,606,371 variants with Info score ≥0.8. In addition, HLA type was established via Broad SNP2HLA imputation pipeline.[24]

We collected several types of cells including lymphocytes and granulocytes for assessment of telomere length as a measure for aging. Currently, we are optimizing the FlowFish method of telomere measuring as described by Baerlocher et al.[25] In addition, peripheral blood mononuclear cells (PBMCs) were collected and stored at −80°C for future functional studies.

Fecal samples were collected in order to study the gut microbiome. Gut microbial composition was assessed by 16S rRNA gene sequencing of the V4 variable region on the Illumina MiSeq platform according to the manufacturer's specification.[26] Reads were quality filtered and taxonomy was inferred using a closed reference Operational Taxonomic Unit picking protocol against a pre-clustered GreenGenes database, as implemented by QIIME.[27,28] Moreover, fecal aliquots were stored for future analysis of gastrointestinal– health-related biomarkers.

In addition, phenotypic data was collected on gastrointestinal health complaints by means of the Rome III criteria questionnaire [9] and the Bristol Stool Form Scale.[29]

We collected and stored plasma for future analysis of disease and ageing-related biomarkers such as circulating microRNAs.

Analyses of baseline characteristics, quality of life, gastrointestinal complaints and qualitative food intake

For each participant a risk score for cardiovascular disease (CVD) was calculated according to the scoring algorithm developed within the Framingham Heart Study.[30] The CVD risk score ranges from ≤-3 to ≥18 and is calculated based on gender, age, HDL, total cholesterol, systolic blood pressure, smoking status, and presence or absence of diabetes.

We calculated quality of life scores based on the RAND 36-item Short Form Health Survey scoring version I by calculating eight summary scores and the mental and physical component score.[31–33] The summary scores range from zero to 100 points where 100 represents the best quality of life. The mental and physical component scores were transformed to have a mean of 50 and a standard deviation of 10 compared to the reference population as described by Ware et al.[32–34]

Occurrence of functional bowel disorders was assessed via the Rome III criteria.[9] Participants with self-reported Crohn's disease, ulcerative colitis and celiac disease were excluded from this analysis.

Data on habitual dietary intake was collected via a validated food frequency questionnaire, developed by the division of Human Nutrition of Wageningen University.[35]

Mean and standard deviations for the baseline characteristics and quality of life scores were calculated. Statistical programs R (version 3.0.1) and IBM SPSS Statistics (version 20) were used for analyses and construction of figures.

RESULTS

Over a period of 6 months, 1539 participants enrolled in the LifeLines DEEP study. Slightly more women (n=903, 58.7%) than men (n=636, 41.3%) were included (Table 2). The age of the participants ranged from 18 to 86 years, with a mean age of 44. Mean BMI was 25.2 kg/m2. On average, total cholesterol level and blood glucose level both were 5.0 mmol/L. Average blood pressure was lower in women (116/68 mmHg) compared to men (124/74 mmHg). Amongst women, the percentage of current smokers was a little lower (18.3%) than in men (19.5%). The Framingham risk score for cardiovascular disease was on average 5.7 for women and 8.6 for men, corresponding to a 3% and 7% risk of a first cardiovascular event, respectively (Table 2).[30] In our cohort the quality of life score was lowest for vitality (mean(sd): 67.2(15.5)) and highest for physical functioning (mean(sd): 92.1(12.3)) (Fig. 2 and Supplementary table Table 3). The age-and gender-adjusted quality of life scores yielded similar results.

Fig 2
  • Download figure
  • Open in new tab
Fig 2 Mean and sd of crude and adjusted quality of life scores, two component scores and eight group scores in the LifeLines DEEP population (n=1539) compared to a national sample of the Dutch population [34,42].

Adjusted score is adjusted for gender and age. PCS: physical component score, MCS: mental component score, PF: physical functioning, RP: role-physical, BP: bodily pain, GH: general health, VT: vitality, SF: social functioning, RE: role-emotional, MH: mental health.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2.

Baseline characteristics LifeLines DEEP by gender including smoking, age, BMI, cholesterol level, glucose level, blood pressure and Framingham risk score for cardiovascular disease

Analysis of 1176 gastrointestinal complaints questionnaires identified 409 participants with functional bowel disorders (Fig 3). Prevalence of irritable bowel syndrome (IBS) in our cohort was 21% (n=249). Another 13% (n=160) of participants fulfilled criteria for functional bloating (9%, n=108) or functional constipation (3%, n=37) or functional diarrhea (1%, n=15). Two third of the participants (n=767) did not meet the Rome III criteria for functional bowel disorders. Moreover, 4% (n=51) of the participants answered that they never experience any gastrointestinal complaints (Fig 3).

Fig 3
  • Download figure
  • Open in new tab
Fig 3 Functional bowel disorders in the LifeLines DEEP cohort based on Rome III criteria (n=1176)

IBS (Irritable Bowel Syndrome): pain or discomfort at least 2-3 days/month, IBS_strict: pain or discomfort more than one day per week, FBD: functional bowel disorder, healthy gut: lowest possible score on Rome III questionnaire

Analysis of the frequency of intake of major food groups showed a subdivision into three main categories. The first category contained food groups that were consumed daily, bread and coffee, for example (Fig 4a and 4b). The second category contained food groups for which consumption ranged from daily to a few days per week. Examples of these food groups include meat, vegetables and fruit (Fig 4c, 4d and 4e). The third category included food groups that were consumed on a weekly to monthly basis only, fish, for example (Fig 4f). For other food groups, such as milk and alcoholic beverages, the intake varied greatly (Fig 3g and 3h). This frequency data will later be combined with portion sizes and the Dutch food composition table [NEVO 2006, RIVM, Bilthoven] to estimate nutrient intake in grams per day.

Fig 4
  • Download figure
  • Open in new tab
Fig 4

Qualitative intake of a. bread, b. coffee, c. meat and poultry, d. vegetables, e. fruit, f. fish, g. milk and buttermilk, and h. alcoholic beverages in LifeLines DEEP (n=1539). Bars represent: ‘not this month’, ‘1 day a month’, ‘2-3 days a month’, ‘1 day a week’, ‘2-3 days a week’, ‘4-5 days a week’, and ‘6-7 days a week’.

For all individuals, additional biomaterials were collected (Fig 1) for future systems epidemiological studies [36] integrating multi-level ‘omics’ data with environmental, physical and epidemiological data to provide a deeper and more detailed view of the LifeLines DEEP population. These biomaterials include plasma to examine the concentration of metabolites, peripheral blood mononuclear cells to determine genome-wide transcription and methylation profiles, exhaled air to analyze volatile organic compounds and feces to establish composition of the gut microbiome. Moreover, genetic data has been generated for all individuals allowing for the construction of genetic risk profiles for a wide variety of common diseases. These multiple data levels will provide rich opportunities for future research into the molecular underpinnings of human health and disease, as well as research into the interaction between molecular and environmental components including behavior, socio-demographic factors, and analysis of specific subgroups.

DISCUSSION AND FUTURE PERSPECTIVES

In the design of a population cohort study it is important to balance breadth (the number of samples included) and depth (the amount of phenotypic data). LifeLines is a large prospective cohort which includes more than 165,000 individuals and measures several thousands of phenotypes ranging from biochemical parameters, physical measurements, psychosocial characteristics and environmental factors to detailed information on health status. However, the cohort was not set up to include molecular data levels for the study of health and disease in human populations. With LifeLines DEEP, we are performing a pilot study of additional deep molecular measurements in 1,500 individuals using biomaterials from different domains that were all collected contemporaneously from fasting individuals. Although the LifeLines DEEP cohort is relatively small, it will allow for proof-of-concept studies into systems epidemiology. LifeLines DEEP is unique in having exhaled air measurements from all individuals and a level of information that, to our knowledge, is rarely present in other population-based cohorts. Additionally, both the collection of cells for telomere length measurements and further functional studies and the fecal sample collection are unique. LifeLines DEEP will not only contribute to a better understanding of the association between genetic variation and molecular function, but can also be integrated with other population cohorts that have similar molecular data. In particular, the collection and analysis of fecal material is crucial given increasing evidence that the gut microbiome can play an important role in health and disease [37]. Nevertheless, harmonization and linking of data across multiple cohorts might be needed to achieve critical numbers. The Biobanking and Biomolecular Research Infrastructure in the Netherlands (BBMRI-NL) [38] and Europe [39] will allow for such studies.

LifeLines DEEP is also, by itself, a unique data source. For 1500 individuals, we will be able to construct genetic risk profiles for predisposition to many common diseases based on genome-wide association data. Next, we will be able to link these risk profiles to phenotype information, as well as clinical, immunological and other parameters. Using this information, we may already be able to compare high-risk individuals with and without disease complaints to generate hypotheses on resilient individuals. At the same time, LifeLines DEEP also allows for integration across different data levels to study, for example, the association between molecular and phenotypic data to increase our understanding of pathogenic mechanisms[40].

One area of particular interest is the domain related to gastrointestinal health. We therefore studied the prevalence of IBS in LifeLines DEEP. We identified IBS in 21% of the participants. These data should be interpreted with caution as our diagnosis is based solely on a participant-administered Rome III criteria questionnaire, and results therefore may be slightly inflated. Nevertheless, our result is consistent with previous suggestions that almost one quarter of the population encounters irritable bowel symptoms over the course of their lifetimes.[41] The prevalence in our cohort confirms that IBS is a common disease and thus research aimed on improved diagnosis and treatment will serve the society. Gastrointestinal complaints are multifactorial making large cohorts necessary to study them in more detail. For IBS in particular there is an urgent need to develop biomarkers that are predictive of the disease. We have selected six gastrointestinal-health-related biomarkers that are currently being analyzed. In a next phase, we will combine our results with other cohorts that study gut health and IBS including the Maastricht IBS cohort (currently including 400 cases and 200 healthy controls, recruitment is ongoing) from the Netherlands, and the TwinGene (n=11,000) and LifeGene (n=30,000, recruitment is ongoing) cohorts from Sweden.

Despite the high prevalence of IBS, the quality of life in our study population in general was higher compared to a random selection of the Dutch population as reported by Aaronson et al.[42] This might be due to age and gender differences, since the National sample included 56% men with a mean(sd) age of 47.6(18.0) years[42], compared to 41% men with a mean(sd) age of 44.6(13.8) years in the LifeLines DEEP cohort. Secular changes may also play a role, since the national survey was conducted more than 15 years ago.

In conclusion, we have established a cohort whose multiple data layers allow for integrative analysis of populations, for translation of this information into biomarkers for disease and for providing new insights into disease mechanisms and prevention.

ETHICAL STATEMENTS

The LifeLines DEEP study has been approved by the ethical committee of the University Medical Centre Groningen, document no. METC UMCG LLDEEP: M12.113965. All participants signed their informed consent prior to study enrolment.

CONFLICT OF INTEREST

The authors declare that they have no conflict of interest

ACKNOWLEDGEMENTS

We would like to thank the LifeLines participants and the personnel from the LifeLines study site Groningen for the collaboration. In addition, we would like to thank the LifeLines DEEP research assistants, Wilma Westerhuis-van der Tuuk, Marc Jan Bonder, Astrid Maatman, Mathieu Platteel, Kim de Lange and Debbie van Dussen for their practical and analytical work. We kindly acknowledge Jackie Senior and Kate Mc Intyre for editing our manuscript. Furthermore, we would like to thank The Target project (http://www.rug.nl/target) for providing the compute infrastructure and the BigGrid/eBioGrid project (http://www.ebiogrid.nl) for sponsoring the imputation pipeline implementation.

This project was funded by Top Institute Food and Nutrition Wageningen GH001 to CW, the Biobanking and Biomolecular Research Infrastructure Netherlands (BBMRI-NL) RP3 to LF and an ERC advanced grant ERC-671274 to CW. Sasha Zhernakova holds a Rosalind Franklin fellowship (University of Groningen).

Footnotes

  • e.f.tigchelaar{at}umcg.nl Telephone: 0031 652 724 618

REFERENCES

  1. 1.↵
    Welter D, Macarthur J, Morales J, Burdett T, Hall P, Junkins H, et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 2014;42:D1001–6.
    OpenUrlCrossRefPubMedWeb of Science
  2. 2.↵
    Kumar V, Wijmenga C, Withoff S. From genome-wide association studies to disease mechanisms: celiac disease as a model for autoimmune diseases. Semin. Immunopathol. 2012;34:567–80.
    OpenUrlCrossRefPubMed
  3. 3.↵
    Romanos J, Rosén A, Kumar V, Trynka G, Franke L, Szperl A, et al. Improving coeliac disease risk prediction by testing non-HLA variants additional to HLA variants. Gut. 2014;63:415–22.
    OpenUrlAbstract/FREE Full Text
  4. 4.↵
    Manolio TA, Bailey-Wilson JE, Collins FS. Genes, environment and the value of prospective cohort studies. Nat. Rev. National Human Genome Research Institute, 31 Center Drive, Room 4B09, Bethesda, Maryland 20892-2154, USA. manolio{at}nih.gov; 2006;7:812–20.
    OpenUrl
  5. 5.↵
    Stolk RP, Rosmalen JGM, Postma DS, de Boer RA, Navis G, Slaets JPJ, et al. Universal risk factors for multifactorial diseases: LifeLines: a three-generation population-based study. Eur. J. Epidemiol. Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands. R.P.Stolk{at}epi.umcg.nl; 2008;23:67– 74.
    OpenUrlCrossRefPubMedWeb of Science
  6. 6.↵
    El-Serag HB, Olden K, Bjorkman D. Health-related quality of life among persons with irritable bowel syndrome: a systematic review. Aliment. Pharmacol. Ther. Sections of Gastroenterology & Health Services Research, Houston Veterans Affairs Medical Center (152), 2002 Holcombe Boulevard, Houston, TX 77030, USA. hasheme{at}bcm.tmc.edu; 2002;16:1171–85.
    OpenUrl
  7. 7.
    Gralnek IM, Hays RD, Kilbourne A, Naliboff B, Mayer EA. The impact of irritable bowel syndrome on health-related quality of life. Gastroenterology. Greater Los Angeles VA Health Care System, Los Angeles, California 90073, USA. igralnek{at}mednet.ucla-edu; 2000;119:654–60.
    OpenUrlCrossRefPubMedWeb of Science
  8. 8.↵
    Ten Berg MJ, Goettsch WG, van den Boom G, Smout AJ, Herings RM. Quality of life of patients with irritable bowel syndrome is low compared to others with chronic diseases. Eur. J. Gastroenterol. Hepatol. PHARMO Institute for Drug Outcomes Research, 3508 AE Utrecht, The Netherlands.; 2006;18:475–81.
    OpenUrlCrossRefPubMedWeb of Science
  9. 9.↵
    Longstreth GF, Thompson WG, Chey WD, Houghton LA, Mearin F, Spiller RC. Functional bowel disorders. Gastroenterology. Kaiser Permanente Medical Care Program, San Diego, California 92120, USA. George.F.Longstreth{at}kp.org; 2006;130:1480–91.
    OpenUrlCrossRefPubMedWeb of Science
  10. 10.↵
    Drossman D a, Camilleri M, Mayer E a, Whitehead WE. AGA technical review on irritable bowel syndrome. Gastroenterology. 2002;123:2108–31.
    OpenUrlCrossRefPubMedWeb of Science
  11. 11.↵
    Harris RA, Nagy-Szakal D, Pedersen N, Opekun A, Bronsky J, Munkholm P, et al. Genome-wide peripheral blood leukocyte DNA methylation microarrays identified a single association with inflammatory bowel diseases. Inflamm. Bowel Dis. 2012;18:2334–41.
    OpenUrlCrossRefPubMed
  12. 12.↵
    Boots AW, van Berkel JJ, Dallinga JW, Smolinska A, Wouters EF, van Schooten FJ. The versatile use of exhaled volatile organic compounds in human health and disease. J. Breath Res. Department of Toxicology, Maastricht University, The Netherlands.; 2012;6:27108.
    OpenUrl
  13. 13.↵
    Baranska A, Tigchelaar E, Smolinska A, Dallinga JW, Moonen EJ, Dekens JA, et al. Profile of volatile organic compounds in exhaled breath changes as a result of gluten-free diet. J. Breath Res. Top Institute Food and Nutrition, Wageningen, the Netherlands.; 2013;7:037104–7155/7/3/037104. Epub 2013 Jun 18.
  14. 14.↵
    Kettunen J, Tukiainen T, Sarin A-P, Ortega-Alonso A, Tikkanen E, Lyytikäinen L-P, et al. Genome-wide association study identifies multiple loci influencing human serum metabolite levels. Nat. Genet. 2012;44:269–76.
    OpenUrlCrossRefPubMed
  15. 15.↵
    Dolmans GH, Werker PM, Hennies HC, Furniss D, Festen EA, Franke L, et al. Wnt signaling and Dupuytren's disease. N. Engl. J. Med. 2011;365:307–17.
    OpenUrlCrossRefPubMedWeb of Science
  16. 16.↵
    Trynka G, Hunt KA, Bockett NA, Romanos J, Mistry V, Szperl A, et al. Dense genotyping identifies and localizes multiple common and rare variant association signals in celiac disease. Nat. Genet. Genetics Department, University Medical Center and University of Groningen, The Netherlands.; 2011;43:1193–201.
    OpenUrl
  17. 17.↵
    Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007;81:559–75.
    OpenUrlCrossRefPubMed
  18. 18.↵
    Boomsma DI, Wijmenga C, Slagboom EP, Swertz MA, Karssen LC, Abdellaoui A, et al. The Genome of the Netherlands: design, and project goals. Eur. J. Hum. Genet. 2014;22:221–7.
    OpenUrlCrossRefPubMed
  19. 19.
    Deelen P, Menelaou A, van Leeuwen EM, Kanterakis A, van Dijk F, Medina-Gomez C, et al. Improved imputation quality of low-frequency and rare variants in European samples using the “Genome of The Netherlands.” Eur. J. Hum. Genet. 2014;
  20. 20.↵
    Francioli LC, Menelaou A, Pulit SL, van Dijk F, Palamara PF, Elbers CC, et al. Whole-genome sequence variation, population structure and demographic history of the Dutch population. Nat. Genet. Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved.; 2014;advance on.
  21. 21.↵
    Delaneau O, Zagury J-F, Marchini J. Improved whole-chromosome phasing for disease and population genetic studies. Nat. Methods. 2013;10:5–6.
    OpenUrlCrossRefPubMedWeb of Science
  22. 22.↵
    Howie B, Marchini J, Stephens M. Genotype imputation with thousands of genomes. G3 (Bethesda). 2011;1:457–70.
    OpenUrlPubMed
  23. 23.↵
    Byelas H, Dijkstra M, Neerincx P, van Dijk F, Kanterakis A, Deelen P, et al. Scaling bio-analyses from computational clusters to grids. IWSG. 2013;
  24. 24.↵
    Jia X, Han B, Onengut-Gumuscu S, Chen W-M, Concannon PJ, Rich SS, et al. Imputing amino acid polymorphisms in human leukocyte antigens. PLoS One. 2013;8:e64683.
    OpenUrlCrossRefPubMed
  25. 25.↵
    Baerlocher GM, Vulto I, de Jong G, Lansdorp PM. Flow cytometry and FISH to measure the average length of telomeres (flow FISH). Nat. Protoc. 2006;1:2365–76.
    OpenUrlCrossRefPubMedWeb of Science
  26. 26.↵
    Gevers D, Kugathasan S, Denson LA, Vázquez-Baeza Y, Van Treuren W, Ren B, et al. The Treatment-Naive Microbiome in New-Onset Crohn's Disease. Cell Host Microbe. 2014;15:382–92.
    OpenUrlCrossRefPubMedWeb of Science
  27. 27.↵
    Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, et al. QIIME allows analysis of high-throughput community sequencing data. Nat. Methods. 2010;7:335–6.
    OpenUrlCrossRefPubMedWeb of Science
  28. 28.↵
    DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, et al. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl. Environ. Microbiol. 2006;72:5069–72.
    OpenUrlAbstract/FREE Full Text
  29. 29.↵
    O'Donnell LJ, Virjee J, Heaton KW. Detection of pseudodiarrhoea by simple clinical assessment of intestinal transit rate. BMJ. Department of Medicine and Radiology, Bristol Royal Infirmary.; 1990;300:439–40.
    OpenUrl
  30. 30.↵
    D'Agostino RB S, Vasan RS, Pencina MJ, Wolf PA, Cobain M, Massaro JM, et al. General cardiovascular risk profile for use in primary care: the Framingham Heart Study. Circulation. Boston University, Department of Mathematics and Statistics, 111 Cummington St, Boston, MA 02215, USA.; 2008;117:743–53.
    OpenUrlAbstract/FREE Full Text
  31. 31.↵
    Ware JE Jr., Kosinski M, Bayliss MS, McHorney CA, Rogers WH, Raczek A. Comparison of methods for the scoring and statistical analysis of SF-36 health profile and summary measures: summary of results from the Medical Outcomes Study. Med. Care. Health Institute, New England Medical Center, Boston, MA, USA.; 1995;33:AS264–79.
    OpenUrlPubMedWeb of Science
  32. 32.↵
    Ware JE, Kosinski M, Keller S. SF-36 Physical and Mental Summary Scales: A User's Manual. Boston; 1994.
  33. 33.↵
    Ware JE, Kosinski M. SF-36 Physical and Mental Health Summary Scales: A Manual for Users of Version 1. Second. Lincoln, RI: QualityMetric Incorporated; 2001.
  34. 34.↵
    Ware JE, Gandek B, Kosinski M, Aaronson NK, Apolone G, Brazier J, et al. The Equivalence of SF-36 Summary Health Scores Estimated Using Standard and Country-Specific Algorithms in 10 Countries. J. Clin. Epidemiol. 1998;51:1167–70.
    OpenUrlCrossRefPubMedWeb of Science
  35. 35.↵
    Siebelink E, Geelen A, de Vries JHM. Self-reported energy intake by FFQ compared with actual energy intake to maintain body weight in 516 adults. Br. J. Nutr. Division of Human Nutrition, Wageningen University, PO Box 8129, 6700 EV Wageningen, The Netherlands.: Cambridge University Press; 2011;106:274–81.
    OpenUrlCrossRefPubMed
  36. 36.↵
    Haring R, Wallaschofski H. Diving through the “-omics”: the case for deep phenotyping and systems epidemiology. OMICS. 2012;16:231–4.
    OpenUrlCrossRefPubMedWeb of Science
  37. 37.↵
    Tremaroli V, Bäckhed F. Functional interactions between the gut microbiota and host metabolism. Nature. 2012;489:242–9.
    OpenUrlCrossRefPubMedWeb of Science
  38. 38.↵
    Brandsma M, Baas F, Bakker PIW de, Beem EP, Boomsma DI, Bovenberg J, et al. How to kickstart a national biobanking infrastructure – experiences and prospects of BBMRI-NL. Nor. Epidemiol. 2012.
  39. 39.↵
    Wichmann H-E, Kuhn KA, Waldenberger M, Schmelcher D, Schuffenhauer S, Meitinger T, et al. Comprehensive catalog of European biobanks. Nat. Biotechnol. 2011;29:795–7.
    OpenUrlCrossRefPubMed
  40. 40.↵
    Dumas M-E, Kinross J, Nicholson JK. Metabolic phenotyping and systems biology approaches to understanding metabolic syndrome and fatty liver disease. Gastroenterology. 2014;146:46–62.
    OpenUrlCrossRefPubMed
  41. 41.↵
    Jones R, Lydeard S. Irritable bowel syndrome in the general population. BMJ. 1992;304:87–90.
    OpenUrlAbstract/FREE Full Text
  42. 42.↵
    Aaronson NK, Muller M, Cohen PD, Essink-Bot ML, Fekkes M, Sanderman R, et al. Translation, validation, and norming of the Dutch language version of the SF-36 Health Survey in community and chronic disease populations. J. Clin. Epidemiol. Division of Psychosocial Research and Epidemiology, The Netherlands Cancer Institute, Amsterdam. naaron{at}nki.nl; 1998;51:1055–68.
    OpenUrlCrossRefPubMedWeb of Science
Back to top
PreviousNext
Posted September 16, 2014.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
An introduction to LifeLines DEEP: study design and baseline characteristics
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
An introduction to LifeLines DEEP: study design and baseline characteristics
Ettje F. Tigchelaar, Alexandra Zhernakova, Jackie A.M. Dekens, Gerben Hermes, Agnieszka Baranska, Zlatan Mujagic, Morris A. Swertz, Angélica M. Muñoz, Patrick Deelen, Maria C. Cénit, Lude Franke, Salome Scholtens, Ronald P. Stolk, Cisca Wijmenga, Edith J.M. Feskens
bioRxiv 009217; doi: https://doi.org/10.1101/009217
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
An introduction to LifeLines DEEP: study design and baseline characteristics
Ettje F. Tigchelaar, Alexandra Zhernakova, Jackie A.M. Dekens, Gerben Hermes, Agnieszka Baranska, Zlatan Mujagic, Morris A. Swertz, Angélica M. Muñoz, Patrick Deelen, Maria C. Cénit, Lude Franke, Salome Scholtens, Ronald P. Stolk, Cisca Wijmenga, Edith J.M. Feskens
bioRxiv 009217; doi: https://doi.org/10.1101/009217

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (2543)
  • Biochemistry (4994)
  • Bioengineering (3497)
  • Bioinformatics (15279)
  • Biophysics (6926)
  • Cancer Biology (5427)
  • Cell Biology (7771)
  • Clinical Trials (138)
  • Developmental Biology (4558)
  • Ecology (7180)
  • Epidemiology (2059)
  • Evolutionary Biology (10261)
  • Genetics (7532)
  • Genomics (9826)
  • Immunology (4899)
  • Microbiology (13304)
  • Molecular Biology (5165)
  • Neuroscience (29568)
  • Paleontology (203)
  • Pathology (842)
  • Pharmacology and Toxicology (1470)
  • Physiology (2153)
  • Plant Biology (4780)
  • Scientific Communication and Education (1015)
  • Synthetic Biology (1343)
  • Systems Biology (4022)
  • Zoology (771)