Chromatin Landscapes and Genetic Risk For Juvenile Idiopathic Arthritis

Lisha Zhu; Kaiyu Jiang; Karstin Webber; Laiping Wong; Tao Liu; Yanmin Chen; James N. Jarvis

doi:10.1101/075721

Abstract

Juvenile idiopathic arthritis (JIA) is considered to be an autoimmune disease mediated by interactions between genes and the environment. To gain a better understanding of the cellular basis for genetic risk, we studied known JIA genetic risk loci, the majority of which are located in non-coding regions, in human neutrophils and CD4 primary T cells to identify genes and functional elements located within those risk loci. We analyzed RNA-Seq data, H3K27ac and H3K4me1 chromatin immunoprecipitation-sequencing (ChIP-Seq) data, and previously published chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) data to characterize the chromatin landscapes within the know JIA-associated risk loci. In both neutrophils and primary CD4+ T cells, the majority of the JIA-associated LD blocks contained H3K27ac and/or H3K4me1 marks. These LD blocks were also binding sites for a small group of transcription factors, particularly in neutrophils. Furthermore, these regions showed abundant intronic and intergenic transcription in neutrophils. In neutrophils, none of the genes that were differentially expressed between untreated JIA patients and healthy children was located within the JIA risk LD blocks. In CD4+ T cells, multiple genes, including HLA-DQA1, HLA-DQB2, TRAF1, and IRF1 were associated with the long-distance interacting regions within the LD regions as determined from ChIA-PET data. These findings suggest that aberrant transcriptional control is the underlying pathogenic mechanism in JIA. Furthermore, these findings demonstrate the challenges of identifying the actual causal variants within complex genomic/chromatin landscapes.

Introduction

In recent years, there has been considerable progress in our ability to identify genetic risk in juvenile idiopathic arthritis (JIA) [1, 2]. Furthermore, we are beginning to understand the contribution that genetic variance makes to the heterogeneity of clinical response to first-line agents such as methotrextate [3]. However, taking genetic risk information and integrating it into a coherent, testable hypothesis of disease pathogenesis has remained challenging. A recent genetic fine mapping study by Hinks et al [4] illustrates that challenge. While these authors were able to identify multiple previously unknown regions of genetic risk for JIA using the Illumina Immunochip, the majority of the risk-associated variants were located within non-coding regions of the genome. In this regard, JIA resembles almost every other complex trait that has been studied using genome-wide association studies (GWAS) [5, 6], including immunologic/autoimmune diseases [7]. Thus, although it’s common to identify genetic risk loci in terms of the closest protein-coding gene, we are learning that the genome is far more complex than previously imagined, and that the presence of a risk-conferring single nucleotide polymorphism (SNP) near a specific gene is not prima facie evidence that the causal variants have anything to do with that particular gene [8].

We have previously demonstrated how the field might begin to make sense of the wealth of genetic data that is being generated via GWAS and fine mapping studies [9]. We recently demonstrated that the majority of the disease-associated SNPs identified on a genetic fine mapping study by Hinks et al that are situated within the non-coding genome are situated within linkage disequilibrium (LD) blocks that are enriched for H3K4me1/H3K27ac histone marks, epigenetic signatures associated with enhancer function, in both neutrophils and CD4+ T cells. Several of these same LD blocks contain non-coding RNAs that were identified on RNA sequencing (RNA-Seq) and verified by reverse transcriptase polymerase chain reaction (rtPCR) [9].

Our earlier paper focused entirely on the novel risk regions identified in the Hinks study [4]. In the current study, we examined additional regions of genetic risk as recently reviewed by Hersh et al [1] and Herlin et al [2]. We demonstrate how understanding the transcriptome and functional, non-coding genome of allows us to better understand the nature of genetic risk in JIA. In the current paper, we examine the chromatin landscapes in both CD4+ T cells and neutrophils. The former are widely accepted as important mediators of the pathobiology of JIA [10], and the latter have become the subject of increasing interest from the standpoint of the role(s) in childhood onset rheumatic diseases [11]. We have previously shown, for example that neutrophils from children with JIA display specific aberrations in gene expression that are linked to perturbations in fundamental metabolic processes [12]. We used publically available genomic data as well as our own RNA sequencing data to gain mechanistic insights into JIA disease processes from genetic risk data.

Materials and Methods

Patients and Patient Samples–Neutrophils for RNA sequencing (RNA-Seq) were obtained from 3 children (2 girls and 1 boys) with newly diagnosed, active, untreated JIA (ADU group) and 2 healthy control children (HC group) recruited from the University of Oklahoma General Pediatrics clinic. Patients with JIA all had the polyarticular, rheumatoid factor negative phenotype as defined by the International League Against Rheumatism [13]. In addition, we studied pediatric-specific transcriptomes in CD4+ T cells from 5 healthy children. For the CD4+ T cell sample collection, children were excluded if they were taking antibiotics or steroids, had fever within the previous 36 hr, had an underlying autoimmune (e.g., type 1 diabetes) or inflammatory disease (e.g., asthma), or had a body mass index of > 30.

Whole blood was drawn into 10 mL CPT tubes (Becton Dickinson, Franklin Lakes, NJ), which is an evacuated blood collection tube system containing sodium citrate anticoagulant and blood separation media composed of a thixotropic polyester gel and a FICOLL^TM Hypaque^TM solution. Cell separation procedures were started within one hour from the time the specimens were drawn. Neutrophils were separated by density-gradient centrifugation at 1,700x g for 20 minutes. After removing red cells from neutrophils by hypotonic lysis, neutrophils were then immediately placed in TRIzol^® reagent (Invitrogen, Carlsbad, CA) and stored at −80°C until used for RNA isolation. Cells prepared in this fashion are more than 98% CD66b+ by flow cytometry and contain no contaminating CD14+ cells, as previously reported [14]. Thus, although these cell preparations contained small numbers of other granulocytes, they will be referred to here as “neutrophils” for brevity and convenience.

CD4+ T cells were obtained using the same collection techniques. CD4+ T cells were isolated from PBMC using a negative selection strategy.

All human subjects research was reviewed and approved by the University of Oklahoma and the University at Buffalo Woman & Children’s Institutional Review Boards (IRB). All research was conducted in accordance with the IRB-approved protocols. Written informed consent was obtained from the parents of children (patients and controls) participating in this study.

RNA isolation and sequencing

Total RNA was extracted using Trizol^® reagent according to manufacturer's directions. RNA was further purified using RNeasy MiniElute Cleanup kit including a DNase digest according to the manufacturer's instructions (QIAGEN, Valencia, CA). RNA was quantified spectrophotometrically (Nanodrop, Thermo Scientific, Wilmington, DE) and assessed for quality by capillary gel electrophoresis (Agilent 2100 Bioanalyzer; Agilent Technologies, Inc., Palo Alto, CA). Paired-end cDNA libraries were prepared for each sample and sequenced using the Illumina TruSeq RNA Sample Preparation Kit by following the manufacture's recommended procedures and sequenced using the Illumina HiSeq 2500. Library construction and RNA sequencing were performed in the University at Buffalo Genomics and Sequencing Core Facility.

Analysis of RNA-Seq Data

The raw reads obtained from paired-end RNA-Seq were mapped to human genome hg19, downloaded from the University of California Santa Cruz Genome Bioinformatics Site (http://genome.ucsc.edu), with no more than 2 read mismatches using tophat v2.0.10 [15]. Gene expression level was calculated as FPKM (Fragments Per Kb per Million reads) with Cufflinks v2.2.1 [16], the annotation gtf file provided for genes is gencode v19 annotation gtf file from GENCODE (http://www.gencodegenes.org) [17]. We used Cuffdiff v2.2.1 [18] for pairwise comparisons of ADU and HC to identify differentially expressed genes in human neutrophils, with a false discovery rate of 5%.

Verification of non-coding RNA transcripts identified in neutrophils within the JIA-Associated LD blocks

Complementary DNA was synthesized from total RNA using an iScript cDNA Synthesis kit (Bio-Rad). RT-PCR was performed using a HotStar Master PCR kit (Qiagen) with a Veriti thermocycler (Life Technologies) and included a control with no reverse transcription to exclude the possibility of an artifactual result due to contaminating genomic DNA. The temperature profile consisted of an initial step of 95°C for 10 minutes, followed by 35 cycles of 95°C for 30 seconds, 60°C for 30 seconds and 72°C for 30 seconds, and then a final step 72°C for 10 minutes. PCR products were resolved in 1.5% agarose gel. The nucleotide sequences of the primers were as follows: for ncTNFa (chr6: 31,544,379-31,544,485), 5′-CCTAATTCTGGGTTTGGGTTTGGG-3′ (forward) and 5′-CCTACTTTCACCTCCATCCATCCT-3′ (reverse); for ncSTAT4 (chr2:191,915,516-191,915,617), 5′-GTCTTGTGCAACTTCTTCCTTTC-3′ (forward) and 5′-ACCCTGTGACTGTTTGAGATTAC-3′ (reverse).

Defining LD regions

SNPs used in this query are listed in Table 1 and were previously reviewed by Hersh et al [1], Herlin et al [2] and Hinks et al [19]. We used a SNP Annotation And Proxy search (SNAP) database (http://www.broadinstitute.org/mpg/snap) [20] to define LD blocks based on the location of each SNP. In brief, we used the settings as follows: SNP dataset: 1000 Genome pilot 1 and HapMap 3 (release 2), r² threshhold: 0.9, Population Panel: CEU, Distance limit: 500. We selected the smallest number as our start location and the largest number as our stop location for each defined LD block.

View this table:

Table 1:

Histone marks in the SNP LD blocks in neutrophils and CD4+T cells

ENCODE Transcription Factor Binding Site (TFBS) enrichment

ENCODE TFBS data were downloaded from UCSC Genome Browser ENCODE data (http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeRegTfbsClustered/). Only the TFBS information annotated for blood cells was used. The whole genome was binned to 100bp bins and intersected with H3K4me1 or H3K27ac peak regions, which were used as background. Fisher’s exact test was applied to test the significant of the enrichment of TFBS of each transcription factor within H3K4me1 or H3K27ac peaks in LD blocks compared to peak regions in non-LD blocks across the whole genome. We set the cutoff of false discovery rate (FDR) to 0.05 for all the analyses.

Identification of differentially expressed neutrophil genes within LD regions

To determine whether any of the differentially expressed genes (DEGs) identified in the comparison of neutrophil transcriptomes of children with untreated JIA (ADU) and HC were located within the JIA-associated LD blocks, we used gencode v19 annotation and intersectBed to refine the gencode data to focus specifically on those LD blocks.

Identification of H3K4me1/H327ac histone marks within LD regions

We used H3K4me1/H3K27ac chromatin immunoprecipitation-sequencing (ChIP-Seq) data previously generated by our group from neutrophils of healthy adults [9] as our reference source for H3K4me1/H3K27ac genomic locations in neutrophils (GEO accession # GSE66896). The H3K4me1/H3K27ac ChIP-Seq data reported on the Roadmap Epigenomics collection were used as our reference source for H3K4me1/H3K27ac genomic locations in CD4+ T cells (GEO accession # 198927). The intersection of H3K4me1/H3K27ac peaks within LD regions was obtained using the bedtools intersect procedure [21] as described in our previous paper [9].

Identifying chromatin interactions in CD4+ T cells within JIA risk loci

Functional DNA elements such as enhancers do not always regulate the nearest gene. Indeed, in a recent study of chromatin conformation using HiC in T and B cell lines, Martin et al [8] showed that > 80% of long distance chromatin interactions occurred at distances of > 500 kb. We therefore queried an existing chromatin interaction analysis by paired-end tag sequencing (CHIA-PET) dataset for CD4+ T cells (GSE32677) [22] to determine chromatin interactions within the JIA-associated risk loci. This data set used H3K4me2 as a marker for active enhancers, and thus provides a means of estimating those genes/gene networks that are regulated by specific enhancers or groups of enhancers. The chromosome coordinates for the observed 6520 long-distance chromatin interactions were converted from hg18 to hg19 using UCSC Genome browser LiftOver tool (http://www.genome.ucsc.edu/cgi-bin/hgLiftOver). For these analyses, we queried all known JIA-associated risk regions.

Identifying molecular pathways of disease-associated SNPs

One means of using genetic risk data to provide mechanistic insights is to identify specific biological pathways encompassed by SNP-associated genes [23, 24]. However, there is a considerable breadth of opinion about what constitutes a “SNP-associated gene,” as the actual causal variants may be a considerable distance (in genomic terms) from the risk-associated SNP. We therefore used the method of Brodie et al [25] to identify molecular pathways encompassed by JIA-associated SNPs across broad genomic distances. We queried regions 200 kb upstream and downstream of the JIA-associated SNPs, as outlined in [25] to define genes and pathways associated with JIA risk SNPs.

Data availability

The authors state that all data necessary for confirming the conclusions presented in the article are represented fully within the article.

RESULTS

Results from RNA-Seq and location of differentially expressed genes within LD blocks

Our previous work had queried new SNPs recently identified on a genetic fine mapping study published by Hinks et al [4]. In the current study, we queried SNPs that were identified in reviews by Hersh et al [1] and Herlin et al [2] and those SNPs in Hinks et al [19] that have not been analyzed in previous work and have established associations with JIA (Table 1). We used these SNPs to generate linkage disequilibrium blocks (LD blocks) using SNAP [20] and used these defined regions to determine DEGs located within these regions. We included SNPs located within non-coding regions of the genome (introns or intergenic areas), i.e., the majority of the JIA-associated SNPs.

We identified 99 genes that showed differential expression between children with newly diagnosed, untreated JIA (ADU) and HC (FDR < 0.05). Predictably, GO analysis of these genes (compared to the background of neutrophil-expressed genes) showed enrichment for responses to viruses and as well as other immune responses. None of the differentially expressed genes was located within the LD blocks defined by the JIA-associated SNPs. This finding is consistent with what we have seen in whole blood microarray expression profiles [26].

Verification of non-coding transcripts within neutrophils in JIA-associated LD blocks

Review of RNA-Seq data using the University of California Santa Cruz (UCSC) Genome Browser suggested the presence of multiple non-coding RNA transcripts in neutrophils within the JIA-associated LD blocks. We used an rtPCR approach to confirm the presence of 2 of these transcripts, i.e., those within the LD blocks containing the rs3821236 and rs1800629 SNPs. Figure 1 shows detectable expression of these transcripts in neutrophils from a healthy child and a JIA patient. This finding corroborates the idea that the JIA-associated LD blocks contain functional, non-coding RNA transcripts.

Figure 1.

UCSC Genome Browser screen shot showing abundant intronic transcription in the TNF gene (A) and in the STAT4 gene (B) within introns. The black bars indicate the region amplified by PCR experiments, as described in the Methods section. c and d, Agarose gel images showing qualitative PCR amplification of intronic transcription (ncRNA) in the TNF gene (C) and in the STAT4 gene (D) in neutrophils. N = neutrophil RNA not subjected to reverse transcription; R = neutrophil RNA subjected to reverse transcription prior to amplification. Neutrophils were isolated from patients with juvenile idiopathic arthritis (JIA) and healthy children (HC). Absence of signal in PCR amplifications that were performed without prior reverse transcription indicates that the signal is not being produced by contaminating DNA.

Location of H3K4me1/H3K27ac marks within LD blocks

For these analyses, we examined only those LD blocks not previously examined in our earlier paper [9] as noted in the Methods section. We identified LD blocks for all of the 30 risk SNPs, which were located in a total of 23 LD blocks. For those 23 LD blocks, we examined whether there are functional elements located in neutrophils or CD4+ T cells within these risk-conferring regions.

Our first analyses were in neutrophils. Of the 23 risk loci, 16 LD blocks contained H3K4me1 marks, while 14 LD blocks contained H3K27ac marks; 13 LD blocks contained both H3K4me1 and H3K27ac marks (Table 1). In all, 17 out of the 23 LD blocks contained either H3K4me1 or H3K27ac marks. The H3K4me1 and H3K27ac marks are significantly enriched in these LD block regions compared with non-LD block regions as determined by Fisher’s exact test (p-value < 2.2e-16 for both H3K4me1 and H3K27ac marks). Representative screen shots from the UCSC genome browser within 2 of the LD regions of interest are shown in Figure 2. The LD block containing rs755622, for example (Figure 2A), includes the promoter for the macrophage inhibitory factor (MIF) gene, but this region is also rich in TF binding regions and contains H3K4me1/H3K27ac histone marks. Elevated MIF levels have been found in patients with all 3 major JIA subtypes [27], although these levels are most elevated in children with the oligoarticular and systemic forms of the disease. Similarly, the neutrophil chromatin landscape around rs2900180 (Figure 2B) shows considerable genomic complexity. There is an expressed gene within this region, i.e., the tumor necrosis factor receptor-associated factor 1 [TRFA1]); there has been strong interest in the tumor necrosis factor (TNF) signaling pathways since the efficacy of anti-TNF therapies was demonstrated in children with JIA [28]. At the same time, this region contains both H3K4me1/H3K27 marked enhancers and abundant TF binding not isolated to the TRAF promoter. Similarly, Figure S1 show multiple functional elements adjacent to potentially disease-relevant genes such as PTPN2 (S1A) and NFkB inhibitor-like protein 1 (NFkBIL1, S1B). Thus, these findings corroborate what we previously reported [9] regarding the genetic risk regions identified in the Hinks paper [4], i.e., the genetic risk largely encompasses regions of the genome rich in functional elements that serve to regulate and coordinate transcription on a genome-wide basis.

Figure 2. Representative screen shots from the UCSC Genome Browser showing functional elements within neutrophil genomes

The black horizontal bar at the top represents the LD blocks of the corresponding GWAS SNP. The black horizontal bars between individual tracks represent H3K27ac and H3K4me1 peak regions generated from ChIP-Seq data. The grey bars at the bottom of each image represent the transcription factor ChIP-seq of 161 factors from ENCODE with factorbook motifs. Panel (A) shows the LD block of rs755622 in neutrophils. Note there are expressed genes within this LD block, and the LD block contains H3K27ac and H3K4me1 regions with multiple potential TFBS within those regions. Panel (B) shows the LD block of rs2900180 in neutrophils. Note that there is one gene, the tumor necrosis factor receptor associated factor (TRAF1), which is expressed within this LD block. This LD block also contains both H3K27ac and H3K4me1 peak regions with multiple TFBS within those regions. Potential active enhancers (region containing both H3K27ac and H3K4me1 peak regions) are highlighted in red.

We next examined H3K4me1/H3K27ac marked regions in CD4+ T cells in those same 23 LD blocks. We queried whether there are functional elements located in CD4+ T cells within these risk-conferring regions. Of the 23 LD blocks, 20 contained H3K4me1 marks and 15 contained H3K27ac marks. All of the 15 LD blocks that contained H3K27ac marks also contained H3K4me1 marks (Table 1). Thus, 20 out of 23 of the LD blocks contained either H3K27ac or H3K4me1 marks in CD4+ T cells. The H3K27ac and H3K4me1 marks are significantly enriched in these LD block regions compared with non-LD block regions as determined by Fisher’s exact test (p-value < 2.2e-16 for both H3K27ac and H3K4me1 marks).

Transcription factor enrichment within the LD blocks

While H3K4me1/H3K27ac marks in and of themselves do not unambiguously identify functional enhancers[29], the appearance of transcription factor binding sites within H3K4me1/H3K27ac-marked regions is strong supportive evidence that these regions are functional [30]. Therefore, for the each of the 38 LD blocks containing the 51 JIA-associated risk SNPs, we queried whether there was significant enrichment for TF binding in the H3K27ac/H3K4me1 marked regions compared to non-LD block regions.

In these analyses, we included genetic risk information from the Hinks, Hersh, and Herlin papers. Thus, to determine whether there was additional evidence that the H3K4me1/H3K27ac marked regions within the JIA-associated LD blocks had functional significance, we queried whether there were significantly enriched TFBS within those loci compared to non-LD regions that have H3K4me1/H3K27ac marks.

For neutrophils, we observed highly enriched TF binding in promoter regions (defined as (-5K, 1K) of TSS) within these LD blocks. TFs that were particularly enriched included POLR2A, CHD1 and TAF1 (Figure 3A). We note that RUNX3 binding was also enriched within the H3K27ac-marked regions (Figure 3A), an interesting finding in that the RUNX3 locus itself is associated with JIA disease risk. This locus is characterized, in neutrophils, by dense intergenic TF binding, the presence of both H3K4me1 and H3K27ac peaks overlapping with the TF binding regions, and both intronic and intergenic RNA transcripts (not yet verified by rtPCR experiments; see Figure S3). These findings suggest complex relationships between enhancers and promoters, and perhaps non-coding RNA transcripts, leading to complex levels of gene regulation in neutrophils in health and as well as in disease states such as JIA.

Figure 3.

Enrichment of TFs in H3K4me1/H3K27ac mark regions. Heatmap of significantly enriched TFs with their TFBS in H3K27ac or H3K4me1 peak regions in (A) neutrophil cells and (B) CD4+T cells, considering promoter and distal regions separately. TFs are ranked according to FDR (the one lowest FDR with the highest rank) and normalized to (0,1) by dividing the absolute highest rank value for enrichment (red) and depletion (blue) separately. (normalized rank: > 0 enrichment; <0 depletion).

For CD4+ T cells, we observed a limited number of TFBS that were significantly enriched in the LD blocks conferring risk for JIA that co-localized with H3K27ac/H3K4me1 histone marks (Figure 3B). These regions were seen almost exclusively in promoters, defined as −5K − 1K of the transcription start site (TSS). These transcription factors included POLR2A and GTF2F1. The majority of the distal (non-promoter) regions within these LD blocks were relatively depleted of TFBS. Thus, the LD blocks with H3K27ac/H3K4me1 marks are not highly enriched with TFs. This finding contrasts with what we observed in neutrophils, as noted above, where there is significant enrichment of TF binding within H3K4me1/H3K27ac-marked regions.

Chromatin interactions in JIA risk loci

We used the 6520 long-distance chromatin interactions from a previously-reported ChIA-PET data analysis of CD4+ T cells [22] to define chromatin interactions in all the JIA-associated LD blocks. We identified 11 LD blocks that contained 48 long-range interactions. We note that all 6 of the interaction regions within the LD block containing the SNP, rs10818488, are identical with the regions within the LD block of rs2900180. This left 42 unique long-range interactions, 11 of which displayed more than one interaction within or intersected with the same LD blocks (S. Table 1). For these 42 long-range interactions, 20 were promoter-promoter (defined as (-5K, 1K) of TSS) interactions and 21 were promoter-distal (non-promoter) interactions, and one was a distal-distal interaction.

We next analyzed the CHIA-PET data in light of transcriptomes from CD4+ T cells derived from healthy children. We identified several genes within the CHIA-PET interacting regions, including ILB, DDX39B, RASSF5 and JAK1, that demonstrated high levels of expression (average FPKM > 100). We note that there are multiple small RNA molecules (snoRNA, miRNA, misc_RNA, snRNA) that are encoded within these regions, but the sequencing libraries were not prepared to detect their presence since the FPKM of those molecules is 0.

Among the 42 long-range interactions, 66 regions contained both H3K4me1 and H3K27ac marks, while 2 regions contained only H3K27ac marks and 11 contained only H3K4me1 marks. Only 5 of the interacting regions contained neither H3K4me1 nor H3K27ac marks. There were in all 61 genes associated with these long-range interacting regions. The genes associated with regions containing both H3K4me1 and H3K27ac marks have average higher FPKM than those genes that contained only H3K4me1 marks or those with no histone marks (Figure S2).

The CHIA-PET data point to complex layers of interaction and gene regulation within the JIA-associated risk loci. Note, for example, multiple points of interaction between the IRF1 and Corf56 genes, a locus that also encodes a Y-RNA species. These genes are physically adjacent to one-another on chromosome 5, as shown in Figure 4A. This region is dense with functional elements, including an H3K4me1/H3K27ac-marked enhancer, dense transcription factor binding, and multiple DNase1 hypersensitivity sites. It is interesting to note that our previous work from whole blood expression profiles of untreated JIA patients has suggested an important role for IRF1 networks in JIA pathogenesis [26]. Equally interesting is the detected physical interaction between the HLADQB1 and HLADQB2 loci. Association between JIA risk and class II HLA alleles has been long recognized, although the alleles at the DRB1 locus are thought to have stronger effects than those at the DQB1 or DQB2 loci [31]. The genome browser shot in Figure 4B demonstrates the complexity of this locus. Complex interactions between TRAF1 and multiple adjacent genes (C5, PHF19, and CNTRL) as well as TRAF1 intragenic interactions were also observed on the CHIA-PET data by Chepelev et al [22].

Figure 4.

Genome browser shot of the IRF1/C5orf56 locus in CD4 primary T cells (A) which is an epigenetically rich region showing dense transcription factor binding, multiple DNase1 hypersensity sites, and active enhancers with both H3K27ac and H3K4me1 marks. (B) TRAF1 locus with rich epigenetic architecture in this locus, including multiple TF binding sites, DNase1 hypersenitive sites, and active enhancers (H3K27ac and H3K4me1 marked regions).

Enriched molecular pathways of disease-associated SNPs

We used the method of Brodie et al [25] and the PANTHER database [32] to identify significantly enriched biological process and pathways for the 246 protein-coding genes and processed transcripts associated with the 51 JIA risk SNPs. Using a cutoff of Bonferroni <=0.05, the enriched PANTHER pathways are related to interleukin signaling and inflammation mediated by chemokine and cytokine signaling. The enriched biological processes involve antigen processing and presentation, cellular defense response as well as other immunologically relevant processes (S. Table 2).

Discussion

In this paper, we mined existing data sets in order to gain a better understanding of the nature of genetic risk in JIA. We find that most of the JIA-associated SNPs lie within the non-coding genome, in regions that are dense with TF binding sites, DNaseI hypersensitive sites, epigenetic signatures of enhancer function, and, in the case of neutrophils, non-coding RNA transcripts. These findings demonstrate the importance of considering the chromatin context around disease-associated SNPs, which may have no functional association at all with the “nearest gene.”

At the completion of the Human Genome Project, there was some surprise and even a little dismay at how little of the genome contains protein-encoding genes. When the entire project was done, only ~2% of the DNA was found to ENCODE proteins [33]. One of the transformative findings from the ENCODE and Roadmap Epigenomics projects has been the discovery of the richness of the non-coding genome. Enhancers, insulators, CTCF binding sites (which regulate chromatin conformation) etc are broadly distributed throughout the genome and demonstrate that the processes of regulating gene expression and coordinating expression on a genome-wide basis is extraordinarily complex [34]. The field of immunology is now making considerable progress in identifying these functional elements and how they regulate transcription and therefore immune responses [35].

The findings from ENCODE and Roadmap Epigenomics, and the work we describe here, demonstrate that identifying specific causal variants in JIA is going to be an ongoing challenge. A primary reason for this is the complexity of the risk loci. While many of these loci encompass genes that are of strong immunologic interest (e.g., MIF, TRAF1, PTNP2, NFkBIL1), these same loci are also rich in other functional elements that regulate transcription on a global basis. For example, Figure 2A is a simplified genome browser screen shot that shows the neutrophil chromatin landscape around rs755622.

While the actual SNP is within the promoter of the macrophage inhibitory factor gene, the entire LD block contains both H3K4me1 and H3K27ac histone marks (strong evidence for an active enhancer in this region) as well as rich TF binding that is not localized exclusively to the promoter. Similar genomic features can also be seen in this region in CD4+ T cells (data not shown). We also find find a large degree of complexity around rs2900180 (Figure 2B), and rs2847293/rs7234029, which are in the same LD block. The region containing rs2847293/rs7234029 is of broad general interest to the field of immunology, as this LD block includes the PTPN2 gene. While PTPN2 is generally considered a regulator of T cell responses, this protein tyrosine phosphatase is broadly expressed in innate immune cells, including in neutrophils, where it serves as a negative regulator of cytokine signaling [36]. However, the PTPN2 locus is also characterized by H3K4me1/H3K27ac-marked enhancers in neutrophils and CD4+ T cells, as well as an apparent non-coding RNA molecule in the intron between the first and second exons in neutrophils (Figure S1A). Thus, while it is tempting to focus on the protein-encoding genes within these regions as the location of the causal variants that lead to perturbed immune function, our data invite other interpretations. Enhancers, for example, may function at considerable distances (in genomic terms) from the genes they regulate, and often do not regulate the closest gene [37, 38]. This idea is shown in the recent work of Martin et al [8]. Using Hi-C chromatin conformation capture approaches, these authors demonstrated that > 80% of long-distance chromatin interactions within T and B cell lines occurred at distances > 500 MB. Thus, until there is more solid experimental evidence to indicate otherwise, we must be open to the possibility that it is the enhancer function of these regions and/or the function of the ncRNA that are perturbed by genetic variance rather than specific functions or regulation of the nearby proteins.

This idea is corroborated by our analysis of the published CD4+ T cells CHIA-PET data set. CHIA-PET, uses cross-linked DNA to identify physical interactions between regions of DNA, which may either be in close proximity or many megabases apart on a specific chromosome (or even regions that are on different chromosomes) [22, 39]. The dataset we queried used H3K4me2 as a general mark for active enhancers. We found that the JIA-associated genetic risk loci display abundant and complex physical interactions with (usually) nearby genomic elements. The region encoding the TRAF1 gene is emblematic of this complexity. We detected interactions between an H3K4me2-marked region adjacent to the TRAF1 gene and at least 3 other genes. These genes included C5, the 5^th component of the complement cascade, an important inflammatory mediator. The region also interacts with PHF19, which is itself a regulator of gene expression. PHF19 is a member of the polycomb repressive complex-2 (PRC-2) that is involved in gene silencing [40]. The significance of the interaction between the TRAF1 region and CNTRL (formerly for as CEP110), a centrosomic protein [41] is not immediately clear.

ENCODE also revealed that much of the genome is transcribed, and that many (if not most) of these non-coding RNA species are functional. In rheumatology, miRNA are the best-known of these non-coding RNA families, as numerous papers have shown links between specific miRNA molecules and specific immunologic features of the rheumatic diseases (e.g., [42]). Our own work has recently shown extensive re-wiring of mRNA-miRNA networks within JIA neutrophils [14]. However, the non-coding RNA world is broad, and includes RNA species that regulate chromatin accessibility [43], serve enhancer functions [44-46] and encode for small peptides [47]. Our current work also demonstrates the presence of at least two intronic, non-coding RNA molecules in the JIA-associated genetic risk loci. Such transcripts are abundant in neutrophils [48] and may regulate the decay of otherwise unstable mRNA transcripts [45]. The specific roles of these transcripts in neutrophil biology and/or the pathogenesis of JIA remain to be clarified. Their presence, however, serves to demonstrate that the localization of a disease-associated polymorphism within a specific gene does not necessarily mean that genetic risk is conferred by alterations in the protein-coding function of that specific gene.

This paper corroborates and expands upon a recent report from our laboratory demonstrating that regions of genetic risk in JIA generally do not encode for genes that show differential expression when children with the disease are compared with healthy children [26]. Using whole blood gene expression data from subjects of a National Institutes of Health-funded clinical trial, corroborated by an independent patient cohort, we reported that none of the >150 genes that showed differential expression between patients and healthy controls were encoded within the LD blocks associated with JIA risk. Thus, the mechanisms through which genetic variance leads to perturbed gene expression are likely to be more complex than was previously considered.

This paper adds to the mounting evidence that suggests that the field of pediatric rheumatology may require a fundamental re-assessment regarding theories of JIA pathogenesis. Long accepted as an “autoimmune disease” triggered by the recognition of a “self” peptide by the adaptive immune system, new evidence from the fields of genetics and genomics ought to prompt a reconsideration of this view. We propose, instead, that JIA emerges because of genetic and epigenetically-mediated alterations in leukocyte genomes. These alterations, we propose, involve both innate and adaptive immune systems and lead to expression of inflammatory mediators in the absence of the external signals normally required to initiate or sustain an inflammatory response. Our work in both whole blood buffy coats [49] and neutrophils [12] is consistent with that interpretation of the available gene expression and genetic data.

In conclusion, we demonstrate that the known genetic risk loci for JIA are enriched for multiple functional elements within the non-coding genome of human neutrophils. These functional elements include rich TF binding sites, H3K4me1 and H3K27ac-marked enhancers, and non-coding RNAs. We believe that these data as well newer understandings of genetic risk and genome function invite new understanding of JIA pathogenesis and may provide the basis for radically new approaches to therapy.

Footnotes

Lisha Zhu and Kaiyu Jiang contributed equally to this work.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

Lisha Zhu and Karstin Webber performed the primary bioinformatics analyses.

Kaiyu Jiang prepared the neutrophils and CD4+ T cells for RNA sequencing, directed and coordinated the RNA sequencing experiments, and performed rtPCR for non-coding transcripts.

Laiping Wong performed analyses to identify differentially expressed genes within the JIA-associated LD blocks.

Yanmin Chen assisted with cell isolation and RNA preparation for both neutrophils and CD4+ T cells.

Tal Liu oversaw and directed the bioinformatics analysis performed by Karstin Webber and Lisha Zhu.

James N. Jarvis designed the study, directed its implementation, as directed data analysis and interpretation.

Contributor Information

Lisha Zhu: lishazhu{at}buffalo.edu

Kaiyu Jiang: kaiyujia{at}buffalo.edu

Karstin Webber: karstinw{at}buffalo.edu

Laiping Wong: laipingw{at}buffalo.edu

Tao Liu: tliu4{at}buffalo.edu

Yanmin Chen: yanminch{at}buffalo.edu

James N. Jarvis: jamesjar{at}buffalo.edu

Acknowledgements

This work was supported by R01-AR060604 from the National institutes of Health (USA)

Footnotes

None of the authors receives compensation from any commercial entity for work relevant to the contents and findings in this manuscript

References

1.↵
Hersh AO, Prahalad S: Immunogenetics of juvenile idiopathic arthritis: A comprehensive review. Journal of autoimmunity 2015, 64:113–124.
OpenUrl CrossRef PubMed
2.↵
Herlin MP; Petersen MB; Herlin T: Update on genetic susceptibility and pathogenesis in juvenile idiopathic arthritis. EMJ Rheumatol 2014, 1:73–83.
OpenUrl
3.↵
Moncrieffe H, Hinks A, Ursu S, Kassoumeri L, Etheridge A, Hubank M, Martin P, Weiler T, Glass DN, Thompson SD et al. Generation of novel pharmacogenomic candidates in response to methotrexate in juvenile idiopathic arthritis: correlation between gene expression and genotype. Pharmacogenet Genomics 2010, 20(11):665–676.
OpenUrl CrossRef PubMed Web of Science
4.↵
Hinks A, Cobb J, Marion MC, Prahalad S, Sudman M, Bowes J, Martin P, Comeau ME, Sajuthi S, Andrews R et al. Dense genotyping of immune-related disease regions identifies 14 new susceptibility loci for juvenile idiopathic arthritis. Nat Genet 2013, 45(6):664–669.
OpenUrl CrossRef PubMed
5.↵
Maurano MT, Humbert R, Rynes E, Thurman RE, Haugen E, Wang H, Reynolds AP, Sandstrom R, Qu H, Brody J et al. Systematic localization of common disease-associated variation in regulatory DNA. Science (New York, NY) 2012, 337(6099):1190–1195.
OpenUrl Abstract/FREE Full Text
6.↵
Schaub MA, Boyle AP, Kundaje A, Batzoglou S, Snyder M: Linking disease associations with regulatory information in the human genome. Genome Res 2012, 22(9):1748–1759.
OpenUrl Abstract/FREE Full Text
7.↵
Farh KK, Marson A, Zhu J, Kleinewietfeld M, Housley WJ, Beik S, Shoresh N, Whitton H, Ryan RJ, Shishkin AA et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature 2015, 518(7539):337–343.
OpenUrl CrossRef PubMed
8.↵
Martin P, McGovern A, Orozco G, Duffus K, Yarwood A, Schoenfelder S, Cooper NJ, Barton A, Wallace C, Fraser P et al. Capture Hi-C reveals novel candidate genes and complex long-range interactions with related autoimmune risk loci. Nature communications 2015, 6:10069.
OpenUrl
9.↵
Jiang K, Zhu L, Buck MJ, Chen Y, Carrier B, Liu T, Jarvis JN: Disease-Associated Single-Nucleotide Polymorphisms From Noncoding Regions in Juvenile Idiopathic Arthritis Are Located Within or Adjacent to Functional Genomic Elements of Human Neutrophils and CD4+ T Cells. Arthritis & rheumatology (Hoboken, NJ) 2015, 67(7):1966–1977.
OpenUrl
10.↵
Grom AA, Hirsch R: T-cell and T-cell receptor abnormalities in the immunopathogenesis of juvenile rheumatoid arthritis. Current opinion in rheumatology 2000, 12(5):420–424.
OpenUrl CrossRef PubMed Web of Science
11.↵
Huttenlocher A, Smith JA: Neutrophils in pediatric autoimmune disease. Curr Opin Rheumatol 2015, 27(5):500–504.
OpenUrl
12.↵
Jarvis JN, Petty HR, Tang Y, Frank MB, Tessier PA, Dozmorov I, Jiang K, Kindzelski A, Chen Y, Cadwell C et al. Evidence for chronic, peripheral activation of neutrophils in polyarticular juvenile rheumatoid arthritis. Arthritis research & therapy 2006, 8(5):R154.
OpenUrl
13.↵
Petty RE, Southwood TR, Manners P, Baum J, Glass DN, Goldenberg J, He X, Maldonado-Cocco J, Orozco-Alcala J, Prieur AM et al. International League of Associations for Rheumatology classification of juvenile idiopathic arthritis: second revision, Edmonton, 2001. J Rheumatol 2004, 31(2):390–392.
OpenUrl FREE Full Text
14.↵
Hu Z, Jiang K, Frank MB, Chen Y, Jarvis JN: Complexity and Specificity of the Neutrophil Transcriptomes in Juvenile Idiopathic Arthritis. Scientific reports 2016, 6:27453.
OpenUrl
15.↵
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL: TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome biology 2013, 14(4):R36.
OpenUrl CrossRef PubMed
16.↵
Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, Salzberg SL, Wold BJ, Pachter L: Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature biotechnology 2010, 28(5):511–515.
OpenUrl CrossRef PubMed Web of Science
17.↵
Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, Aken BL, Barrell D, Zadissa A, Searle S et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome research 2012, 22(9):1760–1774.
OpenUrl Abstract/FREE Full Text
18.↵
Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L: Differential analysis of gene regulation at transcript resol ution with RNA-seq. Nature biotechnology 2013, 31(1):46–53.
OpenUrl CrossRef PubMed
19.↵
Hinks A, Cobb J, Marion MC, Prahalad S, Sudman M, Bowes J, Martin P, Comeau ME, Sajuthi S, Andrews R et al. Dense genotyping of immune-related disease regions identifies 14 new susceptibility loci for juvenile idiopathic arthritis. Nature genetics 2013, 45(6):664–669.
OpenUrl CrossRef PubMed
20.↵
Johnson AD, Handsaker RE, Pulit SL, Nizzari MM, O'Donnell CJ, de Bakker PI: SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics 2008, 24(24):2938–2939.
OpenUrl CrossRef PubMed Web of Science
21.↵
Quinlan AR, Hall IM: BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 2010, 26(6):841–842.
OpenUrl CrossRef PubMed Web of Science
22.↵
Chepelev I, Wei G, Wangsa D, Tang Q, Zhao K: Characterization of genome-wide enhancer-promoter interactions reveals co-expression of interacting genes and modes of higher order chromatin organization. Cell research 2012, 22(3):490–503.
OpenUrl CrossRef PubMed Web of Science
23.↵
Wang K, Li M, Hakonarson H: Analysing biological pathways in genome-wide association studies. Nature reviews Genetics 2010, 11(12):843–854.
OpenUrl CrossRef PubMed Web of Science
24.↵
Brodie A, Tovia-Brodie O, Ofran Y: Large scale analysis of phenotype-pathway relationships based on GWAS results. PloS one 2014, 9(7):e100887.
OpenUrl CrossRef PubMed
25.↵
Brodie A, Azaria JR, Ofran Y: How far from the SNP may the causative genes be? Nucleic acids research 2016, 44(13):6046–6054.
OpenUrl CrossRef PubMed
26.↵
Jiang K, Wong L, Sawle AD, Frank MB, Chen Y, Wallace CA, Jarvis JN: Whole blood expression profiling from the TREAT trial: insights for the pathogenesis of polyarticular juvenile idiopathic arthritis. Arthritis research & therapy 2016, 18(1):157.
OpenUrl CrossRef PubMed
27.↵
Meazza C, Travaglino P, Pignatti P, Magni-Manzoni S, Ravelli A, Martini A, De Benedetti F: Macrophage migration inhibitory factor in patients with juvenile idiopathic arthritis. Arthritis and rheumatism 2002, 46(1):232–237.
OpenUrl CrossRef PubMed Web of Science
28.↵
Lovell DJ, Giannini EH, Reiff A, Cawkwell GD, Silverman ED, Nocton JJ, Stein LD, Gedalia A, Ilowite NT, Wallace CA et al. Etanercept in childr en with polyarticular juvenile rheumatoid arthritis. Pediatric Rheumatology Collaborative Study Group. The New England journal of medicine 2000, 342(11):763–769.
OpenUrl CrossRef PubMed Web of Science
29.↵
Creyghton MP, Cheng AW, Welstead GG, Kooistra T, Carey BW, Steine EJ, Hanna J, Lodato MA, Frampton GM, Sharp PA et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proceedings of the National Academy of Sciences of the United States of America 2010, 107(50):21931–21936.
OpenUrl Abstract/FREE Full Text
30.↵
Alexander RP, Fang G, Rozowsky J, Snyder M, Gerstein MB: Annotating non-coding regions of the genome. Nature reviews Genetics 2010, 11(8):559–571.
OpenUrl CrossRef PubMed Web of Science
31.↵
Hollenbach JA, Thompson SD, Bugawan TL, Ryan M, Sudman M, Marion M, Langefeld CD, Thomson G, Erlich HA, Glass DN: Juvenile idiopathic arthritis and HLA class I and class II interactions and age-at-onset effects. Arthritis and rheumatism 2010, 62(6):1781–1791.
OpenUrl CrossRef PubMed Web of Science
32.↵
Mi H, Muruganujan A, Casagrande JT, Thomas PD: Large-scale gene function analysis with the PANTHER classification system. Nature protocols 2013, 8(8):1551–1566.
OpenUrl
33.↵
An integrated encyclopedia of DNA elements in the human genome. Nature 2012, 489(7414):57–74.
OpenUrl CrossRef PubMed Web of Science
34.↵
Gandin V, Sikstrom K, Alain T, Morita M, McLaughlan S, Larsson O, Topisirovic I: Polysome fractionation and analysis of mammalian translatomes on a genome-wide scale. Journal of visualized experiments : JoVE 2014(87).
35.↵
Winter DR, Jung S, Amit I: Making the case for chromatin profiling: a new tool to investigate the immune-regulatory landscape. Nature reviews Immunology 2015, 15(9):585–594.
OpenUrl
36.↵
Spalinger MR, McCole DF, Rogler G, Scharl M: Role of protein tyrosine phosphatases in regulating the immune system: implications for chronic intestinal inflammation. Inflammatory bowel diseases 2015, 21(3):645–655.
OpenUrl
37.↵
Shlyueva D, Stampfel G, Stark A: Transcriptional enhancers: from properties to genome-wide predictions. Nature reviews Genetics 2014, 15(4):272–286.
OpenUrl CrossRef PubMed
38.↵
Levine M, Cattoglio C, Tjian R: Looping back to leap forward: transcription enters a new era. Cell 2014, 157(1):13–25.
OpenUrl CrossRef PubMed Web of Science
39.↵
Li G, Cai L, Chang H, Hong P, Zhou Q, Kulakova EV, Kolchanov NA, Ruan Y: Chromatin Interaction Analysis with Paired-End Tag (ChIA-PET) sequencing technology and application. BMC genomics 2014, 15 Suppl 12:S11.
40.↵
Brien GL, Gambero G, O’Connell DJ, Jerman E, Turner SA, Egan CM, Dunne EJ, Jurgens MC, Wynne K, Piao L et al. Polycomb PHF19 binds H3K36me3 and recruits PRC2 and demethylase NO66 to embryonic stem cell genes during differentiation. Nature structural & molecular biology 2012, 19(12):1273–1281.
OpenUrl
41.↵
Ou YY, Mack GJ, Zhang M, Rattner JB: CEP110 and ninein are located in a specific domain of the centrosome associated with centrosome maturation. Journal of cell science 2002, 115(Pt 9):1825–1835.
OpenUrl Abstract/FREE Full Text
42.↵
Liang D, Shen N: MicroRNA involvement in lupus: the beginning of a new tale. Current opinion in rheumatology 2012, 24(5):489–498.
OpenUrl CrossRef PubMed
43.↵
Li LC: Chromatin remodeling by the small RNA machinery in mammalian cells. Epigenetics 2014, 9(1):45–52.
OpenUrl
44.↵
Birnbaum RY, Clowney EJ, Agamy O, Kim MJ, Zhao J, Yamanaka T, Pappalardo Z, Clarke SL, Wenger AM, Nguyen L et al. Coding exons function as tissue-specific enhancers of nearby genes. Genome research 2012, 22(6):1059–1068.
OpenUrl Abstract/FREE Full Text
45.↵
Li W, Notani D, Rosenfeld MG: Enhancers as non-coding RNA transcription units: recent insights and future perspectives. Nature reviews Genetics 2016, 17(4):207–223.
OpenUrl CrossRef PubMed
46.↵
Ahituv N: Exonic enhancers: proceed with caution in exome and genome sequencing studies. Genome medicine 2016, 8(1):14.
OpenUrl
47.↵
Pauli A, Rinn JL, Schier AF: Non-coding RNAs as regulators of embryogenesis. Nature reviews Genetics 2011, 12(2):136–149.
OpenUrl CrossRef PubMed
48.↵
Wong JJ, Ritchie W, Ebner OA, Selbach M, Wong JW, Huang Y, Gao D, Pinello N, Gonzalez M, Baidya K et al. Orchestrated intron retention regulates normal granulocyte differentiation. Cell 2013, 154(3):583–595.
OpenUrl CrossRef PubMed Web of Science
49.↵
Dozmorov I, Knowlton N, Tang Y, Shields A, Pathipvanich P, Jarvis JN, Centola M: Hypervariable genes--experimental error or hidden dynamics. Nucleic acids research 2004, 32(19):e147.
OpenUrl CrossRef PubMed

View the discussion thread.

Posted September 17, 2016.

Download PDF

Citation Tools

Subject Area

Genetics

Subject Areas

All Articles

Animal Behavior and Cognition (5213)
Biochemistry (11744)
Bioengineering (8751)
Bioinformatics (29193)
Biophysics (14968)
Cancer Biology (12094)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18303)
Genetics (12244)
Genomics (16801)
Immunology (11866)
Microbiology (28082)
Molecular Biology (11592)
Neuroscience (60959)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4957)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2885)
Systems Biology (7339)
Zoology (1651)

[1] 1.↵
Hersh AO, Prahalad S: Immunogenetics of juvenile idiopathic arthritis: A comprehensive review. Journal of autoimmunity 2015, 64:113–124.
OpenUrl CrossRef PubMed

[2] 2.↵
Herlin MP; Petersen MB; Herlin T: Update on genetic susceptibility and pathogenesis in juvenile idiopathic arthritis. EMJ Rheumatol 2014, 1:73–83.
OpenUrl

[3] 3.↵
Moncrieffe H, Hinks A, Ursu S, Kassoumeri L, Etheridge A, Hubank M, Martin P, Weiler T, Glass DN, Thompson SD et al. Generation of novel pharmacogenomic candidates in response to methotrexate in juvenile idiopathic arthritis: correlation between gene expression and genotype. Pharmacogenet Genomics 2010, 20(11):665–676.
OpenUrl CrossRef PubMed Web of Science

[4] 4.↵
Hinks A, Cobb J, Marion MC, Prahalad S, Sudman M, Bowes J, Martin P, Comeau ME, Sajuthi S, Andrews R et al. Dense genotyping of immune-related disease regions identifies 14 new susceptibility loci for juvenile idiopathic arthritis. Nat Genet 2013, 45(6):664–669.
OpenUrl CrossRef PubMed

[5] 5.↵
Maurano MT, Humbert R, Rynes E, Thurman RE, Haugen E, Wang H, Reynolds AP, Sandstrom R, Qu H, Brody J et al. Systematic localization of common disease-associated variation in regulatory DNA. Science (New York, NY) 2012, 337(6099):1190–1195.
OpenUrl Abstract/FREE Full Text

[6] 6.↵
Schaub MA, Boyle AP, Kundaje A, Batzoglou S, Snyder M: Linking disease associations with regulatory information in the human genome. Genome Res 2012, 22(9):1748–1759.
OpenUrl Abstract/FREE Full Text

[7] 7.↵
Farh KK, Marson A, Zhu J, Kleinewietfeld M, Housley WJ, Beik S, Shoresh N, Whitton H, Ryan RJ, Shishkin AA et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature 2015, 518(7539):337–343.
OpenUrl CrossRef PubMed

[8] 8.↵
Martin P, McGovern A, Orozco G, Duffus K, Yarwood A, Schoenfelder S, Cooper NJ, Barton A, Wallace C, Fraser P et al. Capture Hi-C reveals novel candidate genes and complex long-range interactions with related autoimmune risk loci. Nature communications 2015, 6:10069.
OpenUrl

[9] 9.↵
Jiang K, Zhu L, Buck MJ, Chen Y, Carrier B, Liu T, Jarvis JN: Disease-Associated Single-Nucleotide Polymorphisms From Noncoding Regions in Juvenile Idiopathic Arthritis Are Located Within or Adjacent to Functional Genomic Elements of Human Neutrophils and CD4+ T Cells. Arthritis & rheumatology (Hoboken, NJ) 2015, 67(7):1966–1977.
OpenUrl

[10] 10.↵
Grom AA, Hirsch R: T-cell and T-cell receptor abnormalities in the immunopathogenesis of juvenile rheumatoid arthritis. Current opinion in rheumatology 2000, 12(5):420–424.
OpenUrl CrossRef PubMed Web of Science

[11] 11.↵
Huttenlocher A, Smith JA: Neutrophils in pediatric autoimmune disease. Curr Opin Rheumatol 2015, 27(5):500–504.
OpenUrl

[12] 12.↵
Jarvis JN, Petty HR, Tang Y, Frank MB, Tessier PA, Dozmorov I, Jiang K, Kindzelski A, Chen Y, Cadwell C et al. Evidence for chronic, peripheral activation of neutrophils in polyarticular juvenile rheumatoid arthritis. Arthritis research & therapy 2006, 8(5):R154.
OpenUrl

[13] 13.↵
Petty RE, Southwood TR, Manners P, Baum J, Glass DN, Goldenberg J, He X, Maldonado-Cocco J, Orozco-Alcala J, Prieur AM et al. International League of Associations for Rheumatology classification of juvenile idiopathic arthritis: second revision, Edmonton, 2001. J Rheumatol 2004, 31(2):390–392.
OpenUrl FREE Full Text

[14] 14.↵
Hu Z, Jiang K, Frank MB, Chen Y, Jarvis JN: Complexity and Specificity of the Neutrophil Transcriptomes in Juvenile Idiopathic Arthritis. Scientific reports 2016, 6:27453.
OpenUrl

[15] 15.↵
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL: TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome biology 2013, 14(4):R36.
OpenUrl CrossRef PubMed

[16] 16.↵
Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, Salzberg SL, Wold BJ, Pachter L: Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature biotechnology 2010, 28(5):511–515.
OpenUrl CrossRef PubMed Web of Science

[17] 17.↵
Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, Aken BL, Barrell D, Zadissa A, Searle S et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome research 2012, 22(9):1760–1774.
OpenUrl Abstract/FREE Full Text

[18] 18.↵
Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L: Differential analysis of gene regulation at transcript resol ution with RNA-seq. Nature biotechnology 2013, 31(1):46–53.
OpenUrl CrossRef PubMed

[19] 19.↵
Hinks A, Cobb J, Marion MC, Prahalad S, Sudman M, Bowes J, Martin P, Comeau ME, Sajuthi S, Andrews R et al. Dense genotyping of immune-related disease regions identifies 14 new susceptibility loci for juvenile idiopathic arthritis. Nature genetics 2013, 45(6):664–669.
OpenUrl CrossRef PubMed

[20] 20.↵
Johnson AD, Handsaker RE, Pulit SL, Nizzari MM, O'Donnell CJ, de Bakker PI: SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics 2008, 24(24):2938–2939.
OpenUrl CrossRef PubMed Web of Science

[21] 21.↵
Quinlan AR, Hall IM: BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 2010, 26(6):841–842.
OpenUrl CrossRef PubMed Web of Science

[22] 22.↵
Chepelev I, Wei G, Wangsa D, Tang Q, Zhao K: Characterization of genome-wide enhancer-promoter interactions reveals co-expression of interacting genes and modes of higher order chromatin organization. Cell research 2012, 22(3):490–503.
OpenUrl CrossRef PubMed Web of Science

[23] 23.↵
Wang K, Li M, Hakonarson H: Analysing biological pathways in genome-wide association studies. Nature reviews Genetics 2010, 11(12):843–854.
OpenUrl CrossRef PubMed Web of Science

[24] 24.↵
Brodie A, Tovia-Brodie O, Ofran Y: Large scale analysis of phenotype-pathway relationships based on GWAS results. PloS one 2014, 9(7):e100887.
OpenUrl CrossRef PubMed

[25] 25.↵
Brodie A, Azaria JR, Ofran Y: How far from the SNP may the causative genes be? Nucleic acids research 2016, 44(13):6046–6054.
OpenUrl CrossRef PubMed

[26] 26.↵
Jiang K, Wong L, Sawle AD, Frank MB, Chen Y, Wallace CA, Jarvis JN: Whole blood expression profiling from the TREAT trial: insights for the pathogenesis of polyarticular juvenile idiopathic arthritis. Arthritis research & therapy 2016, 18(1):157.
OpenUrl CrossRef PubMed

[27] 27.↵
Meazza C, Travaglino P, Pignatti P, Magni-Manzoni S, Ravelli A, Martini A, De Benedetti F: Macrophage migration inhibitory factor in patients with juvenile idiopathic arthritis. Arthritis and rheumatism 2002, 46(1):232–237.
OpenUrl CrossRef PubMed Web of Science

[28] 28.↵
Lovell DJ, Giannini EH, Reiff A, Cawkwell GD, Silverman ED, Nocton JJ, Stein LD, Gedalia A, Ilowite NT, Wallace CA et al. Etanercept in childr en with polyarticular juvenile rheumatoid arthritis. Pediatric Rheumatology Collaborative Study Group. The New England journal of medicine 2000, 342(11):763–769.
OpenUrl CrossRef PubMed Web of Science

[29] 29.↵
Creyghton MP, Cheng AW, Welstead GG, Kooistra T, Carey BW, Steine EJ, Hanna J, Lodato MA, Frampton GM, Sharp PA et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proceedings of the National Academy of Sciences of the United States of America 2010, 107(50):21931–21936.
OpenUrl Abstract/FREE Full Text

[30] 30.↵
Alexander RP, Fang G, Rozowsky J, Snyder M, Gerstein MB: Annotating non-coding regions of the genome. Nature reviews Genetics 2010, 11(8):559–571.
OpenUrl CrossRef PubMed Web of Science

[31] 31.↵
Hollenbach JA, Thompson SD, Bugawan TL, Ryan M, Sudman M, Marion M, Langefeld CD, Thomson G, Erlich HA, Glass DN: Juvenile idiopathic arthritis and HLA class I and class II interactions and age-at-onset effects. Arthritis and rheumatism 2010, 62(6):1781–1791.
OpenUrl CrossRef PubMed Web of Science

[32] 32.↵
Mi H, Muruganujan A, Casagrande JT, Thomas PD: Large-scale gene function analysis with the PANTHER classification system. Nature protocols 2013, 8(8):1551–1566.
OpenUrl

[33] 33.↵
An integrated encyclopedia of DNA elements in the human genome. Nature 2012, 489(7414):57–74.
OpenUrl CrossRef PubMed Web of Science

[34] 34.↵
Gandin V, Sikstrom K, Alain T, Morita M, McLaughlan S, Larsson O, Topisirovic I: Polysome fractionation and analysis of mammalian translatomes on a genome-wide scale. Journal of visualized experiments : JoVE 2014(87).

[35] 35.↵
Winter DR, Jung S, Amit I: Making the case for chromatin profiling: a new tool to investigate the immune-regulatory landscape. Nature reviews Immunology 2015, 15(9):585–594.
OpenUrl

[36] 36.↵
Spalinger MR, McCole DF, Rogler G, Scharl M: Role of protein tyrosine phosphatases in regulating the immune system: implications for chronic intestinal inflammation. Inflammatory bowel diseases 2015, 21(3):645–655.
OpenUrl

[37] 37.↵
Shlyueva D, Stampfel G, Stark A: Transcriptional enhancers: from properties to genome-wide predictions. Nature reviews Genetics 2014, 15(4):272–286.
OpenUrl CrossRef PubMed

[38] 38.↵
Levine M, Cattoglio C, Tjian R: Looping back to leap forward: transcription enters a new era. Cell 2014, 157(1):13–25.
OpenUrl CrossRef PubMed Web of Science

[39] 39.↵
Li G, Cai L, Chang H, Hong P, Zhou Q, Kulakova EV, Kolchanov NA, Ruan Y: Chromatin Interaction Analysis with Paired-End Tag (ChIA-PET) sequencing technology and application. BMC genomics 2014, 15 Suppl 12:S11.

[40] 40.↵
Brien GL, Gambero G, O’Connell DJ, Jerman E, Turner SA, Egan CM, Dunne EJ, Jurgens MC, Wynne K, Piao L et al. Polycomb PHF19 binds H3K36me3 and recruits PRC2 and demethylase NO66 to embryonic stem cell genes during differentiation. Nature structural & molecular biology 2012, 19(12):1273–1281.
OpenUrl

[41] 41.↵
Ou YY, Mack GJ, Zhang M, Rattner JB: CEP110 and ninein are located in a specific domain of the centrosome associated with centrosome maturation. Journal of cell science 2002, 115(Pt 9):1825–1835.
OpenUrl Abstract/FREE Full Text

[42] 42.↵
Liang D, Shen N: MicroRNA involvement in lupus: the beginning of a new tale. Current opinion in rheumatology 2012, 24(5):489–498.
OpenUrl CrossRef PubMed

[43] 43.↵
Li LC: Chromatin remodeling by the small RNA machinery in mammalian cells. Epigenetics 2014, 9(1):45–52.
OpenUrl

[44] 44.↵
Birnbaum RY, Clowney EJ, Agamy O, Kim MJ, Zhao J, Yamanaka T, Pappalardo Z, Clarke SL, Wenger AM, Nguyen L et al. Coding exons function as tissue-specific enhancers of nearby genes. Genome research 2012, 22(6):1059–1068.
OpenUrl Abstract/FREE Full Text

[45] 45.↵
Li W, Notani D, Rosenfeld MG: Enhancers as non-coding RNA transcription units: recent insights and future perspectives. Nature reviews Genetics 2016, 17(4):207–223.
OpenUrl CrossRef PubMed

[46] 46.↵
Ahituv N: Exonic enhancers: proceed with caution in exome and genome sequencing studies. Genome medicine 2016, 8(1):14.
OpenUrl

[47] 47.↵
Pauli A, Rinn JL, Schier AF: Non-coding RNAs as regulators of embryogenesis. Nature reviews Genetics 2011, 12(2):136–149.
OpenUrl CrossRef PubMed

[48] 48.↵
Wong JJ, Ritchie W, Ebner OA, Selbach M, Wong JW, Huang Y, Gao D, Pinello N, Gonzalez M, Baidya K et al. Orchestrated intron retention regulates normal granulocyte differentiation. Cell 2013, 154(3):583–595.
OpenUrl CrossRef PubMed Web of Science

[49] 49.↵
Dozmorov I, Knowlton N, Tang Y, Shields A, Pathipvanich P, Jarvis JN, Centola M: Hypervariable genes--experimental error or hidden dynamics. Nucleic acids research 2004, 32(19):e147.
OpenUrl CrossRef PubMed

Chromatin Landscapes and Genetic Risk For Juvenile Idiopathic Arthritis

Abstract

Introduction

Materials and Methods

RNA isolation and sequencing

Analysis of RNA-Seq Data

Verification of non-coding RNA transcripts identified in neutrophils within the JIA-Associated LD blocks

Defining LD regions

ENCODE Transcription Factor Binding Site (TFBS) enrichment

Identification of differentially expressed neutrophil genes within LD regions

Identification of H3K4me1/H327ac histone marks within LD regions

Identifying chromatin interactions in CD4+ T cells within JIA risk loci

Identifying molecular pathways of disease-associated SNPs

Data availability

RESULTS

Results from RNA-Seq and location of differentially expressed genes within LD blocks

Verification of non-coding transcripts within neutrophils in JIA-associated LD blocks

Location of H3K4me1/H3K27ac marks within LD blocks

Transcription factor enrichment within the LD blocks

Chromatin interactions in JIA risk loci

Enriched molecular pathways of disease-associated SNPs

Discussion

Footnotes

Competing interests

Authors’ contributions

Contributor Information

Acknowledgements

Footnotes

References

Citation Manager Formats

Subject Area