Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Investigation of an LPA KIV-2 nonsense mutation in 11,000 individuals: the importance of linkage disequilibrium structure in LPA genetics

Silvia Di Maio, Gertraud Streiter, Rebecca Grüneis, Claudia Lamina, Manuel Maglione, Dietmar Öfner, Barbara Thorand, Annette Peters, Kai-Uwe Eckardt, Anna Köttgen, Florian Kronenberg, View ORCID ProfileStefan Coassin
doi: https://doi.org/10.1101/848945
Silvia Di Maio
1Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gertraud Streiter
1Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rebecca Grüneis
1Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Claudia Lamina
1Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Manuel Maglione
2Department of Visceral, Transplant and Thoracic Surgery, Medical University of Innsbruck, Innsbruck, Austria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dietmar Öfner
2Department of Visceral, Transplant and Thoracic Surgery, Medical University of Innsbruck, Innsbruck, Austria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Barbara Thorand
3Institute of Epidemiology, Helmholtz Zentrum München – German Research Center for Environmental Health (GmbH), Neuherberg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Annette Peters
3Institute of Epidemiology, Helmholtz Zentrum München – German Research Center for Environmental Health (GmbH), Neuherberg, Germany
4German Centre for Cardiovascular Research (DZHK), Partner Site Munich Heart Alliance, Munich, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kai-Uwe Eckardt
5Department of Nephrology and Hypertension, Friedrich-Alexander Universität Erlangen-Nürnberg, Erlangen, Germany
6Department of Nephrology and Medical Intensive Care, Charité—Universitätsmedizin Berlin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Anna Köttgen
7Institute of Genetic Epidemiology, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Florian Kronenberg
1Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stefan Coassin
1Institute of Genetic Epidemiology, Department of Genetics and Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Stefan Coassin
  • For correspondence: Stefan.Coassin@i-med.ac.at
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Objective Elevated Lp(a) plasma concentrations are determined mainly genetically by the LPA gene locus, but up to 70% of the coding sequence is located in the so-called “kringle IV type 2” (KIV-2) copy number variation. This region is not resolved by common genotyping technologies and large epidemiological studies on this region are therefore missing. The Arg21Ter variant (R21X, variant frequency ≈2%) is a functional variant in this region, but it has never been analyzed in large cohorts and is it unknown whether it is captured by genome-wide association studies.

Approach and Results We developed a highly sensitive allele-specific qPCR assay and genotyped R21X in 10,910 individuals from three populations (GCKD, KORA F3, KORA F4). R21X carriers showed significantly lower mean Lp(a) concentrations (−11.7 mg/dL [−15.5;−7.82], p=3.39e-32). Of particular note, virtually all R21X carriers also carried the splice mutation rs41272114 (D’=0.957, R2=0.275), as confirmed by pulsed-field gel electrophoresis and long-range haplotyping. This proposes that the R21X mutation arose on the background of the rs41272114 splice variant.

Conclusions We performed the largest epidemiological study on an LPA KIV-2 variant so far. Interestingly, R21X is located on the same haplotype as the splice mutation rs41272114, creating “double-null” LPA alleles that are inactivated by two independent mutations. The effect of the R21X nonsense mutation can thus not be discerned from the effect of rs41272114 splice site mutation. This emphasizes the importance of assessing the complex LD structure within LPA even for functional variants.

Introduction

The lipoprotein(a) [Lp(a)] plasma concentration is one of the most prominent genetically determined factors for cardiovascular diseases1–7. Elevated Lp(a) concentrations concern 14-25% of the general population2 and increase the cardiovascular disease (CVD) risk up to more than three-fold for very high concentrations above the 99th percentile8. More than 90% of Lp(a) variance is genetically determined by the LPA gene locus9, which encodes apolipoprotein(a) [apo(a)], the distinctive structural protein of the Lp(a) particle. The apo(a) protein consists of 10 types of so-called kringle-IV domains (KIV-1 to −10), one kringle V domain and a protease domain. The KIV-2 domain is encoded by a hypervariable copy number variation (CNV), which is present in up to ≈40 copies per gene allele, resulting in a pronounced size polymorphism of the apo(a) protein10 and theoretically ≈1600 possible genotypes in the population. The KIV-2-CNV explains 30-70% of the Lp(a) 1 concentrations.

The apo(a) isoform size is inversely correlated with Lp(a) concentrations1. Low molecular weight (LMW) apo(a) isoforms (≤22 KIV domains) are associated with 5-10 fold higher median Lp(a) concentrations than high molecular weight (HMW) isoforms (>22 KIV domains)1 but the Lp(a) levels of unrelated individuals carrying a same-sized isoform combination can still vary by up to 200-fold11,12. In contrast, same-sized alleles within families (i.e. alleles that are identical-by-descent) are associated with a much smaller variation in Lp(a) (typically <2.5-fold11). This indicates that genetic variants exist that regulate the Lp(a) concentrations in addition to isoform size 12,13. Genome-wide association studies (GWAS) have identified dozens of SNPs distributed over a two megabases region14–16, but few bona-fide functional variants have been identified13,17–19.

The KIV-2 region encompasses up to 70% of the LPA coding sequence13 and is therefore an obvious candidate region to search for functional variants affecting Lp(a) concentrations. Very little is known about the impact of sequence variation in the KIV-2 CNV on Lp(a) concentrations because commonly used sequencing and genotyping technologies are not able to resolve variation within this region. Accordingly, it is also unclear whether KIV-2 variants are captured by known GWAS hits via linkage disequilibrium (LD) or whether they are indeed mostly independent. Some studies suggested a strong LD structure spanning the whole kringle region20–23, but few details have been reported.

Recently, an ultra-deep next generation sequencing (NGS) approach with a customized bioinformatic analysis pipeline has allowed cataloging variation within the KIV-2 region24. This revealed hundreds of variants and provided several new putative regulators of Lp(a) levels. For example, the G4925A variant13 found in 20% of the population is associated with an Lp(a) reduction of up to ≈30 mg/dL and explains a considerable fraction of the individuals presenting low Lp(a) concentrations despite carrying a LMW isoform. This aspect of the relationship between Lp(a) concentrations and apo(a) isoform size is poorly understood so far.

The nonsense mutation KIV-2 R21X (g. 61 C>T in 19, 640C>T in 24) is another likely causal single nucleotide polymorphism (SNP) in the KIV-2 region. It leads to a truncated protein that is rapidly degraded19. Parson et al.19 identified it by a laborious cloning approach and reported a minor allele frequency [MAF] of 1.67% in 405 individuals19. However, because standard genotyping technologies like TaqMan genotyping assays and SNP microarrays are not specific and sensitive enough to detect a variant that is present in only one (or a few) of up to 80 nearly identical repeats (i.e. 1.2% mutation level), the R21X variant was not further explored in large epidemiological studies. Accordingly, R21X has never been put into the context of the findings from GWAS on Lp(a)14–16 and it is unknown whether any of the LPA SNPs detected in GWAS studies is in LD with R21X.

We developed an allele-specific TaqMan PCR assay (ast-PCR) targeting the R21X variant, as well as the previously described KIV-2 variant G4925A13, and assessed the effect of R21X on Lp(a) concentrations in nearly 11,000 individuals. We then used genome-wide SNP data from the German Chronic Kidney Disease (GCKD) study to assess the LD of R21X with SNPs outside the KIV-2 region to link it to available GWAS datasets. Finally, given the fact that the effect of a functional LPA SNP depends strongly on which gene allele it is located, we determined the allelic location of R21X by pulsed-field electrophoresis (PFGE) and show the existence of moderately frequent “double-null” LPA alleles that are inactivated by two independent loss-of-function mutations.

Materials and Methods

Populations

Our study involved 10,910 individuals from three studies, namely GCKD25 (German Chronic Kidney Disease), KORA (Cooperative Health Research in the region of Augsburg) F326 and KORA F426. Informed consent was obtained from each participant and the studies were approved by the respective Institutional Review Boards. Details on the studies are given in Table 1 and in the Supplemental Methods. In brief, the KORA F3 and F4 studies are follow-up examinations of two nonoverlapping surveys drawn from the general population living in the region of Augsburg, Southern Germany (KORA S3 and S4). GCKD is an ongoing prospective observational study of 5,217 Caucasian patients with moderately severe chronic kidney disease at enrollment recruited at nine institutions in Germany.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1 Descriptive statistics.

Continuous variables are provided as mean ± SD and [25%, 50%, 75%] percentiles. LMW: low molecular weight apo(a) isoforms. HMW: high molecular weight apo(a) isoforms. F: females. M: males. eGFR: estimated glomerular filtration rate. Lp(a): lipoprotein(a).

DNA specimens from common study populations are not suitable for pulsed-field gel electrophoresis (PFGE), because it requires megabase-sized DNA. Since the buffy coat samples that are required for such an agarose-plug DNA preparation are not commonly available for population studies, we used for the PFGE experiments samples from the CAVASIC study27,28, from an ongoing collection of liver tissue specimens for Lp(a) research (IRB Medical University of Innsbruck, AN2015-0056) and from anonymous blood samples obtained from the blood bank of the University Hospital of Innsbruck, Austria.

Ast-PCR for R21X typing

We designed an allele-specific triplex TaqMan PCR assay (ast-PCR) amplifying the mutant bases of R21X19 and the G4925A13, as well as a positive amplification control (design illustrated in Supplemental Figure I). The R21X and the G4925A variants were introduced into pSPL3 plasmids containing one KIV-2 repeat13 using the Agilent Technologies QuickChange II Site-Directed Mutagenesis Kit with minor modifications, Supplemental Methods). To provide additional thermodynamic disadvantage to unspecific pairings 29, various base mismatches were introduced in the allele-specific primers on positions −2 or −3 (from the 3’ end) and the performance of different primer designs was tested on plasmid mixes mimicking mutation levels from 100% to 0% mutant fraction (Supplemental Figure II). The most specific primers were taken forward (Supplemental Figure II). Fluorescently labelled, locus-specific TaqMan probes were added to allow amplification detection in a high throughput setting. An amplicon in PNPLA3 served as positive amplification control to detect false-negative reactions due to PCR failure. Technical details are provided in the Supplemental Methods and in Supplemental Tables I and II. The assay was run on a 384-well ThermoFisher QuantStudio 6 qPCR system. The R21X assay was validated both against ultra-deep NGS data from Coassin and Schönherr et al, 201924 and against a commercial cast-PCR assay (ThermoFisher; as used in 13) with a sensitivity of 0.2% mutant fraction (determined according to manufacturer instructions on a NGS-validated sample). For validation, our assay was run on 376 samples from KORA F4, identifying 14 R21X-carriers, which were all confirmed also by the commercial castPCR assay. The reproducibility was tested on 477 samples run in duplicates. Additionally, each 384 well qPCR plate (n=34) contained a positive control sample. A slightly modified ast-PCR protocol was used to genotype the gene alleles separated by PFGE (Supplemental Methods).

Ast-PCR data analysis

Figure 1 exemplifies the assay data analysis rationale. A possible unspecific amplification signal of the WT variant alleles will occur later in the qPCR amplification than the true specific amplification of the mutant variant allele [CT(carriers)<CT(non-carriers)]. This creates two CT distributions whose widths are defined by stochastic fluctuation in the amplification of the target (e.g. due to slightly varying input amount) and, for the mutant, the fraction of KIV-2 repeats affected (Figure 1A). DNA input was 20 ng in all samples. Previous data13,24 indicates that no more than one to three repeats are affected by R21X, which translates to maximum ≈1.6 cycles differences due to the mutation level.

Figure 1
  • Download figure
  • Open in new tab
Figure 1 Analysis strategy

A. Exemplary ast-PCR amplification plot. B. Discrimination of the two CT distributions using a systematic clustering approach. The orange and pink horizontal bars (below the x-axis) identify the samples, which cannot be uniquely assigned to one of the two distributions and are therefore excluded from the analysis. Orange bar: upper 1% of the carrier distribution and lower 1% of the non-carrier distribution. Red bar: upper 2.5% of the carrier distribution and lower 2.5% of the noncarrier distribution (more conservative; used in this analysis). Plot generated using the provided R script.

To avoid human bias and have a systematic approach for sample assignment beyond pure visual clustering of the amplification curves, the optimal discrimination threshold between the CT distributions of carriers and non-carriers was estimated using a bagged clustering algorithm30 implemented in the R function ‘classIntervals’ (package ‘classInt’) and two normal distributions were fitted to the two CT distributions using the R package ‘VGAM’31 (Figure 1B). Details are provided in the Supplemental Methods. Samples that could not be assigned unambiguously to one of the CT distributions (i.e. which cannot be unambiguously identified as carriers or non-carriers) were excluded. The exclusion rate was 0.7% in GCKD (35/4974), 1.6% in KORA F3 (52/3157) and 0.5% in KORA F4 (15/3063). The custom R function used for analysis is available at https://github.com/scogi/r21x_analysis.

Lp(a) phenotyping

The Lp(a) concentrations and apo(a) isoforms were determined by ELISA and Western blotting, respectively, as described earlier32,33. All analyses were performed in the same laboratory at the Institute of Genetic Epidemiology, Medical University of Innsbruck, Austria and evaluated by the same experienced researcher.

Identification of proxy SNPs

We used the genome-wide SNP data available for the GCKD study to search for a proxy SNP for R21X that would allow linking R21X to existing results from GWAS. Following the rationale that a SNP in LD with R21X will present a similar effect on Lp(a) and that the observed effect of R21X on Lp(a) should have been easily detected by our recent GWAS on Lp(a) (n=13,781)14, we created a contingency table of each of the 66 top hits of the isoform-adjusted model of our recent GWAS metaanalysis on the Lp(a) concentrations with the R21X14 and analyzed it using the Fisher’s exact test. We selected the two SNPs with the most significant p-values (rs2489940, rs41272114) and calculated the LD using CubeX34.

Pulsed-field gel electrophoresis

We performed LPA PFGE-based genotyping13,35–37 to assess on which gene allele R21X is located and to confirm the co-localization of R21X and rs41272114 experimentally. Two different restriction enzymes were used for LPA PFGE. KpnI excises a region from KIV-1 to KIV-535–37 and allows precise allele sizing. Conversely, Kpn2I digestion excises a much larger fragment that spans from LPA to MAP3K438 (Supplemental Figure III) and allows long-range haplotyping of variants by performing the genotyping directly on the previously separated gene alleles.

The LPA gene alleles of eight R21X carriers and three R21X-negative samples were separated by PFGE and detected by Southern blotting using a probe against KIV-2 (detailed in 36 and 38). In brief, DNA agarose plugs have been prepared as previously described38 and digested for 4 hours at 37°C (KpnI) or 55°C (Kpn2I). Half plug for each sample was applied on the agarose gel twice, separated on a Bio-Rad CHEF Mapper system (for technical details see Supplemental Table III) and the separated alleles were isolated from the gel as described previously13,36. DNA was extracted from the gel slices using the peqGOLD Gel Extraction Kit (VWR). Genotyping was done using a modified ast-PCR protocol for R21X (Supplemental methods) and Sanger sequencing for rs41272114 (Supplemental Table IV).

Statistical methods

Differences in medians were assessed by Wilcoxon tests. The association between the LPA KIV-2 variant R21X and the Lp(a) levels was assessed by linear regression analysis in each population, adjusted for age and sex. GCKD analysis was also repeated adjusting additionally for the estimated glomerular filtration rate (eGFR; estimated according to the CKD-EPI equation39) and urine albumin-to-creatinine ratio. Since R21X has been previously shown to cause a null LPA allele19 and therefore completely abolishes the respective isoform in plasma, all remaining Lp(a) is produced by the nonmutant allele. Therefore, the regression analysis was not adjusted for isoform, as this would imply to adjust for a major part of the Lp(a) concentration itself. β-estimates were obtained on the originals cale of Lp(a), while p-value and coefficients of determination were derived after inverse-normal transformation of the Lp(a) concentrations due to the skewed distribution. All analyses were done in R software version 3.5.0 (www.r-project.org). R package metafor40 (www.metafor-project.org) was used for fixed effect meta-analysis.

Results

Assay performance

We established a cost-effective ast-PCR for the detection of carriers of the R21X variant19 and G4925A13 in large epidemiological sample collections. Our ast-PCR is a high throughput-capable assay with three multiplexed targets: two KIV-2 variants (R21X19 and G4925A13) and an amplification control fragment in PNPLA3. In this manuscript we report the results for the R21X variant. The results of G4925A have already been reported earlier using a different assay approach13.

Our assay showed excellent sensitivity down to 0.5% mutant fraction and no amplification at 0% (Supplemental Figure II). The R21X assay also correctly classified six samples from Coassin and Schönherr et al, 201924, were the R21X status has been determined by ultra-deep NGS (3 positive samples with mutation level 2.4-5.1% and 3 negative samples; all measured in triplicates; more positive validation samples were not available due to the low carrier frequency). The Ct values of genomic DNA samples ranged from 30.3 to 31.7 for the positive samples and from 37.3 to 39.7 for the negative samples, providing a clear separation between positive and negative samples. The validation of the assay against the commercial castPCR (ThermoFisher Scientific) showed no discordances. Reproducibility was tested, firstly, by typing 477 GCKD samples twice during assay establishment and, secondly, by typing 5-10% of the samples of each study twice during data generation (in total nQC_samples=879; see Supplemental methods for details). No discordances were observed (Supplemental Table V). Moreover, each 384 well qPCR plate (n=34) included the same positive control sample, with consistent results over all plates. Sample call rates of the single studies ranged from 97.8% to 99.0% (Supplemental Table V).

R21X is associated with reduced lipoprotein(a) concentrations

We determined the carrier status for the R21X in 10,910 samples. Carrier frequency was 1.6% in GCKD, 1.8% in KORA F3 and 2.1% in KORA F4 resulting in 193 carriers in the combined data set. The R21X variant was associated with reduced Lp(a) levels in all three populations (Figure 2, Table 2) with consistent effect estimates (Table 2). A fixed-effect meta-analysis resulted in an overall effect estimate of −11.7 mg/dL (95% confidence interval (CI): −15.5 to −7.82; p=1.08e-32). Adjustment in GCKD for eGFR (and urine albumin-to-creatinine ratio) altered the estimates only marginally (Table 2, footnote). Positive R21X mutation carrier status explained 1.1% to 1.5% of the inverse-normal Lp(a) variance (Table 2).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2 Linear regression analysis on the association between the R21X variant and the Lp(a) levels.

Lp(a) concentrations are given as mg/dL. The β estimate is given on original scale (mg/dL).

Figure 2
  • Download figure
  • Open in new tab
Figure 2 Association of the R21X variant with reduced Lp(a) levels.

Lp(a) is lower in R21X variant carriers (i.e. at least one KIV-2 repeat carrying the R21X variant) in each population. Due to the highly skewed distribution of Lp(a), outliers were omitted for better representation. The same boxplots including the outliers are provided in Supplemental Figure IV.

PFGE shows location of R21X on moderately large alleles

We assessed the allelic location of R21X by PFGE in eight individuals. The LPA alleles separated by PFGE were isolated from the gel and genotyped using our ast-PCR. In all analyzed individuals R21X was located on HMW alleles in the range 27-32 KIV. This is in line with the observed effect magnitude. The PFGE genotypes and the gene allele carrying the variant are reported in Supplemental Table VI.

Linkage disequilibrium with the splice variant rs41272114 creates double-null alleles

Genetic variants located in the KIV-2 region are not represented in GWAS datasets. To investigate whether any of the lately reported GWAS hits14 may pick up the signal of R21X and to assess the contribution of the R21X nonsense variant to cardiovascular outcomes, we searched among the 66 top hits of our recent GWAS on Lp(a)14 for a proxy SNP for R21X. This identified two possible proxy SNPs: rs2489940 in PLG (MAF=0.5%, R2= 0.38, D’=0.74) and rs41272114 in LPA KIV-817 (MAF=2.6%, R2= 0.275, D’=0.957). The linkage disequilibrium to rs41272114 is noteworthy because 17,41,42 rs41272114 is a widely studied splice donor mutation variant that causes itself17,41,42 LPA null alleles.

The high D’ coefficient indicates that virtually all R21X carriers also carry rs41272114. To substantiate this implication experimentally, we separated the LPA gene alleles of five individuals using PFGE with Kpn2I-digested DNA plugs. Kpn2I excises a large genomic region38 around LPA (Supplemental Figure III) and performing SNP genotyping on the separated alleles allows direct long-range haplotyping38. In all tested individuals, the LPA allele carrying the R21X mutation carried also the rs41272114 splice site mutation (i.e. the two variants formed one haplotype). Accordingly, the association between R21X and Lp(a) vanished, if the linear regression model for R21X on Lp(a) in GCKD was adjusted for rs41272114 (β=−0.67 (95% CI: −9.14; 7.81), p=0.504, age, sex and eGFR-adjusted). Vice versa, rs41272114 was still associated with Lp(a) also when the linear regression was adjusted for R21X (β=−12.26 (95% CI: −17.00; −7.55), p=3.18e-29), respectively when linear regression was performed only in R21X-negative samples (β=−12.26 (95% CI: −17.06; −7.46), p=3.90e-28). No difference was found between median Lp(a) of heterozygous individuals with both variants and such with rs41272114 but not R21X (p=0.47, Figure 3).

Figure 3
  • Download figure
  • Open in new tab
Figure 3 Distribution of Lp(a) values in the carriers of the various combinations of R21X carrier status and rs41272114 genotype.

No significant difference is observed between heterozygous individuals carrying only rs41272114 and those carrying both R21X and rs41272114 (p=0.47). The minor allele of rs41272114 is the adenine base. For R21X no genotype can be reported because most of the KIV-2 repeats still carry the wild type base at any time. Genotype is thus given as positive or negative carrier status. Lp(a) values are reported in mg/dL.

Discussion

The KIV-2 repeat polymorphism in the LPA gene, respectively the apo(a) isoform explains 30-70%1 of Lp(a) variance in the population. However, the impact of the KIV-2 repeat to the individual Lp(a) concentrations varies widely and it has been shown that genetic variants modify the impact of the isoforms on Lp(a) concentrations13. The interpretation of such variants is complicated by the different contribution of the two gene alleles to the Lp(a) concentrations and the complex LD patterns in the LPA gene13,43.

The LPA KIV-2 R21X variant is a nonsense mutation located in the KIV-2 region and results in a truncated protein, which is degraded quickly19. Until recently13,24, the R21X variant was the only functional KIV-2 variant that had been investigated in a relatively large sample set (n=40519). However, no study up to now has investigated the contribution of R21X to the Lp(a) levels in general or high risk populations nor it is known whether this variant is captured by any of the many GWAS hits that have been reported for LPA14–16.

Using a newly developed high-throughput capable PCR assay, we determined the carrier status for R21X in 10,910 individuals from three independent studies and found that R21X is associated with a reduction of mean Lp(a) concentrations by 9.9 to 13.0 mg/dL (Table 2). This effect is of moderate magnitude for a nonsense mutation. Since location on LMW LPA alleles would likely be associated with a much stronger Lp(a) decrease (it is e.g. ≈30 mg/dL for G4925A, which is located in the isoform range 19-2513), the observed effect magnitude suggests that R21X is located on rather large LPA alleles. To investigate this assumption we separated the LPA gene alleles of eight individuals by PFGE and typed R21X on the separated gene alleles. As postulated, the R21X mutation was located on HMW LPA alleles in all investigated samples.

Surprisingly, the best proxy SNP for R21X among all top hits of a genome-wide association metaanalysis on the Lp(a) concentrations14 was rs41272114 (MAF=2.6%). This SNP is a well-known17,41,42 splice site mutation in the KIV-8 domain of LPA and results in a null apo(a) allele, too17. The combination of a low determination coefficient (R2=0.27) but a high Lewontin’s D’ (D’=0.957) indicates that R21X-carrying alleles constitute a subset of the more frequent rs41272114-carrying alleles, where virtually all R21X carriers carry also rs41272114 (indicated by the high D’), but, vice versa, not all rs41272114 carry R21X (indicated by the low R2). This suggests that R21X is a more recent mutation than rs41272114 and indeed arose on the background of an rs41272114-carrying haplotype. Accordingly, R21X (termed 640C>T the supplementary materials of Coassin and Schönherr et al, 201924) is found mostly in Europeans and South-Asians but is absent in Africans24, while rs41272114 is rare but present also in Africans (MAF = 0.7%)45.

By separating the LPA gene alleles and typing them independently, we have been able to confirm this statistical inference also experimentally. This demonstrated that R21X-carrying alleles indeed represent “double-null” LPA alleles that are inactivated by two independent loss-of-function variants. Since both variants are located on the same haplotype, no significant difference in Lp(a) concentrations is found between rs41272114-only carriers and double-null allele carriers (Figure 3) and single causality cannot be assigned. Therefore, despite R21X is a nonsense mutation and would be clearly functional in an isolated manner (e.g. in-vitro), within its proper genomic context its effect is masked by rs41272114. The effect of R21X on Lp(a) and cardiovascular outcomes therefore merges with rs41272114, which has been repeatedly shown to be protective against coronary artery disease (OR=0.79 [0.66-0.97] in PROCARDIS41 and OR=0.891 [0.86-0.92] in UK Biobank+CardiogrammC4D46).

Strengths and limitations of the study

Our high throughput ast-PCR capable of typing two variants within the KIV-2 CNV (R21X and G4925A) in a single multiplex reaction, can be seen as a major technical strength of this work. Some commercial high sensitivity assays like castPCR (ThermoFisher Scientific), Agena MALDI-TOF Ultraseek47 and droplet digital PCR48, are able to type mutations in the KIV-2, too, but their exceedingly high costs (several Euro per sample) precludes their application to large epidemiological studies. In the study at hand, we typed nearly 11,000 individuals, making this study the largest assessment of a variant located in the LPA KIV-2 region performed so far.

The allelic location of functional LPA mutations is rarely assessed in Lp(a) epidemiology. However, to fully understand the effect size of an LPA mutation, it plays a major role whether a mutation is located on a low or a high molecular weight LPA allele. We have experimentally demonstrated the allelic localization of R21X on moderately large alleles and also experimentally confirmed the co-localization of two loss-of-function variants on the same gene allele.

Conversely, the relatively low number of samples assessed by PFGE is a limitation of our study. PFGE requires preparation of agarose-plug embedded DNA. This requires buffy coat, which is not commonly available in population studies. The low MAF of R21X further complicates the retrieval of a large number of individuals for PFGE. Therefore, only a limited number of suitable samples were available in our laboratory and the results of our PFGE experiments might not be fully generalizable. However, the localization of R21X on medium to large sized alleles is in line with the effect magnitude observed in the whole dataset. Furthermore, the co-localization of rs41272114 and R21X on the same haplotype is supported by three independent lines of evidence: (1) the experimental PFGE data from five individuals, (2) the regression analysis in the whole dataset, where the effect of R21X, but not of rs41272114, vanishes after reciprocal adjustment, and (3) the R2/D’ values in GCKD.

Conclusion

We developed a high-throughput capable assay for the KIV-2 variant R21X and found that this variant is located on high molecular weight apo(a) alleles, lowers Lp(a) by 11.7 mg/dL, and most surprisingly, that it is in nearly perfect LD with another null mutation (rs41272114). These two variants create LPA alleles that are inactivated by two independent loss-of-function mutations and their effects cannot be genetically separated. While previous studies have shown the impact of LD between SNPs and apo(a) isoforms13,43, our study is the first example of a strong LD between two clearly functional LPA variants. This emphasizes the complexity of LPA genetics and exemplifies the importance of assessing LD patterns even for seemingly obvious functional variants.

Sources of funding

The study was supported by the Austrian Science Fund (FWF) projects P31458 to SC and P266600-B13 to CL and the Austrian Genome Project ‘GOLD’ to FK. FK and SC gratefully acknowledge the support of the Lipoprotein(a) Center And Research InstitutE [Lp(a)CARE] to their lipoprotein(a) research.

The KORA-Study Group consists of A. Peters (speaker), J. Heinrich, R. Holle, R. Leidl, C. Meisinger, K. Strauch and their co-workers, who are responsible for the design and conduct of the KORA studies. The KORA study was initiated and financed by the Helmholtz Zentrum München – German Research Center for Environmental Health, which is funded by the German Federal Ministry of Education and Research (BMBF) and by the State of Bavaria. Furthermore, KORA research was supported within the Munich Center of Health Sciences (MC-Health), Ludwig-Maximilians-Universität, as part of LMUinnovativ.

The GCKD study is funded by grants from the German Ministry of Education and Research (BMBF) (www.gesundheitsforschung-bmbf.de/de/2101.php, 25 March 2017; grant number 01ER 0804, 01ER 0818, 01ER 0819, 01ER 0820 and 01ER 0821) and the KfH Foundation for Preventive Medicine (http://www.kfh-stiftung-praeventivmedizin.de/content/stiftung, September 17th, 2019). Genotyping of the SNP microarrays in the GCKD study was supported by Bayer Pharma Aktiengesellschaft (AG).

Disclosures

None

Acknowledgments

We are grateful for the willingness of all study participants of the involved studies. The enormous effort of the study personnel of the various study centers is highly appreciated. We would also like to thank the teams from the surgery room of the Department of Visceral, Transplant and Thoracic Surgery of the Medical University of Innsbruck for ongoing support in liver sample collection.

Abbreviations

apo(a)
apolipoprotein(a)
ast-PCR
allele-specific TaqMan PCR
CKD
chronic kidney disease
CNV
copy number variation
eGFR
estimated glomerular filtration rate
GWAS
genome-wide association studies
HMW
high molecular weight isoform
KIV
kringle IV
LD
linkage disequilibrium
LMW
low molecular weight isoform
MAF
minor allele frequency
SNP
single nucleotide polymorphism
WT
wild type

References

  1. 1.↵
    Kronenberg F, Utermann G. Lipoprotein(a): resurrected by genetics. J Intern Med. 2013;273:6–30.
    OpenUrlCrossRefPubMed
  2. 2.↵
    Kronenberg F. Human Genetics and the Causal Role of Lipoprotein(a) for Various Diseases. Cardiovasc Drugs Ther. 2016;30:87–100.
    OpenUrlCrossRefPubMed
  3. 3.
    Clarke R, Peden JF, Hopewell JC, et al. Genetic variants associated with Lp(a) lipoprotein level and coronary disease. N Engl J Med. 2009;361:2518–2528.
    OpenUrlCrossRefPubMedWeb of Science
  4. 4.
    Erqou S, Thompson A, Di Angelantonio E, et al. Apolipoprotein(a) isoforms and the risk of vascular disease: systematic review of 40 studies involving 58,000 participants. J Am Coll Cardiol. 2010;55:2160–2167.
    OpenUrlFREE Full Text
  5. 5.
    Kamstrup PR, Tybjaerg-Hansen A, Steffensen R, Nordestgaard BG. Genetically elevated lipoprotein(a) and increased risk of myocardial infarction. JAMA. 2009;301:2331–2339.
    OpenUrlCrossRefPubMedWeb of Science
  6. 6.
    Kamstrup PR, Nordestgaard BG. Elevated Lipoprotein(a) Levels, LPA Risk Genotypes, and Increased Risk of Heart Failure in the General Population. JACC Heart Fail. 2016;4:78–87.
    OpenUrlAbstract/FREE Full Text
  7. 7.↵
    Thanassoulis G, Campbell CY, Owens DS, et al. Genetic Associations with Valvular Calcification and Aortic Stenosis. N Engl J Med. 2013;368:503–512.
    OpenUrlCrossRefPubMedWeb of Science
  8. 8.↵
    Kamstrup PR, Benn M, Tybjaerg-Hansen A, Nordestgaard BG. Extreme lipoprotein(a) levels and risk of myocardial infarction in the general population: the Copenhagen City Heart Study. Circulation. 2008;117:176–184.
    OpenUrlAbstract/FREE Full Text
  9. 9.↵
    Kraft HG, Köchl S, Menzel HJ, Sandholzer C, Utermann G. The apolipoprotein (a) gene: a transcribed hypervariable locus controlling plasma lipoprotein (a) concentration. Hum Genet. 1992;90:220–230.
    OpenUrlPubMedWeb of Science
  10. 10.↵
    Utermann G, Menzel HJ, Kraft HG, Duba HC, Kemmler HG, Seitz C. Lp(a) glycoprotein phenotypes. Inheritance and relation to Lp(a)-lipoprotein concentrations in plasma. J Clin Invest. 1987;80:458–465.
    OpenUrlCrossRefPubMedWeb of Science
  11. 11.↵
    Perombelon YFN, Soutar AK, Knight BL. Variation in lipoprotein(a) concentration associated with different apolipoprotein(a) alleles. J Clin Invest. 1994;93:1481–1492.
    OpenUrlCrossRefPubMedWeb of Science
  12. 12.↵
    Cohen JC, Chiesa G, Hobbs HH. Sequence polymorphisms in the apolipoprotein (a) gene. Evidence for dissociation between apolipoprotein(a) size and plasma lipoprotein(a) levels. J Clin Invest. 1993;91:1630–1636.
    OpenUrlCrossRefPubMedWeb of Science
  13. 13.↵
    Coassin S, Erhart G, Weissensteiner H, et al. A novel but frequent variant in LPA KIV-2 is associated with a pronounced Lp(a) and cardiovascular risk reduction. Eur Heart J. 2017;38:1823–1831.
    OpenUrlPubMed
  14. 14.↵
    Mack S, Coassin S, Rueedi R, et al. A genome-wide association meta-analysis on lipoprotein (a) concentrations adjusted for apolipoprotein (a) isoforms. J Lipid Res. 2017;58:1834–1844.
    OpenUrlAbstract/FREE Full Text
  15. 15.
    Li J, Lange LA, Sabourin J, et al. Genome- and exome-wide association study of serum lipoprotein (a) in the Jackson Heart Study. J Hum Genet. 2015;60:755–761.
    OpenUrlCrossRefPubMed
  16. 16.↵
    Ober C, Nord AS, Thompson EE, et al. Genome-wide association study of plasma lipoprotein(a) levels identifies multiple genes on chromosome 6q. J Lipid Res. 2009;50:798–806.
    OpenUrlAbstract/FREE Full Text
  17. 17.↵
    Ogorelkova M, Gruber A, Utermann G. Molecular basis of congenital lp(a) deficiency: a frequent apo(a) “null” mutation in caucasians. Hum Mol Genet. 1999;8:2087–2096.
    OpenUrlCrossRefPubMedWeb of Science
  18. 18.
    Zysow BR, Lindahl GE, Wade DP, Knight BL, Lawn RM. C/T polymorphism in the 5’ untranslated region of the apolipoprotein(a) gene introduces an upstream ATG and reduces in vitro translation. Arter Thromb Vasc Biol. 1995;15:58–64.
    OpenUrlAbstract/FREE Full Text
  19. 19.↵
    Parson W, Kraft HG, Niederstätter H, et al. A common nonsense mutation in the repetitive Kringle IV-2 domain of human apolipoprotein(a) results in a truncated protein and low plasma Lp(a). Hum Mutat. 2004;24:474–480.
    OpenUrlCrossRefPubMedWeb of Science
  20. 20.↵
    Lanktree MB, Anand SS, Yusuf S, Hegele RA. Comprehensive analysis of genomic variation in the LPA locus and its relationship to plasma lipoprotein(a) in south asians, chinese, and european caucasians. Circ Cardiovasc Genet. 2010;3:39–46.
    OpenUrlAbstract/FREE Full Text
  21. 21.
    Crawford DC, Peng Z, Cheng J-F, et al. LPA and PLG sequence variation and kringle IV-2 copy number in two populations. Hum Hered. 2008;66:199–209.
    OpenUrlCrossRefPubMed
  22. 22.
    Prins J, Leus FR, Bouma BN, van Rijn HJ. The identification of polymorphisms in the coding region of the apolipoprotein (a) gene--association with earlier identified polymorphic sites and influence on the lipoprotein (a) concentration. Thromb Haemost. 1999;82:1709–1717.
    OpenUrlPubMedWeb of Science
  23. 23.↵
    Puckey LH, Lawn RM, Knight BL. Polymorphisms in the apolipoprotein(a) gene and their relationship to allele size and plasma lipoprotein(a) concentration. Hum Mol Genet. 1997;6:1099–1107.
    OpenUrlCrossRefPubMed
  24. 24.↵
    Coassin S, Schönherr S, Weissensteiner H, et al. A comprehensive map of single-base polymorphisms in the hypervariable LPA kringle IV type 2 copy number variation region. J Lipid Res. 2019;60:186–199.
    OpenUrlAbstract/FREE Full Text
  25. 25.↵
    Titze S, Schmid M, Köttgen A, et al. Disease burden and risk profile in referred patients with moderate chronic kidney disease: Composition of the German Chronic Kidney Disease (GCKD) cohort. Nephrol Dial Transplant. 2015;30:441–451.
    OpenUrlCrossRefPubMed
  26. 26.↵
    Wichmann H-E, Gieger C, Illig T, MONICA/KORA Study Group. KORA-gen--resource for population genetics, controls and a broad spectrum of disease phenotypes. Gesundheitswesen. 2005;67 Suppl 1:S26–30.
    OpenUrlCrossRefPubMedWeb of Science
  27. 27.↵
    Rantner B, Kollerits B, Anderwald-Stadler M, et al. Association between the UGT1A1 TA-repeat polymorphism and bilirubin concentration in patients with intermittent claudication: results from the CAVASIC study. Clin Chem. 2008;54:851–857.
    OpenUrlAbstract/FREE Full Text
  28. 28.↵
    Vogl L, Pohlhammer J, Meinitzer A, et al. Serum concentrations of L-arginine and L-homoarginine in male patients with intermittent claudication: a cross-sectional and prospective investigation in the CAVASIC Study. Atherosclerosis. 2015;239:607–614.
    OpenUrl
  29. 29.↵
    You FM, Huo N, Gu YQ, et al. BatchPrimer3: a high throughput web application for PCR and sequencing primer design. BMC Bioinformatics. 2008;9:253.
    OpenUrlCrossRefPubMed
  30. 30.↵
    Leisch F. Bagged Clustering. Working Papers SFB “Adaptive Information Systems and Modelling in Economics and Management Science”, 51. SFB Adaptive Information Systems and Modelling in Economics and Management Science, WU Vienna University of Economics and Business, Vienna; 1999.
  31. 31.↵
    Yee TW. The VGAM Package for Categorical Data Analysis. J Stat Softw. 2010;32:481–493.
    OpenUrl
  32. 32.↵
    Erhart G, Lamina C, Lehtimäki T, et al. Genetic Factors Explain a Major Fraction of the 50% Lower Lipoprotein(a) Concentrations in Finns. Arterioscler Thromb Vasc Biol. 2018;38:1230–1241.
    OpenUrlAbstract/FREE Full Text
  33. 33.↵
    Kronenberg F, Kuen E, Ritz E, et al. Lipoprotein(a) serum concentrations and apolipoprotein(a) phenotypes in mild and moderate renal failure. J Am Soc Nephrol. 2000;11:105–115.
    OpenUrlAbstract/FREE Full Text
  34. 34.↵
    Gaunt TR, Rodríguez S, Day INM. Cubic exact solutions for the estimation of pairwise haplotype frequencies: Implications for linkage disequilibrium analyses and a web tool “CubeX.” BMC Bioinformatics. 2007;8:1–9.
    OpenUrlCrossRefPubMed
  35. 35.↵
    Kraft HG, Sandholzer C, Menzel HJ, Utermann G. Apolipoprotein (a) alleles determine lipoprotein (a) particle density and concentration in plasma. Arter Thromb. 1992;12:302–306.
    OpenUrl
  36. 36.↵
    Noureen A, Fresser F, Utermann G, Schmidt K. Sequence Variation within the KIV-2 Copy Number Polymorphism of the Human LPA Gene in African, Asian, and European Populations. PLoS One. 2015;10:e0121582.
    OpenUrlCrossRefPubMed
  37. 37.↵
    Lackner C, Boerwinkle E, Leffert CC, Rahmig T, Hobbs HH. Molecular basis of apolipoprotein (a) isoform size heterogeneity as revealed by pulsed-field gel electrophoresis. J Clin Invest. 1991;87:2153–2161.
    OpenUrlCrossRefPubMedWeb of Science
  38. 38.↵
    Khalifa M, Noureen A, Ertelthalner K, et al. Lack of association of rs3798220 with small apolipoprotein(a) isoforms and high lipoprotein(a) levels in East and Southeast Asians. Atherosclerosis. 2015;242:521–528.
    OpenUrlCrossRefPubMed
  39. 39.↵
    Levey AS, Stevens LA, Schmid CH, et al. A New Equation to Estimate Glomerular Filtration Rate. Ann Intern Med. 2009;150:604.
    OpenUrlCrossRefPubMedWeb of Science
  40. 40.↵
    Viechtbauer W. Conducting Meta-Analyses in R with the metafor Package. J Stat Softw. 2010;36.
  41. 41.↵
    Kyriakou T, Seedorf U, Goel A, et al. A Common LPA Null Allele Associates With Lower Lipoprotein(a) Levels and Coronary Artery Disease Risk. Arter Thromb Vasc Biol. 2014;34:2095–2099.
    OpenUrlAbstract/FREE Full Text
  42. 42.↵
    Emdin CA, Khera A V., Natarajan P, et al. Phenotypic Characterization of Genetically Lowered Human Lipoprotein(a) Levels. J Am Coll Cardiol. 2016;68:2761–2772.
    OpenUrlFREE Full Text
  43. 43.↵
    Kraft HG, Windegger M, Menzel HJ, Utermann G. Significant impact of the +93 C/T polymorphism in the apolipoprotein(a) gene on Lp(a) concentrations in Africans but not in Caucasians: confounding effect of linkage disequilibrium. Hum Mol Genet. 1998;7:257–264.
    OpenUrlCrossRefPubMedWeb of Science
  44. 44.
    1000 Genomes Project Consortium, Auton A, Brooks LD, et al. A global reference for human genetic variation. Nature. 2015;526:68–74.
    OpenUrlCrossRefPubMed
  45. 45.↵
    Chretien J-P, Coresh J, Berthier-Schaad Y, et al. Three single-nucleotide polymorphisms in LPA account for most of the increase in lipoprotein(a) level elevation in African Americans compared with European Americans. J Med Genet. 2006;43:917–923.
    OpenUrlAbstract/FREE Full Text
  46. 46.↵
    van der Harst P, Verweij N. Identification of 64 Novel Genetic Loci Provides an Expanded View on the Genetic Architecture of Coronary Artery Disease. Circ Res. 2018;122:433–443.
    OpenUrlAbstract/FREE Full Text
  47. 47.↵
    Mosko MJ, Nakorchevsky AA, Flores E, et al. Ultrasensitive Detection of Multiplexed Somatic Mutations Using MALDI-TOF Mass Spectrometry. J Mol Diagn. 2016;18:23–31.
    OpenUrl
  48. 48.↵
    Reid AL, Freeman JB, Millward M, Ziman M, Gray ES. Detection of BRAF-V600E and V600K in melanoma circulating tumour cells by droplet digital PCR. Clin Biochem. 2015;48:999–1002.
    OpenUrlCrossRefPubMed
View Abstract
Back to top
PreviousNext
Posted November 20, 2019.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Investigation of an LPA KIV-2 nonsense mutation in 11,000 individuals: the importance of linkage disequilibrium structure in LPA genetics
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Investigation of an LPA KIV-2 nonsense mutation in 11,000 individuals: the importance of linkage disequilibrium structure in LPA genetics
Silvia Di Maio, Gertraud Streiter, Rebecca Grüneis, Claudia Lamina, Manuel Maglione, Dietmar Öfner, Barbara Thorand, Annette Peters, Kai-Uwe Eckardt, Anna Köttgen, Florian Kronenberg, Stefan Coassin
bioRxiv 848945; doi: https://doi.org/10.1101/848945
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
Investigation of an LPA KIV-2 nonsense mutation in 11,000 individuals: the importance of linkage disequilibrium structure in LPA genetics
Silvia Di Maio, Gertraud Streiter, Rebecca Grüneis, Claudia Lamina, Manuel Maglione, Dietmar Öfner, Barbara Thorand, Annette Peters, Kai-Uwe Eckardt, Anna Köttgen, Florian Kronenberg, Stefan Coassin
bioRxiv 848945; doi: https://doi.org/10.1101/848945

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetics
Subject Areas
All Articles
  • Animal Behavior and Cognition (2409)
  • Biochemistry (4757)
  • Bioengineering (3299)
  • Bioinformatics (14579)
  • Biophysics (6588)
  • Cancer Biology (5128)
  • Cell Biology (7376)
  • Clinical Trials (138)
  • Developmental Biology (4315)
  • Ecology (6821)
  • Epidemiology (2057)
  • Evolutionary Biology (9838)
  • Genetics (7308)
  • Genomics (9467)
  • Immunology (4506)
  • Microbiology (12587)
  • Molecular Biology (4901)
  • Neuroscience (28096)
  • Paleontology (198)
  • Pathology (799)
  • Pharmacology and Toxicology (1372)
  • Physiology (1993)
  • Plant Biology (4449)
  • Scientific Communication and Education (966)
  • Synthetic Biology (1293)
  • Systems Biology (3891)
  • Zoology (717)