ABSTRACT
Guanine-rich DNA strands can fold into non-canonical four-stranded secondary structures named G-quadruplexes (G4s). G4s folded in proximal promoter regions (PPR) are associated either with positive or negative transcriptional regulation. Given that single nucleotide variants (SNVs) affecting G4 folding (G4-Vars) may alter gene transcription, and that SNVs are associated with the human diseases’ onset, we undertook a comprehensive study of the G4-Vars genome-wide (G4-variome) to find disease-associated G4-Vars located into PPRs. We developed a bioinformatics strategy to find disease-related SNVs located into PPRs simultaneously overlapping with putative G4-forming sequences (PQSs). We studied five G4-Vars disturbing in vitro the folding and stability of the G4s located into PPRs, which had been formerly associated with sporadic Alzheimer’s disease (GRIN2B), a severe familiar coagulopathy (F7), atopic dermatitis (CSF2), myocardial infarction (SIRT1), and deafness (LHFPL5). Results obtained in cellulo for GRIN2B and F7 suggest that the G4 disruption due to the identified G4-Vars affect the transcription and are responsible for the mentioned diseases. Collectively, data suggest that G4-Vars may account for the different susceptibilities to human genetic diseases’ onset, and could be novel targets for diagnosis and drug design in precision medicine.
Competing Interest Statement
The authors have declared no competing interest.
ABBREVIATIONS
- G4,
- G-quadruplex;
- PQS,
- putative G4-forming sequence;
- SNV,
- single-nucleotide variant;
- TSS,
- transcription start site;
- PPR,
- proximal promoter region;
- HSV,
- Human Short Variants;
- HGMD-PUBLIC,
- Public Human Gene Mutation Database;
- dbSNP,
- Single Nucleotide Polymorphism database;
- HSSV,
- Human Somatic Short Variants;
- COSMIC,
- Catalogue Of Somatic Mutations In Cancer;
- AA,
- ancestral allele;
- VA,
- variation allele;
- pG4-Var,
- genetic variant occurring within a PQS;
- G4-Var,
- genetic variant occurring within a sequence that forms a G4;
- QGRS,
- Quadruplex forming G-Rich Sequence;
- CD,
- circular dichroism;
- NMR,
- nuclear magnetic resonance;
- qPSA,
- qPCR Stop Assays;
- qPCR,
- real-time quantitative PCR;
- HEK-293,
- human embryonic kidney 293 cell line;
- DMEM,
- Dulbecco’s Modified Eagle Medium;
- FBS,
- foetal bovine serum;
- hpt,
- hours post-transfection;
- LRA,
- luciferase reporter assay;
- FL,
- Firefly luciferase;
- β-gal,
- β-galactosidase;
- GO,
- gene ontology;
- SAD,
- sporadic Alzheimer’s disease;
- OQ,
- observed G4 sequence;
- Tm,
- melting temperature;
- PDS,
- pyridostatin;
- (TFBS),
- transcription factor–binding site;
- TF,
- transcription factors.