Abstract
Transcriptional regulation is associated with a broad range of diseases. Methods associating genetic polymorphism with gene transcription levels offer key insights for understanding the transcriptional regulation plan. The majority of gene imputation methods focus on modeling polymorphism in the cis regions of the gene, partially owing to the large genetic search space. We hypothesize that polymorphism within transcription factors (TFs) may help explain transcription levels of their transcribed genes.
Here, we test this hypothesis by developing TF-TWAS: imputation models that integrate transcription factor information with transcription-wide association study methodology. By comparing TF-TWAS models to base models that use only gene cis information, we are able to estimate possible mechanisms of the TF polymorphism effect – TF expression or binding affinity within four tissues – whole blood, liver, brain hippocampus and coronary artery.
We identified 48 genes where the TF-TWAS models explain significantly better their expression than cis models alone in at least one of the four tissues. Sixteen of these genes are associated with various diseases, including cancer, neurological, psychiatric and rare genetic diseases. Our method is a new expansion to transcriptome-wide association studies and enables the identification of new associations between polymorphism in transcription factor and gene transcription levels.