RT Journal Article SR Electronic T1 Identification of putative G-quadruplex forming sequences in three manatee papillomaviruses JF bioRxiv FD Cold Spring Harbor Laboratory SP 138602 DO 10.1101/138602 A1 Maryam Zahin A1 William L. Dean A1 Shin-je Ghim A1 Joongho Joh A1 Robert D. Gray A1 Sujita Khanal A1 Gregory D. Bossart A1 Antonio A. Mignucci-Giannoni A1 Eric C. Rouchka A1 Alfred B. Jenson A1 Jonathan B. Chaires A1 Julia H. Chariker YR 2017 UL http://biorxiv.org/content/early/2017/05/16/138602.abstract AB The Florida manatee (Trichechus manatus latirotris) is considered a threatened aquatic mammal in United States coastal waters. Over the past decade, the appearance of papillomavirus-induced lesions and viral papillomatosis in manatees has been a concern for those involved in the management and rehabilitation of this species. To date, three manatee papillomaviruses (PVs) have been identified in Florida manatees, one forming cutaneous lesions (TmPV1) and two forming genital lesions (TmPV3 and TmPV4). In this study, we identified DNA sequences with the potential to form G-quadruplex structures in all three PVs. G-quadruplex structures (G4) are guanine-rich nucleic acid sequences capable of forming secondary structures in DNA and RNA. In humans, G4 are known to regulate molecular processes such as transcription and translation. Although G4 have been identified in several viral genomes, including human PVs, no attempt has been made to identify G4 in animal PVs. We found that sequences capable of forming G4 were present on both DNA strands and across coding and non-coding regions on all PVs. The vast majority of the identified sequences would allow the formation of non-canonical structures with only two G-tetrads. The formation of one such structure was supported through biophysical analysis. Computational analysis demonstrated enrichment of G4 sequences on the reverse strand in the E2/E4 region on all manatee PVs and on the forward strand in the E2/E4 region on one genital PV. Several G4 sequences occurred at similar regional locations on all PVs, most notably on the reverse strand in the E2 region. In other cases, G4 were identified at similar regional locations only on PVs forming genital lesions. On all PVs, G4 sequences were located near putative E2 binding sites in the non-coding region. Together, these findings suggest that G4 are likely regulatory elements in manatee PVs.Author summary G-quadruplex structures (G4) are found in the DNA and RNA of many species and are known to regulate the expression of genes and the synthesis of proteins, among other important molecular processes. Recently, these structures have been identified in several viruses, including the human papillomavirus (PV). As regulatory structures, G4 are of great interest to researchers as drug targets for viral control. In this paper, we identify the first G4 sequences in three PVs infecting a non-human animal, the Florida manatee. Through computational and biophysical analysis, we find that a greater variety of sequence patterns may underlie the formation of these structures than previously identified. The sequences are found in all protein coding regions of the virus and near sites for viral replication in non-coding regions. Furthermore, the distribution of these sequences across the PV genomes supports the notion that sequences are conserved across PV types, suggesting they are under selective pressure. This paper extends previous research on G4 in human PVs with additional evidence for their role as regulators. The G4 sequences we identified also provide potential regulatory targets for researchers interested in controlling this virus in the Florida manatee, a threatened aquatic mammal.