Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Reconfiguring Okazaki fragment start sites on a genome by using a data-driven approach

Adam Soffer, Morya Ifrach, Stefan Ilic, Ariel Afek, Dan Vilenchik, View ORCID ProfileBarak Akabayov
doi: https://doi.org/10.1101/2020.09.29.317842
Adam Soffer
1Department of Chemistry, Ben-Gurion University of the Negev, Beer-Sheva, Israel
2Data Science Research Center, Ben-Gurion University of the Negev, Beer-Sheva, Israel
3School of Computer and Electrical Engineering, Ben-Gurion University of the Negev, Beer-Sheva, Israel
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Morya Ifrach
1Department of Chemistry, Ben-Gurion University of the Negev, Beer-Sheva, Israel
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stefan Ilic
1Department of Chemistry, Ben-Gurion University of the Negev, Beer-Sheva, Israel
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ariel Afek
1Department of Chemistry, Ben-Gurion University of the Negev, Beer-Sheva, Israel
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dan Vilenchik
2Data Science Research Center, Ben-Gurion University of the Negev, Beer-Sheva, Israel
3School of Computer and Electrical Engineering, Ben-Gurion University of the Negev, Beer-Sheva, Israel
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Barak Akabayov
1Department of Chemistry, Ben-Gurion University of the Negev, Beer-Sheva, Israel
2Data Science Research Center, Ben-Gurion University of the Negev, Beer-Sheva, Israel
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Barak Akabayov
  • For correspondence: akabayov@bgu.ac.il
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

ABSTRACT

DNA primase is an essential enzyme that synthesizes short RNA primers on specific DNA sequences. These RNA primers are elongated by DNA polymerase to form Okazaki fragments on the lagging DNA strand. It is therefore reasonable to assume that the binding of DNA primase on a genome marks the start sites of the Okazaki fragments. It has long been known that the frequency of the occurrence of primase trinucleotide recognition on a genome sequence has no influence on the size of the Okazaki fragments. The unresolved enigma that we address in this study is therefore why some, but not all, primase-DNA recognition sequences (PDRSs) become Okazaki fragment start sites. To this end, we applied machine-learning algorithms to analyze a massive amount of data obtained from protein-DNA binding microarrays (PBM) with the aim of identifying the essential elements on DNA that are needed for the binding of bacteriophage T7 primase. A PBM data learning algorithm enabled the prediction of binding values of T7 primase for any given DNA sequence with unprecedented accuracy and flexibility. On the basis of the principles learned about DNA-primase binding, we generated novel DNA sequences with improved binding of T7 primase and improved RNA primer synthesis, as validated experimentally.

Competing Interest Statement

The authors have declared no competing interest.

  • ABBREVIATIONS
    PBM
    protein-DNA binding microarray
    PDRS
    primase-DNA recognition sequence
    EDA
    exploratory data analysis
    PCA
    principal component analysis
    WD
    Ward distance
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
    Back to top
    PreviousNext
    Posted September 30, 2020.
    Download PDF

    Supplementary Material

    Email

    Thank you for your interest in spreading the word about bioRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Reconfiguring Okazaki fragment start sites on a genome by using a data-driven approach
    (Your Name) has forwarded a page to you from bioRxiv
    (Your Name) thought you would like to see this page from the bioRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Reconfiguring Okazaki fragment start sites on a genome by using a data-driven approach
    Adam Soffer, Morya Ifrach, Stefan Ilic, Ariel Afek, Dan Vilenchik, Barak Akabayov
    bioRxiv 2020.09.29.317842; doi: https://doi.org/10.1101/2020.09.29.317842
    Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
    Citation Tools
    Reconfiguring Okazaki fragment start sites on a genome by using a data-driven approach
    Adam Soffer, Morya Ifrach, Stefan Ilic, Ariel Afek, Dan Vilenchik, Barak Akabayov
    bioRxiv 2020.09.29.317842; doi: https://doi.org/10.1101/2020.09.29.317842

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Biochemistry
    Subject Areas
    All Articles
    • Animal Behavior and Cognition (3689)
    • Biochemistry (7796)
    • Bioengineering (5674)
    • Bioinformatics (21283)
    • Biophysics (10578)
    • Cancer Biology (8174)
    • Cell Biology (11945)
    • Clinical Trials (138)
    • Developmental Biology (6762)
    • Ecology (10401)
    • Epidemiology (2065)
    • Evolutionary Biology (13865)
    • Genetics (9708)
    • Genomics (13072)
    • Immunology (8146)
    • Microbiology (20014)
    • Molecular Biology (7853)
    • Neuroscience (43055)
    • Paleontology (319)
    • Pathology (1279)
    • Pharmacology and Toxicology (2258)
    • Physiology (3351)
    • Plant Biology (7232)
    • Scientific Communication and Education (1312)
    • Synthetic Biology (2006)
    • Systems Biology (5538)
    • Zoology (1128)