Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Robust detection of tandem repeat expansions from long DNA reads

Satomi Mitsuhashi, Martin C Frith, Takeshi Mizuguchi, Satoko Miyatake, Tomoko Toyota, Hiroaki Adachi, Yoko Oma, Yoshihiro Kino, Hiroaki Mitsuhashi, Naomichi Matsumoto
doi: https://doi.org/10.1101/356931
Satomi Mitsuhashi
1Department of Human Genetics, Yokohama City University Graduate School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: mcfrith@edu.k.u-tokyo.ac.jp satomits@yokohama-cu.ac.jp
Martin C Frith
2Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST)
3Graduate School of Frontier Sciences, University of Tokyo
4Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), AIST
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: mcfrith@edu.k.u-tokyo.ac.jp satomits@yokohama-cu.ac.jp
Takeshi Mizuguchi
1Department of Human Genetics, Yokohama City University Graduate School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Satoko Miyatake
1Department of Human Genetics, Yokohama City University Graduate School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tomoko Toyota
5Department of Neurology, University of Occupational and Environmental Health School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hiroaki Adachi
5Department of Neurology, University of Occupational and Environmental Health School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yoko Oma
6Department of Liberal Arts, Faculty of Medicine, Saitama Medical University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yoshihiro Kino
7Department of Bioinformatics and Molecular Neuropathology, Meiji Pharmaceutical University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hiroaki Mitsuhashi
8Department of Applied Biochemistry, School of Engineering, Tokai University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Naomichi Matsumoto
1Department of Human Genetics, Yokohama City University Graduate School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Tandemly repeated sequences are highly mutable and variable features of genomes. Tandem repeat expansions are responsible for a growing list of human diseases, even though it is hard to determine tandem repeat sequences with current DNA sequencing technology. Recent long-read technologies are promising, because the DNA reads are often longer than the repetitive regions, but are hampered by high error rates. Here, we report robust detection of human repeat expansions from careful alignments of long (PacBio and nanopore) reads to a reference genome. Our method (tandem-genotypes) is robust to systematic sequencing errors, inexact repeats with fuzzy boundaries, and low sequencing coverage. By comparing to healthy controls, we can prioritize pathological expansions within the top 10 out of 700000 tandem repeats in the genome. This may help to elucidate the many genetic diseases whose causes remain unknown.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted July 24, 2018.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Robust detection of tandem repeat expansions from long DNA reads
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Robust detection of tandem repeat expansions from long DNA reads
Satomi Mitsuhashi, Martin C Frith, Takeshi Mizuguchi, Satoko Miyatake, Tomoko Toyota, Hiroaki Adachi, Yoko Oma, Yoshihiro Kino, Hiroaki Mitsuhashi, Naomichi Matsumoto
bioRxiv 356931; doi: https://doi.org/10.1101/356931
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Robust detection of tandem repeat expansions from long DNA reads
Satomi Mitsuhashi, Martin C Frith, Takeshi Mizuguchi, Satoko Miyatake, Tomoko Toyota, Hiroaki Adachi, Yoko Oma, Yoshihiro Kino, Hiroaki Mitsuhashi, Naomichi Matsumoto
bioRxiv 356931; doi: https://doi.org/10.1101/356931

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4246)
  • Biochemistry (9176)
  • Bioengineering (6808)
  • Bioinformatics (24070)
  • Biophysics (12162)
  • Cancer Biology (9569)
  • Cell Biology (13847)
  • Clinical Trials (138)
  • Developmental Biology (7663)
  • Ecology (11741)
  • Epidemiology (2066)
  • Evolutionary Biology (15548)
  • Genetics (10674)
  • Genomics (14368)
  • Immunology (9518)
  • Microbiology (22919)
  • Molecular Biology (9136)
  • Neuroscience (49170)
  • Paleontology (358)
  • Pathology (1488)
  • Pharmacology and Toxicology (2584)
  • Physiology (3851)
  • Plant Biology (8353)
  • Scientific Communication and Education (1473)
  • Synthetic Biology (2302)
  • Systems Biology (6207)
  • Zoology (1304)