Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Large-scale whole-genome sequencing of three diverse Asian populations in Singapore

Degang Wu, Jinzhuang Dou, Xiaoran Chai, Claire Bellis, Andreas Wilm, Chih Chuan Shih, Wendy Wei Jia Soon, Nicolas Bertin, Chiea Chuen Khor, Michael DeGiorgio, Sonia Maria Davila Dominguez, Patrick Tan, Asim Shabbir, Angela Moh, Eng-King Tan, Jia Nee Foo, Tan Tock Seng Hospital Healthy Control Workgroup, Roger S. Foo, Carolyn S. P. Lam, A. Mark Richards, Cheng-Yu Cheng, Ting Aung, Tien Yin Wong, Jianjun Liu, Chaolong Wang, The SG10K Consortium
doi: https://doi.org/10.1101/390070
Degang Wu
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jinzhuang Dou
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiaoran Chai
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Claire Bellis
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andreas Wilm
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chih Chuan Shih
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wendy Wei Jia Soon
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nicolas Bertin
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chiea Chuen Khor
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael DeGiorgio
Pennsylvania State University, USA;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sonia Maria Davila Dominguez
SingHealth Duke-NUS Institute of Precision Medicine, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Patrick Tan
SingHealth Duke-NUS Institute of Precision Medicine, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Asim Shabbir
National University Health System, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Angela Moh
Khoo Teck Puat Hospital, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Eng-King Tan
National Neuroscience Institute, Singapore General Hospital Campus, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jia Nee Foo
Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
-;
Roger S. Foo
Cardiovascular Research Institute, National University Health System, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Carolyn S. P. Lam
National Heart Centre Singapore and Duke-National University of Singapore, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
A. Mark Richards
Cardiovascular Research Institute, National University Health System, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Cheng-Yu Cheng
Singapore Eye Research Institute, Singapore National Eye Centre, Singapore
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ting Aung
Singapore Eye Research Institute, Singapore National Eye Centre, Singapore
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tien Yin Wong
Singapore Eye Research Institute, Singapore National Eye Centre, Singapore
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jianjun Liu
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chaolong Wang
Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: wangcl@gis.a-star.edu.sg
-;
  • Abstract
  • Info/History
  • Metrics
  • Data Supplements
  • Preview PDF
Loading

Abstract

Asian populations are currently underrepresented in human genetics research. Here we present whole-genome sequencing data of 4,810 Singaporeans from three diverse ethnic groups: 2,780 Chinese, 903 Malays, and 1,127 Indians. Despite a medium depth of 13.7X, we achieved essentially perfect (>99.8%) sensitivity and accuracy for detecting common variants and good sensitivity (>89%) for detecting extremely rare variants with <0.1% allele frequency. We found 89.2 million single-nucleotide polymorphisms (SNPs) and 9.1 million small insertions and deletions (INDELs), more than half of which have not been cataloged in dbSNP. In particular, we found 126 common deleterious mutations (MAF>0.01) that were absent in the existing public databases, highlighting the importance of local population reference for genetic diagnosis. We describe fine-scale genetic structure of Singapore populations and their relationship to worldwide populations from the 1000 Genomes Project. In addition to revealing noticeable amounts of admixture among three Singapore populations and a Malay-related novel ancestry component that has not been captured by the 1000 Genomes Project, our analysis also identified some fine-scale features of genetic structure consistent with two waves of prehistoric migration from south China to Southeast Asia. Finally, we demonstrate that our data can substantially improve genotype imputation not only for Singapore populations, but also for populations across Asia and Oceania. These results highlight the genetic diversity in Singapore and the potential impacts of our data as a resource to empower human genetics discovery in a broad geographic region.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
  • Posted August 11, 2018.

Download PDF

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Large-scale whole-genome sequencing of three diverse Asian populations in Singapore
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
Share
Large-scale whole-genome sequencing of three diverse Asian populations in Singapore
Degang Wu, Jinzhuang Dou, Xiaoran Chai, Claire Bellis, Andreas Wilm, Chih Chuan Shih, Wendy Wei Jia Soon, Nicolas Bertin, Chiea Chuen Khor, Michael DeGiorgio, Sonia Maria Davila Dominguez, Patrick Tan, Asim Shabbir, Angela Moh, Eng-King Tan, Jia Nee Foo, Tan Tock Seng Hospital Healthy Control Workgroup, Roger S. Foo, Carolyn S. P. Lam, A. Mark Richards, Cheng-Yu Cheng, Ting Aung, Tien Yin Wong, Jianjun Liu, Chaolong Wang, The SG10K Consortium
bioRxiv 390070; doi: https://doi.org/10.1101/390070
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
Large-scale whole-genome sequencing of three diverse Asian populations in Singapore
Degang Wu, Jinzhuang Dou, Xiaoran Chai, Claire Bellis, Andreas Wilm, Chih Chuan Shih, Wendy Wei Jia Soon, Nicolas Bertin, Chiea Chuen Khor, Michael DeGiorgio, Sonia Maria Davila Dominguez, Patrick Tan, Asim Shabbir, Angela Moh, Eng-King Tan, Jia Nee Foo, Tan Tock Seng Hospital Healthy Control Workgroup, Roger S. Foo, Carolyn S. P. Lam, A. Mark Richards, Cheng-Yu Cheng, Ting Aung, Tien Yin Wong, Jianjun Liu, Chaolong Wang, The SG10K Consortium
bioRxiv 390070; doi: https://doi.org/10.1101/390070

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetics
Subject Areas
All Articles
  • Animal Behavior and Cognition (814)
  • Biochemistry (1124)
  • Bioengineering (716)
  • Bioinformatics (5722)
  • Biophysics (1943)
  • Cancer Biology (1381)
  • Cell Biology (1957)
  • Clinical Trials (71)
  • Developmental Biology (1337)
  • Ecology (2048)
  • Epidemiology (1096)
  • Evolutionary Biology (4331)
  • Genetics (3042)
  • Genomics (3923)
  • Immunology (836)
  • Microbiology (3289)
  • Molecular Biology (1220)
  • Neuroscience (8382)
  • Paleontology (62)
  • Pathology (169)
  • Pharmacology and Toxicology (304)
  • Physiology (401)
  • Plant Biology (1138)
  • Scientific Communication and Education (318)
  • Synthetic Biology (469)
  • Systems Biology (1596)
  • Zoology (210)