Abstract
On 22 January 2020, the National Genomics Data Center (NGDC), part of the China National Center for Bioinformation (CNCB), created the 2019 Novel Coronavirus Resource (2019nCoVR), an open-access SARS-CoV-2 information resource. 2019nCoVR features a comprehensive integration of sequence and clinical information for all publicly available SARS-CoV-2 isolates, which are manually curated with value-added annotations and quality evaluated by our in-house automated pipeline. Of particular note, 2019nCoVR performs systematic analyses to generate a dynamic landscape of SARS-CoV-2 genomic variations at a global scale. It provides all identified variants and detailed statistics for each virus isolate, and congregates the quality score, functional annotation, and population frequency for each variant. It also generates visualization of the spatiotemporal change for each variant and yields historical viral haplotype network maps for the course of the outbreak from all complete and high-quality genomes. Moreover, 2019nCoVR provides a full collection of SARS-CoV-2 relevant literature on COVID-19 (Coronavirus Disease 2019), including published papers from PubMed as well as preprints from services such as bioRxiv and medRxiv through Europe PMC. Furthermore, by linking with relevant databases in CNCB-NGDC, 2019nCoVR offers data submission services for raw sequence reads and assembled genomes, and data sharing with National Center for Biotechnology Information. Collectively, all SARS-CoV-2 genome sequences, variants, haplotypes and literature are updated daily to provide timely information, making 2019nCoVR a valuable resource for the global research community. 2019nCoVR is accessible at https://bigd.big.ac.cn/ncov/.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
We updated all the statistics of 2019nCoVR as for 14 July 2020; highlighted the improvements of our database in Table 1; reorganized the content and changed the titles "Database construction and visualization" to "Implementation", and "Results" to "Database content and features"; corrected typos and grammar errors; clarified the descriptions; modified figure legends and figures; listed the contributions of each author in the CRediT author statement.