PT - JOURNAL ARTICLE AU - Ivair José Morais Júnior AU - Richard Costa Polveiro AU - Gabriel Medeiros Souza AU - Daniel Inserra Bortolin AU - Flávio Tetsuo Sassaki AU - Alison Talis Martins Lima TI - The global population of SARS-CoV-2 is composed of six major subtypes AID - 10.1101/2020.04.14.040782 DP - 2020 Jan 01 TA - bioRxiv PG - 2020.04.14.040782 4099 - http://biorxiv.org/content/early/2020/04/15/2020.04.14.040782.short 4100 - http://biorxiv.org/content/early/2020/04/15/2020.04.14.040782.full AB - The World Health Organization characterized the COVID-19 as a pandemic in March 2020, the second pandemic of the 21st century. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a positive-stranded RNA betacoronavirus of the family Coronaviridae. Expanding virus populations, as that of SARS-CoV-2, accumulate a number of narrowly shared polymorphisms imposing a confounding effect on traditional clustering methods. In this context, approaches that reduce the complexity of the sequence space occupied by the SARS-CoV-2 population are necessary for a robust clustering. Here, we proposed the subdivision of the global SARS-CoV-2 population into sixteen well-defined subtypes by focusing on the widely shared polymorphisms in nonstructural (nsp3, nsp4, nsp6, nsp12, nsp13 and nsp14) cistrons, structural (spike and nucleocapsid) and accessory (ORF8) genes. Six virus subtypes were predominant in the population, but all sixteen showed amino acid replacements which might have phenotypic implications. We hypothesize that the virus subtypes detected in this study are records of the early stages of the SARS-CoV-2 diversification that were randomly sampled to compose the virus populations around the world, a typical founder effect. The genetic structure determined for the SARS-CoV-2 population provides substantial guidelines for maximizing the effectiveness of trials for testing the candidate vaccines or drugs.Competing Interest StatementThe authors have declared no competing interest.