RT Journal Article SR Electronic T1 Evidence that inconsistent gene prediction can mislead analysis of algal genomes JF bioRxiv FD Cold Spring Harbor Laboratory SP 690040 DO 10.1101/690040 A1 Yibi Chen A1 Raúl A. González-Pech A1 Timothy G. Stephens A1 Debashish Bhattacharya A1 Cheong Xin Chan YR 2019 UL http://biorxiv.org/content/early/2019/07/22/690040.abstract AB Comparative algal genomics often relies on predicted gene models from de novo assembled genomes. However, the artifacts introduced by different gene-prediction approaches, and their impact on comparative genomic analysis, remain poorly understood. Here, using available genome data from six dinoflagellate species in Symbiodiniaceae, we identified potential methodological biases in the published gene models that were predicted using different approaches. We developed and applied a comprehensive customized workflow to predict genes from these genomes. The observed variation among predicted gene models resulting from our workflow agreed with current understanding of phylogenetic relationships among these taxa, whereas those published earlier were largely biased by the distinct approaches used in each instance. Importantly, these biases mislead the inference of homologous gene families and synteny among genomes, thus impacting biological interpretation of these data. Our results demonstrate that a consistent gene-prediction approach is critical for comparative genomics, particularly for non-model algal genomes.