RT Journal Article SR Electronic T1 Evidence-based gene models for structural and functional annotations of the oil palm genome JF bioRxiv FD Cold Spring Harbor Laboratory SP 111120 DO 10.1101/111120 A1 Chan Kuang Lim A1 Tatiana V. Tatarinova A1 Rozana Rosli A1 Nadzirah Amiruddin A1 Norazah Azizi A1 Mohd Amin Ab Halim A1 Nik Shazana Nik Mohd Sanusi A1 Jayanthi Nagappan A1 Petr Ponomarenko A1 Martin Triska A1 Victor Solovyev A1 Mohd Firdaus-Raih A1 Ravigadevi Sambanthamurthi A1 Denis Murphy A1 Leslie Low Eng Ti YR 2017 UL http://biorxiv.org/content/early/2017/04/05/111120.abstract AB The advent of rapid and inexpensive DNA sequencing has led to an explosion of data that must be transformed into knowledge about genome organization and function. Gene prediction is customarily the starting point for genome analysis. This paper presents a bioinformatics study of the oil palm genome, including a comparative genomics analysis, database and tools development, and mining of biological data for genes of interest. We annotated 26,087 oil palm genes integrated from two gene-prediction pipelines, Fgenesh++ and Seqping. As case studies, we conducted comprehensive investigations on intronless, resistance and fatty acid biosynthesis genes, and demonstrated that the current gene prediction set is of high quality. 3,672 intronless genes were identified in the oil palm genome, an important resource for evolutionary study. Further scrutiny of the oil palm genes revealed 210 candidate resistance genes involved in pathogen defense. Fatty acids have diverse applications ranging from food to industrial feedstock, and we identified 42 key genes involved in fatty-acid biosynthesis in oil palm mesocarp and kernel. These results provide an important resource for studies on plant genomes and a theoretical foundation for marker-assisted breeding of oil palm and related crops.