PT  - JOURNAL ARTICLE
AU  - Dhong-gun Won
AU  - Kyoungyeul Lee
TI  - 3Cnet: Pathogenicity prediction of human variants using knowledge transfer with deep recurrent neural networks
AID  - 10.1101/2020.09.27.302927
DP  - 2020 Jan 01
TA  - bioRxiv
PG  - 2020.09.27.302927
4099  - http://biorxiv.org/content/early/2020/09/28/2020.09.27.302927.short
4100  - http://biorxiv.org/content/early/2020/09/28/2020.09.27.302927.full
AB  - Thanks to the improvement of New Generation Sequencing (NGS), genome-based diagnosis for rare disease patients become possible. However, accurate interpretation of human variants requires massive amount of knowledge gathered from previous researches and clinical cases. Also, manual analysis for each variant in the genome of patients takes enormous time and effort of clinical experts and medical doctors. Therefore, to reduce the cost of diagnosis, various computational tools have been developed for the pathogenicity prediction of human variants. Nevertheless, there has been the circularity problem of conventional tools, which leads to the overlap of training data and eventually causes overfitting of algorithms. In this research, we developed a pathogenicity predictor, named as 3Cnet, using deep recurrent neural networks which analyzes the amino-acid context of a missense mutation. 3Cnet utilizes knowledge transfer of evolutionary conservation to train insufficient clinical data without overfitting. The performance comparison clearly shows that 3Cnet can find the true disease-causing variant from a large number of missense variants in the genome of a patient with higher sensitivity (recall = 13.9 %) compared to other prediction tools such as REVEL (recall = 7.5 %) or PrimateAI (recall = 6.4 %). Consequently, 3Cnet can improve the diagnostic rate for patients and discover novel pathogenic variants with high probability.Competing Interest StatementThe authors have declared no competing interest.