RT Journal Article SR Electronic T1 Ensemble Machine Learning Modeling for the Prediction of Artemisinin Resistance in Malaria JF bioRxiv FD Cold Spring Harbor Laboratory SP 856922 DO 10.1101/856922 A1 Colby T. Ford A1 Daniel Janies YR 2019 UL http://biorxiv.org/content/early/2019/11/27/856922.abstract AB Antiparasitic resistance in malaria is a growing concern affecting many areas of the eastern world. Since the emergence of artemisinin resistance in the late 2000s in Cambodia, research into the underlying mechanisms has been underway.The 2019 Malaria Dream Challenge posited the task of developing computational models that address important problems in advancing the fight against malaria. The first goal was to accurately predict Artemisinin drug resistance levels of Plasmodium falciparum isolates, as quantified by the IC50. The second goal was to predict the parasite clearance rate of malaria parasite isolates based on in vitro transcriptional profiles.In this work, we develop machine learning models using novel methods for transforming isolate data and handling the tens of thousands of variables that result from these data transformation exercises. This is demonstrated by using massively parallel processing of the data vectorization for use in scalable machine learning. In addition, we show the utility of ensemble machine learning modeling for highly effective predictions of both goals of this challenge. This is demonstrated by the use of multiple machine learning algorithms combined with various scaling and normalization preprocessing steps. Then, using a voting ensemble, multiple models are combined to generate a final model prediction.