PT  - JOURNAL ARTICLE
AU  - Brydon Eastman
AU  - Michelle Przedborski
AU  - Mohammad Kohandel
TI  - Reinforcement learning derived chemotherapeutic schedules for robust patient-specific therapy
AID  - 10.1101/2021.04.23.441182
DP  - 2021 Jan 01
TA  - bioRxiv
PG  - 2021.04.23.441182
4099  - http://biorxiv.org/content/early/2021/04/26/2021.04.23.441182.short
4100  - http://biorxiv.org/content/early/2021/04/26/2021.04.23.441182.full
AB  - The in-silico development of a chemotherapeutic dosing schedule for treating cancer relies upon a parameterization of a particular tumour growth model to describe the dynamics of the cancer in response to the dose of the drug. In practice, it is often prohibitively difficult to ensure the validity of patient-specific parameterizations of these models for any particular patient. As a result, sensitivities to these particular parameters can result in therapeutic dosing schedules that are optimal in principle not performing well on particular patients. In this study, we demonstrate that chemotherapeutic dosing strategies learned via reinforcement learning methods are more robust to perturbations in patient-specific parameter values than those learned via classical optimal control methods. By training a reinforcement learning agent on mean-value parameters and allowing the agent periodic access to a more easily measurable metric, relative bone marrow density, for the purpose of optimizing dose schedule while reducing drug toxicity, we are able to develop drug dosing schedules that outperform schedules learned via classical optimal control methods, even when such methods are allowed to leverage the same bone marrow measurements.Competing Interest StatementThe authors have declared no competing interest.