RT Journal Article SR Electronic T1 How People Initiate Energy Optimization and Converge on Their Optimal Gaits JF bioRxiv FD Cold Spring Harbor Laboratory SP 503433 DO 10.1101/503433 A1 Jessica C. Selinger A1 Jeremy D. Wong A1 Surabhi N. Simha A1 J. Maxwell Donelan YR 2018 UL http://biorxiv.org/content/early/2018/12/25/503433.abstract AB A central principle in motor control is that the coordination strategies learned by our nervous system are often optimal. Here we combined human experiments with computational reinforcement learning models to study how the nervous system navigates possible movements to arrive at an optimal coordination. Our experiments used robotic exoskeletons to reshape the relationship between how participants walk and how much energy they consume. We found that while some participants used their relatively high natural gait variability to explore the new energetic landscape and spontaneously initiate energy optimization, most participants preferred to exploit their originally preferred, but now suboptimal, gait. We could nevertheless reliably initiate optimization in these exploiters by providing them with the experience of lower cost gaits suggesting that the nervous system benefits from cues about the relevant dimensions along which to re-optimize its coordination. Once optimization was initiated, we found that the nervous system employed a local search process to converge on the new optimum gait over tens of seconds. Once optimization was completed, the nervous system learned to predict this new optimal gait and rapidly returned to it within a few steps if perturbed away. We model this optimization process as reinforcement learning and find behavior that closely matches these experimental observations. We conclude that the nervous system optimizes for energy using a prediction of the optimal gait, and then refines this prediction with the cost of each new walking step.