Decreased transfer of value to action in Tourette syndrome

doi:10.1016/j.cortex.2019.12.027

Cortex

Volume 126, May 2020, Pages 39-48

https://doi.org/10.1016/j.cortex.2019.12.027 Get rights and content

Abstract

Objective

Tourette syndrome is a neurodevelopmental disorder putatively associated with a hyperdopaminergic state. Therefore, it seems plausible that excessive dopamine transmission in Tourette syndrome alters the ability to learn based on rewards and punishments. We tested whether Tourette syndrome patients exhibited altered reinforcement learning and corresponding feedback-related EEG deflections.

Methods

We used a reinforcement learning task providing factual and counterfactual feedback in a sample of 15 Tourette syndrome patients and matched healthy controls whilst recording EEG. The paradigm presented various reward probabilities to enforce adaptive adjustments. We employed a computational model to derive estimates of the prediction error, which we used for single-trial regression analysis of the EEG data.

Results

We found that Tourette syndrome patients showed increased choice stochasticity compared to controls. The feedback-related negativity represented an axiomatic prediction error for factual feedback and did not differ between groups. We observed attenuated P3a modulation specifically for factual feedback in Tourette syndrome patients, representing impaired coding of attention allocation.

Conclusion

Our findings indicate that cortical prediction error coding is unaffected by Tourette syndrome. Nonetheless, the transfer of learned values into choice formation is degraded, in line with a hyperdopaminergic state.

Introduction

Tourette syndrome (TS) is a childhood-onset hyperkinetic neurodevelopmental disorder characterized by the presence of motor and vocal tics. Tics share many commonalities with habitual behavior, as they are stereotyped and automatic sequences of actions that are triggered by specific internal or external stimuli (Leckman & Riddle, 2000). Common comorbid disorders in TS include obsessive-compulsive disorder (OCD), attention-deficit hyperactivity disorder (ADHD) and affective disorders (Eddy and Cavanna, 2014, Robertson, 2006, Simpson et al., 2011). The precise pathophysiology of TS remains unknown, but numerous findings point to alterations of cortico-basal ganglia-thalamo-cortical (CBGTC) loops (Mink, 2001). Although imbalances in various neurotransmitter systems have been reported for TS, the central role of dopaminergic transmission for this disorder is underlined by the effectiveness of neuroleptic medication in the treatment of TS (Huys et al., 2012, Leckman et al., 2010). The most parsimonious explanation of empirical findings might be an overall hyperdopaminergic state via increased dopaminergic innervation (Maia and Conceição, 2017, Maia and Conceição, 2018). Importantly, midbrain dopaminergic activity acts as a teaching signal (prediction error, PE) for reinforcement learning (RL) in the striatum (Balleine & O'Doherty, 2010) and probabilistic learning has been repeatedly shown to be impaired in TS (Kéri et al., 2002, Marsh et al., 2004). On the other hand, learning impairments in TS have been attributed to neuroleptic treatment, comorbid OCD or ADHD (Shephard et al., 2016a, Shephard et al., 2016b, Worbe et al., 2011).

Various theories postulate different predictions for RL alterations in a hyperdopaminergic state. One influential view hypothesizes increased impact of rewards alongside decreased impact of punishments (Palminteri et al., 2009). Alternative accounts state that increased tonic dopamine should diminish the impact of learned stimulus values on choices and thus increase choice stochasticity (Beeler, 2012, Hamid et al., 2015). Furthermore, current models propose that value computation in the striatum is context-sensitive; meaning that successful avoidance of a loss elicits a positive dopaminergic RL signal (Kishida et al., 2015, Palminteri et al., 2015). Importantly, learning from feedback is biased such that confirmatory feedback (i.e. obtained reward) is preferentially taken into account and this bias extends to counterfactual information (i.e. successfully avoided losses) (Palminteri, Lefebvre, Kilford, & Blakemore, 2017). This suggests that the hyperdopaminergic state in TS might increase learning from both rewards and successfully avoided losses (Palminteri et al., 2009, Palminteri et al., 2017) or increase choice stochasticity irrespective of factual and counterfactual information (Beeler, 2012, Hamid et al., 2015).

Cortical processes in probabilistic RL can be readily analyzed as event-related potentials (ERPs). The feedback-related negativity (FRN) is a fronto-central deflection from 200 to 300 ms following feedback presentation and encodes a PE signal (Fischer and Ullsperger, 2013, Sambrook and Goslin, 2015) which is supposed to stem from the medial frontal cortex (MFC). The P3a is a positive fronto-central ERP peaking between 300 and 500 ms following feedback and is thought to reflect allocation of attention toward relevant information (Polich, 2007). The P3b is a positive centro-parietal ERP from 400 to 600 ms which has been associated with updating of an internal prediction model (Fischer and Ullsperger, 2013, Polich, 2007). In healthy subjects, MFC activity has been linked to value updating for factual, but not counterfactual feedback while parietal activity predicted behavioral adaptation for both types of feedback (Fischer and Ullsperger, 2013, Jocham et al., 2014).

In this study, we aimed to further characterize probabilistic learning and its temporally resolved neural correlates in TS. We employed a probabilistic learning task while recording high-density EEG in TS patients and matched healthy controls. Using multiple variants of a standard RL model, we tested whether behavior is preferentially guided by confirmatory feedback. To assess the interrelation between behavior and neural activity, we employed single-trial regression analyses of model-derived predictors onto the EEG signal. We then compared the resulting regression weights between groups. We hypothesized impaired probabilistic learning in TS and expected these changes to be reflected in decreased regression weights. In an exploratory analysis we also evaluated whether the behavioral model parameters were related to clinical scores in the TS group.

Section snippets

Participants

Fifteen TS patients were recruited at the University Hospital Cologne, and a control group of 15 healthy individuals was gathered through public advertisements (for demographic data see Table 1). Healthy individuals with no history or current psychiatric or neurological disorder were matched to the patient group according to sex, age, handedness and education. TS patients had no reported comorbidities. A total of six patients were treated with neuroleptic medication (three with aripiprazole

Behavior

When testing for the difference in adaptive choices, a significant main effect of Group (F_1,28 = 14.608, p = .001, η_p² = .343), but no main effect of Condition (F_1,28 = 1.773, p = .19, η_p² = .060) and no Condition × Group interaction (F_1,28 = .007, p = .93, η_p² = .000) was observed. This indicates a general learning impairment for the TS group irrespective of reward probability (Fig. 1). In the neutral condition, no difference of choice rate was observed between groups (t₂₈ = −.008, p = .994),

Discussion

We explored probabilistic learning in TS patients by combining computational modeling and single-trial EEG regression, while differentiating between learning from factual and counterfactual feedback. TS patients showed decreased learning performance overall, which could be attributed to increased choice stochasticity rather than differences in learning from outcomes per se. On a neural level, TS patients showed reduced cortical coding of factual, but not counterfactual feedback in the P3a and a

Funding

This study was funded by the German Research Foundation (KFO-219, KU 2665/1-2).

Open Practices

The study in this article earned Open Materials and Open Data badges for transparent practices. Materials and data for the study are available at https://osf.io/9uqse/?view_only=a5dc605d0d194d5594c2276bed66120a.

CRediT authorship contribution statement

Thomas Schüller: Conceptualization, Writing - review & editing. Adrian G. Fischer: Writing - review & editing. Theo O.J. Gruendler: Conceptualization, Writing - review & editing. Juan Carlos Baldermann: Writing - review & editing. Daniel Huys: Writing - review & editing. Markus Ullsperger: Conceptualization, Writing - review & editing. Jens Kuhn: Conceptualization, Writing - review & editing.

Declaration of Competing Interest

None.

Acknowledgements

We would like to thank Elena Sildatke for her assistance with data acquisition.

References (38)

A. Delorme et al.
EEGLAB: An open source toolbox for analysis of single-trial EEG dynamics including independent component analysis
Journal of Neuroscience Methods
(2004)
C.M. Eddy et al.
Tourette syndrome and obsessive compulsive disorder: Compulsivity along the continuum
Journal of Obsessive-Compulsive and Related Disorders
(2014)
A. Fischer et al.
Real and fictive outcomes are processed differently but converge on a common adaptive mechanism
Neuron
(2013)
S. Kéri et al.
Probabilistic classification learning in Tourette syndrome
Neuropsychologia
(2002)
J.F. Leckman et al.
Tourette's syndrome: When habit-forming systems form habits of their own?
Neuron
(2000)
J.W. Mink
Basal ganglia dysfunction in Tourette's syndrome: A new hypothesis
Pediatric Neurology
(2001)
J. Polich
Updating P300: An integrative theory of P3a and P3b
Clinical Neurophysiology : Official Journal of the International Federation of Clinical Neurophysiology
(2007)
M.M. Robertson
Mood disorders and gilles de la Tourette's syndrome: An update on prevalence, etiology, comorbidity, clinical associations, and implications
Journal of Psychosomatic Research
(2006)
E. Shephard et al.
Electrophysiological correlates of reinforcement learning in young people with Tourette syndrome with and without co-occurring ADHD symptoms
International Journal of Developmental Neuroscience
(2016)
A. Vo et al.
Effects of levodopa on stimulus-response learning versus response selection in healthy young adults
Behavioural Brain Research
(2017)

M.M. Walsh et al.

Learning from experience: Event-related potential correlates of reward processing, neural adaptation, and behavioral choice

Neuroscience and Biobehavioral Reviews

(2012)

B.W. Balleine et al.

Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action

Neuropsychopharmacology: Official Publication of the American College of Neuropsychopharmacology

(2010)

J.A. Beeler

Thorndike's law 2.0: Dopamine and the regulation of thrift

Frontiers in Neuroscience

(2012)

A.G.E. Collins et al.

Surprise! Dopamine signals mix action, value and error

Nature Neuroscience

(2015)

C.M.C. Correa et al.

How the level of reward awareness changes the computational and electrophysiological signatures of reinforcement learning

The Journal of Neuroscience

(2018)

H. Fuhrer et al.

Levodopa inhibits habit-learning in Parkinson's disease

Journal of Neural Transmission

(2014)

A.A. Hamid et al.

Mesolimbic dopamine signals the value of work

Nature Neuroscience

(2015)

Q.J.M. Huys et al.

Disentangling the roles of approach, activation and valence in instrumental and pavlovian responding

Plos Computational Biology

(2011)

D. Huys et al.

Update on the role of antipsychotics in the treatment of Tourette syndrome

Neuropsychiatric Disease and Treatment

(2012)

Cited by (14)

Signed and unsigned effects of prediction error on memory: Is it a matter of choice?
2023, Neuroscience and Biobehavioral Reviews
Adaptive decision-making is governed by at least two types of memory processes. On the one hand, learned predictions through integrating multiple experiences, and on the other hand, one-shot episodic memories. These two processes interact, and predictions – particularly prediction errors – influence how episodic memories are encoded. However, studies using computational models disagree on the exact shape of this relationship, with some findings showing an effect of signed prediction errors and others showing an effect of unsigned prediction errors on episodic memory. We argue that the choice-confirmation bias, which reflects stronger learning from choice-confirming compared to disconfirming outcomes, could explain these seemingly diverging results. Our perspective implies that the influence of prediction errors on episodic encoding critically depends on whether people can freely choose between options (i.e., instrumental learning tasks) or not (Pavlovian learning tasks). The choice-confirmation bias on memory encoding might have evolved to prioritize memory representations that optimize reward-guided decision-making. We conclude by discussing open issues and implications for future studies.
Feedback-related EEG dynamics separately reflect decision parameters, biases, and future choices
2022, NeuroImage
Optimal decision making in complex environments requires dynamic learning from unexpected events. To speed up learning, we should heavily weight information that indicates state-action-outcome contingency changes and ignore uninformative fluctuations in the environment. Often, however, unrelated information is hard to ignore and can potentially bias our learning. Here we used computational modelling and EEG to investigate learning behaviour in a modified probabilistic choice task that introduced two task-irrelevant factors that were uninformative for optimal task performance, but nevertheless could potentially bias learning: pay-out magnitudes were varied randomly and, occasionally, feedback presentation was enhanced by visual surprise. We found that participants’ overall good learning performance was biased by distinct effects of these non-normative factors. On the neural level, these parameters are represented in a dynamic and spatiotemporally dissociable sequence of EEG activity. Later in feedback processing the different streams converged on a central to centroparietal positivity reflecting a signal that is interpreted by downstream learning processes that adjust future behaviour.
The computational roots of positivity and confirmation biases in reinforcement learning
2022, Trends in Cognitive Sciences
Citation Excerpt :
Additional model comparison analyses showed that the four learning rate model could be reduced to a two learning rate model, featuring a single parameter for all confirmatory and disconfirmatory feedback, respectively (Figure 1C). The symmetrical pattern of learning rates, as well as the superiority of this implementation of choice confirmation bias against other models, has been replicated several times in RL tasks that include both partial and complete feedback information [47–49]. In a follow-up study that further investigated the choice-related aspects of the positivity bias, standard instrumental trials were interleaved with observational trials, where participants observed the computer making a choice for them and the resulting outcome [45].
Humans do not integrate new information objectively: outcomes carrying a positive affective value and evidence confirming one’s own prior belief are overweighed. Until recently, theoretical and empirical accounts of the positivity and confirmation biases assumed them to be specific to ‘high-level’ belief updates. We present evidence against this account. Learning rates in reinforcement learning (RL) tasks, estimated across different contexts and species, generally present the same characteristic asymmetry, suggesting that belief and value updating processes share key computational principles and distortions. This bias generates over-optimistic expectations about the probability of making the right choices and, consequently, generates over-optimistic reward expectations. We discuss the normative and neurobiological roots of these RL biases and their position within the greater picture of behavioral decision-making theories.
Tic disorders in children as polyethological nosology
2024, Obozrenie Psihiatrii i Medicinskoj Psihologii Imeni V.M. Bekhtereva
Beyond peaks and troughs: Multiplexed performance monitoring signals in the EEG
2024, Psychophysiology
Transdiagnostic inflexible learning dynamics explain deficits in depression and schizophrenia
2024, Brain

View all citing articles on Scopus

¹: Indicates shared first authorship.

²: Indicates shared senior authorship.

View full text

Special Issue “The Neuropsychology of Unwanted Thoughts and Actions”: Research ReportDecreased transfer of value to action in Tourette syndrome

Abstract

Objective

Methods

Results

Conclusion

Introduction

Section snippets

Participants

Behavior

Discussion

Funding

Open Practices

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgements

Journal of Neuroscience Methods

Journal of Obsessive-Compulsive and Related Disorders

Neuron

Neuropsychologia

Neuron

Pediatric Neurology

Clinical Neurophysiology : Official Journal of the International Federation of Clinical Neurophysiology

Journal of Psychosomatic Research

International Journal of Developmental Neuroscience

Behavioural Brain Research

Neuroscience and Biobehavioral Reviews

Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action

Neuropsychopharmacology: Official Publication of the American College of Neuropsychopharmacology

Thorndike's law 2.0: Dopamine and the regulation of thrift

Frontiers in Neuroscience

Surprise! Dopamine signals mix action, value and error

Nature Neuroscience

How the level of reward awareness changes the computational and electrophysiological signatures of reinforcement learning

The Journal of Neuroscience

Levodopa inhibits habit-learning in Parkinson's disease

Journal of Neural Transmission

Mesolimbic dopamine signals the value of work

Nature Neuroscience

Disentangling the roles of approach, activation and valence in instrumental and pavlovian responding

Plos Computational Biology

Update on the role of antipsychotics in the treatment of Tourette syndrome

Neuropsychiatric Disease and Treatment

Special Issue “The Neuropsychology of Unwanted Thoughts and Actions”: Research Report
Decreased transfer of value to action in Tourette syndrome