Elsevier

Cortex

Volume 126, May 2020, Pages 39-48
Cortex

Special Issue “The Neuropsychology of Unwanted Thoughts and Actions”: Research Report
Decreased transfer of value to action in Tourette syndrome

https://doi.org/10.1016/j.cortex.2019.12.027Get rights and content

Abstract

Objective

Tourette syndrome is a neurodevelopmental disorder putatively associated with a hyperdopaminergic state. Therefore, it seems plausible that excessive dopamine transmission in Tourette syndrome alters the ability to learn based on rewards and punishments. We tested whether Tourette syndrome patients exhibited altered reinforcement learning and corresponding feedback-related EEG deflections.

Methods

We used a reinforcement learning task providing factual and counterfactual feedback in a sample of 15 Tourette syndrome patients and matched healthy controls whilst recording EEG. The paradigm presented various reward probabilities to enforce adaptive adjustments. We employed a computational model to derive estimates of the prediction error, which we used for single-trial regression analysis of the EEG data.

Results

We found that Tourette syndrome patients showed increased choice stochasticity compared to controls. The feedback-related negativity represented an axiomatic prediction error for factual feedback and did not differ between groups. We observed attenuated P3a modulation specifically for factual feedback in Tourette syndrome patients, representing impaired coding of attention allocation.

Conclusion

Our findings indicate that cortical prediction error coding is unaffected by Tourette syndrome. Nonetheless, the transfer of learned values into choice formation is degraded, in line with a hyperdopaminergic state.

Introduction

Tourette syndrome (TS) is a childhood-onset hyperkinetic neurodevelopmental disorder characterized by the presence of motor and vocal tics. Tics share many commonalities with habitual behavior, as they are stereotyped and automatic sequences of actions that are triggered by specific internal or external stimuli (Leckman & Riddle, 2000). Common comorbid disorders in TS include obsessive-compulsive disorder (OCD), attention-deficit hyperactivity disorder (ADHD) and affective disorders (Eddy and Cavanna, 2014, Robertson, 2006, Simpson et al., 2011). The precise pathophysiology of TS remains unknown, but numerous findings point to alterations of cortico-basal ganglia-thalamo-cortical (CBGTC) loops (Mink, 2001). Although imbalances in various neurotransmitter systems have been reported for TS, the central role of dopaminergic transmission for this disorder is underlined by the effectiveness of neuroleptic medication in the treatment of TS (Huys et al., 2012, Leckman et al., 2010). The most parsimonious explanation of empirical findings might be an overall hyperdopaminergic state via increased dopaminergic innervation (Maia and Conceição, 2017, Maia and Conceição, 2018). Importantly, midbrain dopaminergic activity acts as a teaching signal (prediction error, PE) for reinforcement learning (RL) in the striatum (Balleine & O'Doherty, 2010) and probabilistic learning has been repeatedly shown to be impaired in TS (Kéri et al., 2002, Marsh et al., 2004). On the other hand, learning impairments in TS have been attributed to neuroleptic treatment, comorbid OCD or ADHD (Shephard et al., 2016a, Shephard et al., 2016b, Worbe et al., 2011).

Various theories postulate different predictions for RL alterations in a hyperdopaminergic state. One influential view hypothesizes increased impact of rewards alongside decreased impact of punishments (Palminteri et al., 2009). Alternative accounts state that increased tonic dopamine should diminish the impact of learned stimulus values on choices and thus increase choice stochasticity (Beeler, 2012, Hamid et al., 2015). Furthermore, current models propose that value computation in the striatum is context-sensitive; meaning that successful avoidance of a loss elicits a positive dopaminergic RL signal (Kishida et al., 2015, Palminteri et al., 2015). Importantly, learning from feedback is biased such that confirmatory feedback (i.e. obtained reward) is preferentially taken into account and this bias extends to counterfactual information (i.e. successfully avoided losses) (Palminteri, Lefebvre, Kilford, & Blakemore, 2017). This suggests that the hyperdopaminergic state in TS might increase learning from both rewards and successfully avoided losses (Palminteri et al., 2009, Palminteri et al., 2017) or increase choice stochasticity irrespective of factual and counterfactual information (Beeler, 2012, Hamid et al., 2015).

Cortical processes in probabilistic RL can be readily analyzed as event-related potentials (ERPs). The feedback-related negativity (FRN) is a fronto-central deflection from 200 to 300 ms following feedback presentation and encodes a PE signal (Fischer and Ullsperger, 2013, Sambrook and Goslin, 2015) which is supposed to stem from the medial frontal cortex (MFC). The P3a is a positive fronto-central ERP peaking between 300 and 500 ms following feedback and is thought to reflect allocation of attention toward relevant information (Polich, 2007). The P3b is a positive centro-parietal ERP from 400 to 600 ms which has been associated with updating of an internal prediction model (Fischer and Ullsperger, 2013, Polich, 2007). In healthy subjects, MFC activity has been linked to value updating for factual, but not counterfactual feedback while parietal activity predicted behavioral adaptation for both types of feedback (Fischer and Ullsperger, 2013, Jocham et al., 2014).

In this study, we aimed to further characterize probabilistic learning and its temporally resolved neural correlates in TS. We employed a probabilistic learning task while recording high-density EEG in TS patients and matched healthy controls. Using multiple variants of a standard RL model, we tested whether behavior is preferentially guided by confirmatory feedback. To assess the interrelation between behavior and neural activity, we employed single-trial regression analyses of model-derived predictors onto the EEG signal. We then compared the resulting regression weights between groups. We hypothesized impaired probabilistic learning in TS and expected these changes to be reflected in decreased regression weights. In an exploratory analysis we also evaluated whether the behavioral model parameters were related to clinical scores in the TS group.

Section snippets

Participants

Fifteen TS patients were recruited at the University Hospital Cologne, and a control group of 15 healthy individuals was gathered through public advertisements (for demographic data see Table 1). Healthy individuals with no history or current psychiatric or neurological disorder were matched to the patient group according to sex, age, handedness and education. TS patients had no reported comorbidities. A total of six patients were treated with neuroleptic medication (three with aripiprazole

Behavior

When testing for the difference in adaptive choices, a significant main effect of Group (F1,28 = 14.608, p = .001, ηp2 = .343), but no main effect of Condition (F1,28 = 1.773, p = .19, ηp2 = .060) and no Condition × Group interaction (F1,28 = .007, p = .93, ηp2 = .000) was observed. This indicates a general learning impairment for the TS group irrespective of reward probability (Fig. 1). In the neutral condition, no difference of choice rate was observed between groups (t28 = −.008, p = .994),

Discussion

We explored probabilistic learning in TS patients by combining computational modeling and single-trial EEG regression, while differentiating between learning from factual and counterfactual feedback. TS patients showed decreased learning performance overall, which could be attributed to increased choice stochasticity rather than differences in learning from outcomes per se. On a neural level, TS patients showed reduced cortical coding of factual, but not counterfactual feedback in the P3a and a

Funding

This study was funded by the German Research Foundation (KFO-219, KU 2665/1-2).

Open Practices

The study in this article earned Open Materials and Open Data badges for transparent practices. Materials and data for the study are available at https://osf.io/9uqse/?view_only=a5dc605d0d194d5594c2276bed66120a.

CRediT authorship contribution statement

Thomas Schüller: Conceptualization, Writing - review & editing. Adrian G. Fischer: Writing - review & editing. Theo O.J. Gruendler: Conceptualization, Writing - review & editing. Juan Carlos Baldermann: Writing - review & editing. Daniel Huys: Writing - review & editing. Markus Ullsperger: Conceptualization, Writing - review & editing. Jens Kuhn: Conceptualization, Writing - review & editing.

Declaration of Competing Interest

None.

Acknowledgements

We would like to thank Elena Sildatke for her assistance with data acquisition.

References (38)

  • M.M. Walsh et al.

    Learning from experience: Event-related potential correlates of reward processing, neural adaptation, and behavioral choice

    Neuroscience and Biobehavioral Reviews

    (2012)
  • B.W. Balleine et al.

    Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action

    Neuropsychopharmacology: Official Publication of the American College of Neuropsychopharmacology

    (2010)
  • J.A. Beeler

    Thorndike's law 2.0: Dopamine and the regulation of thrift

    Frontiers in Neuroscience

    (2012)
  • A.G.E. Collins et al.

    Surprise! Dopamine signals mix action, value and error

    Nature Neuroscience

    (2015)
  • C.M.C. Correa et al.

    How the level of reward awareness changes the computational and electrophysiological signatures of reinforcement learning

    The Journal of Neuroscience

    (2018)
  • H. Fuhrer et al.

    Levodopa inhibits habit-learning in Parkinson's disease

    Journal of Neural Transmission

    (2014)
  • A.A. Hamid et al.

    Mesolimbic dopamine signals the value of work

    Nature Neuroscience

    (2015)
  • Q.J.M. Huys et al.

    Disentangling the roles of approach, activation and valence in instrumental and pavlovian responding

    Plos Computational Biology

    (2011)
  • D. Huys et al.

    Update on the role of antipsychotics in the treatment of Tourette syndrome

    Neuropsychiatric Disease and Treatment

    (2012)
  • Cited by (14)

    • The computational roots of positivity and confirmation biases in reinforcement learning

      2022, Trends in Cognitive Sciences
      Citation Excerpt :

      Additional model comparison analyses showed that the four learning rate model could be reduced to a two learning rate model, featuring a single parameter for all confirmatory and disconfirmatory feedback, respectively (Figure 1C). The symmetrical pattern of learning rates, as well as the superiority of this implementation of choice confirmation bias against other models, has been replicated several times in RL tasks that include both partial and complete feedback information [47–49]. In a follow-up study that further investigated the choice-related aspects of the positivity bias, standard instrumental trials were interleaved with observational trials, where participants observed the computer making a choice for them and the resulting outcome [45].

    • Tic disorders in children as polyethological nosology

      2024, Obozrenie Psihiatrii i Medicinskoj Psihologii Imeni V.M. Bekhtereva
    View all citing articles on Scopus
    1

    Indicates shared first authorship.

    2

    Indicates shared senior authorship.

    View full text