Special Issue “The Neuropsychology of Unwanted Thoughts and Actions”: Research ReportDecreased transfer of value to action in Tourette syndrome
Introduction
Tourette syndrome (TS) is a childhood-onset hyperkinetic neurodevelopmental disorder characterized by the presence of motor and vocal tics. Tics share many commonalities with habitual behavior, as they are stereotyped and automatic sequences of actions that are triggered by specific internal or external stimuli (Leckman & Riddle, 2000). Common comorbid disorders in TS include obsessive-compulsive disorder (OCD), attention-deficit hyperactivity disorder (ADHD) and affective disorders (Eddy and Cavanna, 2014, Robertson, 2006, Simpson et al., 2011). The precise pathophysiology of TS remains unknown, but numerous findings point to alterations of cortico-basal ganglia-thalamo-cortical (CBGTC) loops (Mink, 2001). Although imbalances in various neurotransmitter systems have been reported for TS, the central role of dopaminergic transmission for this disorder is underlined by the effectiveness of neuroleptic medication in the treatment of TS (Huys et al., 2012, Leckman et al., 2010). The most parsimonious explanation of empirical findings might be an overall hyperdopaminergic state via increased dopaminergic innervation (Maia and Conceição, 2017, Maia and Conceição, 2018). Importantly, midbrain dopaminergic activity acts as a teaching signal (prediction error, PE) for reinforcement learning (RL) in the striatum (Balleine & O'Doherty, 2010) and probabilistic learning has been repeatedly shown to be impaired in TS (Kéri et al., 2002, Marsh et al., 2004). On the other hand, learning impairments in TS have been attributed to neuroleptic treatment, comorbid OCD or ADHD (Shephard et al., 2016a, Shephard et al., 2016b, Worbe et al., 2011).
Various theories postulate different predictions for RL alterations in a hyperdopaminergic state. One influential view hypothesizes increased impact of rewards alongside decreased impact of punishments (Palminteri et al., 2009). Alternative accounts state that increased tonic dopamine should diminish the impact of learned stimulus values on choices and thus increase choice stochasticity (Beeler, 2012, Hamid et al., 2015). Furthermore, current models propose that value computation in the striatum is context-sensitive; meaning that successful avoidance of a loss elicits a positive dopaminergic RL signal (Kishida et al., 2015, Palminteri et al., 2015). Importantly, learning from feedback is biased such that confirmatory feedback (i.e. obtained reward) is preferentially taken into account and this bias extends to counterfactual information (i.e. successfully avoided losses) (Palminteri, Lefebvre, Kilford, & Blakemore, 2017). This suggests that the hyperdopaminergic state in TS might increase learning from both rewards and successfully avoided losses (Palminteri et al., 2009, Palminteri et al., 2017) or increase choice stochasticity irrespective of factual and counterfactual information (Beeler, 2012, Hamid et al., 2015).
Cortical processes in probabilistic RL can be readily analyzed as event-related potentials (ERPs). The feedback-related negativity (FRN) is a fronto-central deflection from 200 to 300 ms following feedback presentation and encodes a PE signal (Fischer and Ullsperger, 2013, Sambrook and Goslin, 2015) which is supposed to stem from the medial frontal cortex (MFC). The P3a is a positive fronto-central ERP peaking between 300 and 500 ms following feedback and is thought to reflect allocation of attention toward relevant information (Polich, 2007). The P3b is a positive centro-parietal ERP from 400 to 600 ms which has been associated with updating of an internal prediction model (Fischer and Ullsperger, 2013, Polich, 2007). In healthy subjects, MFC activity has been linked to value updating for factual, but not counterfactual feedback while parietal activity predicted behavioral adaptation for both types of feedback (Fischer and Ullsperger, 2013, Jocham et al., 2014).
In this study, we aimed to further characterize probabilistic learning and its temporally resolved neural correlates in TS. We employed a probabilistic learning task while recording high-density EEG in TS patients and matched healthy controls. Using multiple variants of a standard RL model, we tested whether behavior is preferentially guided by confirmatory feedback. To assess the interrelation between behavior and neural activity, we employed single-trial regression analyses of model-derived predictors onto the EEG signal. We then compared the resulting regression weights between groups. We hypothesized impaired probabilistic learning in TS and expected these changes to be reflected in decreased regression weights. In an exploratory analysis we also evaluated whether the behavioral model parameters were related to clinical scores in the TS group.
Section snippets
Participants
Fifteen TS patients were recruited at the University Hospital Cologne, and a control group of 15 healthy individuals was gathered through public advertisements (for demographic data see Table 1). Healthy individuals with no history or current psychiatric or neurological disorder were matched to the patient group according to sex, age, handedness and education. TS patients had no reported comorbidities. A total of six patients were treated with neuroleptic medication (three with aripiprazole
Behavior
When testing for the difference in adaptive choices, a significant main effect of Group (F1,28 = 14.608, p = .001, ηp2 = .343), but no main effect of Condition (F1,28 = 1.773, p = .19, ηp2 = .060) and no Condition × Group interaction (F1,28 = .007, p = .93, ηp2 = .000) was observed. This indicates a general learning impairment for the TS group irrespective of reward probability (Fig. 1). In the neutral condition, no difference of choice rate was observed between groups (t28 = −.008, p = .994),
Discussion
We explored probabilistic learning in TS patients by combining computational modeling and single-trial EEG regression, while differentiating between learning from factual and counterfactual feedback. TS patients showed decreased learning performance overall, which could be attributed to increased choice stochasticity rather than differences in learning from outcomes per se. On a neural level, TS patients showed reduced cortical coding of factual, but not counterfactual feedback in the P3a and a
Funding
This study was funded by the German Research Foundation (KFO-219, KU 2665/1-2).
Open Practices
The study in this article earned Open Materials and Open Data badges for transparent practices. Materials and data for the study are available at https://osf.io/9uqse/?view_only=a5dc605d0d194d5594c2276bed66120a.
CRediT authorship contribution statement
Thomas Schüller: Conceptualization, Writing - review & editing. Adrian G. Fischer: Writing - review & editing. Theo O.J. Gruendler: Conceptualization, Writing - review & editing. Juan Carlos Baldermann: Writing - review & editing. Daniel Huys: Writing - review & editing. Markus Ullsperger: Conceptualization, Writing - review & editing. Jens Kuhn: Conceptualization, Writing - review & editing.
Declaration of Competing Interest
None.
Acknowledgements
We would like to thank Elena Sildatke for her assistance with data acquisition.
References (38)
- et al.
EEGLAB: An open source toolbox for analysis of single-trial EEG dynamics including independent component analysis
Journal of Neuroscience Methods
(2004) - et al.
Tourette syndrome and obsessive compulsive disorder: Compulsivity along the continuum
Journal of Obsessive-Compulsive and Related Disorders
(2014) - et al.
Real and fictive outcomes are processed differently but converge on a common adaptive mechanism
Neuron
(2013) - et al.
Probabilistic classification learning in Tourette syndrome
Neuropsychologia
(2002) - et al.
Tourette's syndrome: When habit-forming systems form habits of their own?
Neuron
(2000) Basal ganglia dysfunction in Tourette's syndrome: A new hypothesis
Pediatric Neurology
(2001)Updating P300: An integrative theory of P3a and P3b
Clinical Neurophysiology : Official Journal of the International Federation of Clinical Neurophysiology
(2007)Mood disorders and gilles de la Tourette's syndrome: An update on prevalence, etiology, comorbidity, clinical associations, and implications
Journal of Psychosomatic Research
(2006)- et al.
Electrophysiological correlates of reinforcement learning in young people with Tourette syndrome with and without co-occurring ADHD symptoms
International Journal of Developmental Neuroscience
(2016) - et al.
Effects of levodopa on stimulus-response learning versus response selection in healthy young adults
Behavioural Brain Research
(2017)
Learning from experience: Event-related potential correlates of reward processing, neural adaptation, and behavioral choice
Neuroscience and Biobehavioral Reviews
Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action
Neuropsychopharmacology: Official Publication of the American College of Neuropsychopharmacology
Thorndike's law 2.0: Dopamine and the regulation of thrift
Frontiers in Neuroscience
Surprise! Dopamine signals mix action, value and error
Nature Neuroscience
How the level of reward awareness changes the computational and electrophysiological signatures of reinforcement learning
The Journal of Neuroscience
Levodopa inhibits habit-learning in Parkinson's disease
Journal of Neural Transmission
Mesolimbic dopamine signals the value of work
Nature Neuroscience
Disentangling the roles of approach, activation and valence in instrumental and pavlovian responding
Plos Computational Biology
Update on the role of antipsychotics in the treatment of Tourette syndrome
Neuropsychiatric Disease and Treatment
Cited by (14)
Signed and unsigned effects of prediction error on memory: Is it a matter of choice?
2023, Neuroscience and Biobehavioral ReviewsThe computational roots of positivity and confirmation biases in reinforcement learning
2022, Trends in Cognitive SciencesCitation Excerpt :Additional model comparison analyses showed that the four learning rate model could be reduced to a two learning rate model, featuring a single parameter for all confirmatory and disconfirmatory feedback, respectively (Figure 1C). The symmetrical pattern of learning rates, as well as the superiority of this implementation of choice confirmation bias against other models, has been replicated several times in RL tasks that include both partial and complete feedback information [47–49]. In a follow-up study that further investigated the choice-related aspects of the positivity bias, standard instrumental trials were interleaved with observational trials, where participants observed the computer making a choice for them and the resulting outcome [45].
Tic disorders in children as polyethological nosology
2024, Obozrenie Psihiatrii i Medicinskoj Psihologii Imeni V.M. BekhterevaBeyond peaks and troughs: Multiplexed performance monitoring signals in the EEG
2024, Psychophysiology