Abstract
Theoretical accounts postulate that the catecholaminergic neuromodulator noradrenaline shapes cognitive behavior by reducing the impact of prior expectations on learning, inference, and decision-making. A ubiquitous effect of dynamic priors on perceptual decisions under uncertainty is choice history bias: the tendency to systematically repeat, or alternate, previous choices, even when stimulus categories are presented in a random sequence. Here, we directly test for a causal impact of catecholamines on these priors. We pharmacologically elevated catecholamine levels through the application of the noradrenaline reuptake inhibitor atomoxetine. We quantified the resulting changes in observers’ history biases in a visual perceptual decision task. Choice history biases in this task were highly idiosyncratic, tending toward choice repetition or alternation in different individuals. Atomoxetine decreased these biases (toward either repetition or alternation) compared to placebo. Behavioral modeling indicates that this bias reduction was due to a reduced bias in the accumulation of sensory evidence, rather than of the starting point of the accumulation process. Atomoxetine had no significant effect on other behavioral measures tested, including response time and choice accuracy. We conclude that catecholamines reduce the impact of a specific form of prior on perceptual decisions.
Introduction
Biases are prevalent in human decisions ranging from those based on abstract reasoning to judgments of the sensory environment1–3. One prominent source of such biases in perceptual decisions across mammalian species is the history of previous experiences4–13. When, as common in natural environments, the environmental state evaluated in the judgment is relatively stable, such history biases reflect adaptive expectations that are dynamically updated over time and improve performance5,14,15. For example, when choosing to forage for deer in the same area as during successful previous expeditions. While recent work in rodents and primates has identified correlates of such history biases in neural population activity of parietal and frontal cortical areas9,12,16–20, little is currently known about the neurotransmitter systems involved in this important form of bias.
Influential theoretical work postulates that the catecholaminergic neuromodulator noradrenaline shapes behavior by reducing the impact of prior expectations on learning, inference, and decision-making21,22. The catecholaminergic (and in particular, the noradrenaline) systems of the brainstem project to large parts of the cerebral cortex, where the neuromodulators they release change the functional properties of their target networks23–28. Consequently, they are in an ideal position to shape cortex-wide network activity underlying decision-making in a coordinated fashion. Indeed, pupil responses (a peripheral proxy of central arousal state29 point to a role of neuromodulation in regulating the impact of these history-dependent priors8. However, pupil responses reflect the activity of multiple neuromodulatory brainstem systems29–31, and the above result builds on correlative approaches. Direct evidence for the role of catecholaminergic neuromodulation in choice history biases is currently sparse in animal models13,32 and absent in humans.
Here, we aimed to provide causal evidence for the role of catecholamines in regulating (suppressing) history biases in human choice behavior. Specifically, we hypothesized that catecholamines down-regulate the impact of previous experiences on decision-making, which should translate into a reduction of choice history bias. We tested this hypothesis by pharmacologically increasing central catecholamine (in particular noradrenaline) levels through the selective noradrenaline re-uptake inhibitor atomoxetine33,34 and quantified the resulting behavioral changes in a variant of a visual perceptual decision task adopted from recent animal physiology35. Atomoxetine inhibits the presynaptic norepinephrine transporter, preventing the reuptake of norepinephrine throughout the brain along with inhibiting the reuptake of dopamine in specific brain regions such as the prefrontal cortex33,34. We found that atomoxetine reduced choice history biases by specifically interacting with the way people accumulated noisy perceptual evidence.
Results
Atomoxetine was administered orally (40 mg per session) in a within-subject, double-blind, placebo-controlled, and randomized design (Figure 1A). The behavioral task entailed the presentation of visual targets (Gabor patch of fixed orientation and varying contrast), one in the left and one in the right visual hemifield; participants were asked to report the location the larger contrast target by button press (Figure 1B; Methods). To increase sensory uncertainty and promote integration of evidence across time36, we added dynamic visual noise to both Gabor patches (Figure 1B; Methods). There were two different signal strengths (small and large difference in contrast); auditory feedback was presented 50 ms right after participant’s choices. We collected many trials per participant (range, 3520-4320) across four experimental sessions, in which we systematically varied and counterbalanced drug condition and stimulus-response mapping (Figure 1A and Figure S1; Methods).
Atomoxetine administration at this relatively low dose had a measurable effect on peripheral arousal markers, but not on the overall efficiency of the decision process for the behavioral task at hand. Atomoxetine increased both heart rate (Figure 1C) and pupil size (Figure 1D). While signal strength lawfully decreased reaction time (RT; F1,18 = 16.2, p = 0.001; two-way repeated measures ANOVA) and increased accuracy (F1,18 = 297.3, p < 0.001) (Figure 2A,B), atomoxetine changed neither of those behavioral parameters (RT: F1,18 = 2.3, p = 0.146; accuracy: F1,18 = 0.8, p = 0.383; no significant interaction of signal strength and drug; Figure 2A,B). Likewise, atomoxetine did not significantly change signal detection theoretic sensitivity (d’; Figure S2A).
By contrast, atomoxetine had a robust impact on the impact of history-based (i.e., dynamically varying throughout the course of the experiment) prior expectations on their decisions. We operationalized these time-varying prior expectations about the next target as the probability of repeating the choice from the previous trial (Figure 2C). The stimulus repetition probability was, by design, close to 0.5 (Figure 2C) (group average: 0.496; Methods), thus not explaining the sequential effects in participants’ behavior. Atomoxetine reduced the mean choice repetition probability (Figure 2D; F1,18 = 8.2, p = 0.010; two-way repeated measures ANOVA). Signal strength had no such effect (F1,18 = 0.1, p = 0.706) and signal strength and drug did not interact (F1,27 = 0.5, p = 0.507).
In the placebo sessions, most participants (15 out of 19) had a repetition probability larger than 0.5, that is, they tended to repeat their choices (Figure 2C). We, therefore, wondered if (i) atomoxetine reduced the group-level choice history biases by stereotypically pushing behavior more toward alternation (i.e., reduction of repetition probability, in line with previous pupillometry work8) or (ii) through a reduction of both repetition and alternation biases (shift of repetition probability toward 0.5). Our results are in line with the second scenario. There was a robust correlation between individual choice history biases during placebo sessions and the shift in history biases caused by atomoxetine: participants with the strongest repetition or alternation biases, exhibited the strongest shift towards a repetition probability of 0.5 (Figure 2E; corrected for reversion to the mean; Methods). Consequently, atomoxetine yielded an even more consistent and robust reduction of the absolute choice history bias, defined as the absolute value of P(repeat)−0.5 (Figure 2F; F1,18 = 10.6, p < 0.001; two-way repeated measures ANOVA). Atomoxetine did not change overall choice bias, quantified as signal detection theoretic criterion (Figure S2D-F). These findings show that atomoxetine specifically reduced the impact of history-dependent prior expectation in all participants, regardless of whether they idiosyncratically tended to repeat (i.e., assuming a relatively stable environment) or alternate (assuming a volatile environment).
How did participants’ expectations shape their decision processes, and how did atomoxetine act on this mechanism? Current models of perceptual decision-making posit the temporal accumulation of sensory evidence, resulting in an internal decision variable that grows with time36–39. When this decision variable reaches one of two decision bounds, a choice is made, and the corresponding motor response is initiated. In this framework (Figure 3A), a bias can arise in two ways: (i) by shifting the starting point of accumulation toward one of the bounds or (ii) by changing the rate of evidence accumulation toward one choice alternative.
Previous work has shown that individual biases in choice repetition probabilities are generally explained by a history-dependent bias in the drift rather than starting point, across a range of different perceptual choice tasks4. We replicated this finding for the placebo condition of the present dataset (Figure S3A). To this end, we fitted an accumulation-to-bound model of decision-making to the behavioral data. The model had five free parameters: 1) boundary separation (controlling response caution); 2) the mean drift rate (overall efficiency of evidence accumulation); 3) non-decision time (the speed of pre-decisional evidence encoding and post-decisional translation of choice into motor response); 4) the starting point of the decision; 5) an evidence-independent bias in the drift (henceforth called “drift bias”). We fitted all five parameters separately per drug condition; additionally, we fitted drift rate separately for both signal strength conditions, and both starting point and drift bias separately for the previous choice category (‘left’ vs ‘right’). The fitted model (Figure S3C) accounted well for the overall behavior in each task (Figure S3D,E). Indeed, atomoxetine also specifically reduced the history-dependent shift in drift bias, but not the history shift in starting point (Figure 3B). Furthermore, this drift bias (not starting point) effect was robustly and specifically correlated to the individual change of choice repetition probability (Figure 3C).
Discussion
Using selective pharmacological interventions in humans, we here imply catecholamines in the modulation of human choice history biases: boosting catecholamines levels through atomoxetine specifically reduced the magnitude of choice history biases, regardless of whether those promoted choice repetition or alternation. We interpret this finding in the context of computational theory postulating a down-weighting of prior expectations (which, we assume, drive the choice history biases analyzed here) in combination with new sensory evidence in the evaluation of current environmental state21,22. Catecholamines have previously been implicated in modulating learning rates40–43 and facilitating set shifting44 in dynamic environments; here, we provide causal evidence for their role in controlling the weight of historical information in perceptual decisions.
Our finding is roughly consistent with previous correlative evidence from pupillometry8,45–49. Non luminance-mediated changes of pupil diameter are an indirect marker of neuromodulatory (including noradrenergic and cholinergic) activity30,31,50–54. In dynamic environments, pupil responses are associated with violations of learned, top-down expectations and reduce their weight45–49. In stable environments, pupil responses predict a reduction of choice repetition probability8. There are, however, also notable differences between our current results and this previous pupillometry work. First, the pupil-predicted reduction of repetition probability was due to a stereotypical increase in the tendency to alternate8. In contrast, we here found that catecholamines reduce both repetition and alternation biases (shift of repetition probability toward 0.5), which is more readily in line with a reduction in prior weight. Second, our pharmacological intervention did not reduce participant’s overall choice bias (bias towards one choice alternative, irrespective of the immediate history), even though this is another commonly reported behavioral correlate of pupil responses31,55–58. One possible explanation for this difference is that atomoxetine likely increased both tonic and phasic catecholamines levels59, while task-evoked pupil responses reflect phasic neuromodulatory activity31. Another possible explanation is that while atomoxetine selectively elevates central catecholamine levels (noradrenaline and dopamine), pupil responses track not only noradrenaline30,31,50–53, but also acetylcholine30,54, serotonin60 and orexin61 levels.
Which neuromodulator specifically mediated the effects reported here? Atomoxetine increases the extracellular levels of noradrenaline, but also of (prefrontal) dopamine (Methods). Indeed, one recent report suggest that striatal dopamine may play a role in adapting choice history bias to temporal regularities in a stimulus sequence13. Thus, our results are consistent with a role of one or both catecholamines. However, the noradrenergic locus coeruleus and dopaminergic midbrain exhibit a close interplay62,63 and are co-activated with pupil responses during decisions31. Thus, it could well be that their (complex) interaction regulates choice history bias in perceptual decisions.
In sum, pharmacologically elevating central catecholamine (specifically noradrenaline) levels in human participants specifically reduces their propensity to accumulate noisy evidence for perceptual decisions in a manner that is biased by previously reported perceptual experience. This supports computational theories according to which noradrenaline reduces the impact of dynamic priors on inference and decision-making.
Materials and Methods
Participants
Nineteen participants (14 female, age range 20-26) participated in the experiment, entailing concurrent pupillometry and MEG recordings. All had normal or corrected to normal vision and no history or indications of psychological or neurological disorders. The experiment was approved by the ethics committee of the University Medical Center Hamburg-Eppendorf. Participants participated in four experimental sessions, in which we crossed drug condition (atomoxetine vs. placebo) and stimulus-response mapping (press ipsi vs. contra) (about 2 hours per session). Participants were paid 10€ per hour. One participant completed three recording sessions.
Pharmacological intervention
We used the selective noradrenaline reuptake inhibitor atomoxetine (dose, 40 mg) to boost the levels of catecholamines, specifically noradrenaline and (in the prefrontal cortex) dopamine33,34. Atomoxetine is a relatively selective inhibitor of the noradrenaline transporter, which is responsible for the natural reuptake of noradrenaline that has been released into the extracellular space. Consequently, atomoxetine acts to increase the extracellular levels of noradrenaline, an effect that has been confirmed experimentally in rat prefrontal cortex33. The same study showed that atomoxetine also increases the prefrontal levels of dopamine, which has a molecular structure very similar to the one of noradrenaline and is, in fact, a direct precursor of noradrenaline. Atomoxetine has smaller affinity to the serotonin transporter, and there are discrepant reports about the quantitative relevance of these effects: while one study found no increases in serotonin levels under atomoxetine33, a recent study reports a significant atomoxetine-related occupancy of the serotonin transporter in nonhuman primates64 at dosages that would correspond to human dosages of 1.0– 1.8 mg/kg. Note that these dosages are substantially higher than the administered dosage in this study (40 mg, independent of body weight). It is therefore unclear to what extent our atomoxetine condition affected cortical serotonin levels. A mannitol-aerosil mixture was administered as placebo. All substances were encapsulated identically to render them visually indistinguishable. Peak plasma concentrations are reached about 1–2 hours after administration for atomoxetine, and the half-life is about 5 hours65. Thus, participants received either a placebo or atomoxetine 1.5 hours before the start of the experimental session.
Behavioral task
Each trial consisted of three consecutive intervals (Figure 1A): (i) the baseline interval (uniformly distributed between 0.6 and 1.4 s) containing a dynamic noise pattern on both the left and right side of the green fixation dot; (ii) the decision interval (terminated by the participant’s response; max 3 s) containing the same noise patterns with Gabor patch embedded in each; (iii) the feedback interval (uniformly distributed between 0.25 and 1.4 s).
Two dynamic noise patterns were presented throughout the experiment. The luminance across all pixels was kept constant. This pedestal noise pattern had 20% contrast. In the decision interval two Gabor patches (sinusoidal gratings; 2 cycles per degree; phase randomly drifted left or right) were added to each noise pattern. The Gabor patches symmetrically differed from 80% contrast. On half of trials, the Gabor patch on right side of the screen was of higher contrast (signal, right); on the other half the Gabor patch on the left side was of higher contrast (signal, left). Signal location was randomly selected on each trial, under the constraint that it would occur on 50% of the trials within each block of 160 trials. Stimuli were presented within two circular patches to the left and right of the central fixation cross (diameter, 15 degrees of visual angle; eccentricity, 12 degrees).
Participants were instructed to report the location of the signal, the highest contrast Gabor patch by pressing one of two response buttons with their left or right index finger as soon as they felt sufficiently confident (“free response paradigm”). We did not give specific instructions to tradeoff accuracy for speed (or vice versa). The mapping between perceptual choice and button press (e.g., “right” –> press right key; “left” –> press left key) was counterbalanced across the four recording sessions. Participants received auditory feedback (high tone, correct; low tone, error) 50 ms after they indicated their choice.
Signal strength, the difference in contrast between the left and right signals, was individually titrated before the main experiment to two difficulty levels that yielded about 70% and 85% correct choices, using an adaptive staircase procedure (Quest)66. In the placebo sessions, we measured a mean accuracy of 74.07 % correct (± 0.97 % s.e.m.) and 80.83 % correct (± 1.12 % s.e.m), for weak and strong signal strengths respectively.
Stimuli were back-projected on a transparent screen using a Sanyo PCL-XP51 projector with a resolution of 800×600 at 24 Hz. The screen was positioned 65 cm away from their eyes. The luminance profile was linearized by measuring and correcting for the systems gamma curve. A doubling of contrast values therefore also produced a doubling of luminance differences. During the first training session stimuli were presented on a VIEWPixx monitor with the same resolution and refresh rate (also linearized). To minimize any effect of light on pupil diameter, the overall luminance of the screen was held constant throughout the experiment.
Participants performed between 22 and 27 blocks (distributed over four sessions) yielding a total of 3520–4320 trials per participant. One participant performed a total of 17 blocks (distributed over three scanning sessions), yielding a total of 2720 trials.
Data acquisition
Recordings took place in a dimly lit magnetically shielded room. Magnetoencephalography (MEG) was recorded using a whole-head CTF 275 MEG system (CTF Systems Inc., Canada) at a sampling rate of 1200 Hz. In addition, eye movements and pupil diameter were recorded with an MEG-compatible EyeLink 1000 Long Range Mount system (SR Research, Osgoode, ON, Canada). Finally, electrocardiogram (ECG) as well as vertical and horizontal electrooculogram (EOG) were acquired using Ag/AgCl electrodes at a sampling rate of 1200 Hz. MEG data will be the focus of a different report.
Analysis of electrocardiogram data
ECG data were used to analyze average heart rate. We used an adaptive threshold to detect the R-peak of each QRS-complex in the ECG. Heart rate was then computed by dividing the total number of R-components by time.
Analysis of pupil data
All analyses were performed using custom-made Python scripts, unless stated otherwise.
Preprocessing
Periods of blinks and saccades were detected using the manufacturer’s standard algorithms with default settings. The remaining data analyses were performed using custom-made Python scripts. We applied to each pupil timeseries (i) linear interpolation of missing data due to blinks or other reasons (interpolation time window, from 150 ms before until 150 ms after missing data), (ii) low-pass filtering (third-order Butterworth, cut-off: 6 Hz), (iii) removal of pupil responses to blinks and to saccades, by first estimating these responses by means of deconvolution and then removing them from the pupil time series by means of multiple linear regression67, and (iv) conversion to units of modulation (percent signal change) around the mean of the pupil time series from each measurement session.
Quantification of pre-trial pupil size
We quantified pre-trial pupil size as the mean pupil size during the 0.25 s before trial onset.
Analysis and modeling of choice behavior
All analyses were performed using custom-made Python scripts, unless stated otherwise. The first trial of each block was excluded from the analyses. We always computed behavioral metrics separately for each participant and each of the four recording sessions, and then averaged over the two SR-mappings within each drug condition.
Overt behavior
Reaction time (RT) was defined as the time from stimulus offset until the button press. Repetition probability was defined as the fraction of choices that was the same as the choice on the previous trial.
Signal-detection theoretic modeling
The signal detection68 metrics sensitivity (d’) and criterion (c) were computed separately for each of the pupil bins. We estimated d’ as the difference between z-scores of hit-rates and false-alarm rates. We estimated criterion by averaging the z-scores of hit-rates and false-alarm rates and multiplying the result by -1.
Drift diffusion modeling
We fitted the reaction time data with the drift diffusion model37,39, based on continuous maximum likelihood using the Python-package PyDDM69.
The decision dynamics were described by (Figure 3A): where y is the decision variable, s is the stimulus category (−1 for left signals; 1 for right signals), v is the drift rate and controls the overall efficiency of evidence accumulation, vbias is the drift bias which is and evidence-independent constant added to the drift, and cdW is Gaussian distributed white noise with mean 0 and variance c2Δt. The starting point of evidence accumulation z was defined as a fraction of the boundary separation: where a is the separation between the two decision bounds. Evidence accumulation terminated at 0 (“left”) or a (“right). The model was fitted separately for each recording session of each participant. In each fit, we let drift rate vary with signal strength39, and starting point and drift bias with previous choice4.
Statistical comparisons
We used a paired sample t-test to test for significant differences in heart rate, pupil size, and behavioral measures between placebo and drug sessions (Figure 1 and Figure 3). We used 2 × 2 repeated measures ANOVA to test for the main effects of drug and signal strength, as well as their interaction, on behavioral measures (Figure 2). All tests were performed two-tailed.
We performed permutation analyses to obtain the distribution of correlation coefficients predicted exclusively by regression to the mean (Figure 2E). This was done by computing the above-described across-subject correlation 10K, each time using randomly assigned “placebo” vs. “drug” labels of each participant. We then compared our observed correlation coefficient (reflecting the combined effects of regression to the mean and the relationship between overall bias and the pupil predicted shift therein) against this permutation distribution. This analysis suggested that the relationship between choice history bias and the Atomoxetine predicted shift therein was stronger than expected based on regression to the mean (proportion of permutations below observed correlation < 0.05).
We also used a permutation procedure to test for differences between the correlation coefficients for history shifts in starting point bias and drift bias (Figure 3C). Again, we permuted the condition labels (drug / placebo).
Data availability
Data will be made publicly available upon publication.
Code availability
Analysis scripts will be made publicly available upon publication.
Supplementary figures
Acknowledgments
Funding: Project grant of Amsterdam Brain & Cognition Priority Program; German Research Foundation (DFG; grant number Heisenberg Professorship DO 1240/3-1, and SFB 936, Projects A2/A3, A7, Z3).
Footnotes
Competing Interest Statement: The authors declare no competing interests.