Dopamine release and its control over early Pavlovian learning differs between the NAc core and medial NAc shell

Claire E. Stelly; Kasey S. Girven; Merridee J. Lefner; Kaitlyn M. Fonzi; Matthew J. Wanat

doi:10.1101/2020.07.29.227876

Abstract

Dopamine neurons respond to cues to reflect the value of associated outcomes. These cue-evoked dopamine responses can encode the relative rate of reward in rats with extensive Pavlovian training. Specifically, a cue that always follows the previous reward by a short delay (high reward rate) evokes a larger dopamine response in the nucleus accumbens (NAc) core relative to a distinct cue that always follows the prior reward by a long delay (low reward rate). However, it was unclear if these reward rate dopamine signals are evident during early Pavlovian training sessions and across NAc subregions. To address this, we performed fast-scan cyclic voltammetry recordings of dopamine levels to track the pattern of cue and reward-evoked dopamine signals in the NAc core and medial NAc shell. We identified regional differences in the progression of cue-evoked dopamine signals. However, reward rate encoding was not evident in either the NAc core or NAc shell during early training sessions. Pharmacological experiments found that dopamine-sensitive conditioned responding emerged in the NAc core before the NAc shell. Together, these findings illustrate regional differences in NAc dopamine release and its control over behavior during early Pavlovian learning.

Introduction

Learning to associate cues with rewarding outcomes is a fundamental process that underlies reward-driven behaviors. The mesolimbic dopamine system regulates behavioral responses toward reward-predictive cues [1, 2]. In particular, dopamine neurons respond to cues to encode the value of the outcome in well-trained animals. These cue-evoked dopamine responses can convey prospective reward-related information, such as reward preference, reward size, and reward probability [3–7]. Our recent findings illustrate that cue-evoked dopamine release also signals retrospective reward-related information [8]. In that study, rats were trained on a Pavlovian task in which distinct cues signaled identical outcomes but differed in the time elapsed since the previous reward delivery. The Short Wait cue always followed the previous reward by a short delay (high reward rate) while the Long Wait cue always followed the previous reward by a long delay (low reward rate). We found a larger dopamine response to the Short Wait cue in the nucleus accumbens (NAc) core of rats with extensive Pavlovian training (> 24 sessions) [8]. While these results demonstrate that dopamine encodes the relative reward rate in well-trained animals, it was unclear how these signals develop during early learning and whether they are uniformly broadcast throughout the medial NAc.

Dopamine’s role in reward learning has been primarily studied using Pavlovian tasks with a single cue-reward relationship [9, 10]. In contrast, the difference in the dopamine response between cues has been primarily studied in well-trained animals [3,4,6,7]. As such, it is unclear how dopamine signals emerge when learning multiple cue-reward relationships simultaneously. Cue-evoked dopamine release could acquire value-related information through a multi-step process or a single-step process. For example, in a multi-step process cue-evoked dopamine release first signals an upcoming reward (independent of value) and over training conveys the relative difference in value between cues. Alternatively, in a single-step process the cue-evoked dopamine response will reflect differences in reward value as these signals first emerge during training.

In the current study we performed voltammetry recordings of NAc dopamine release during early Pavlovian learning to determine if value-related dopamine signals develop in a single- or multi-step process. Rats were trained on a Pavlovian task where distinct cues were associated with different reward rates [8]. The presence of reward rate encoding by cue-evoked dopamine release during early training sessions would suggest value-related dopamine signals emerge via a single-step process. In contrast, the absence of reward rate encoding during early training sessions would indicate that value-related signals emerge via a multi-step process. We performed dopamine recordings in the NAc core and the medial NAc shell, as cue-evoked dopamine responses are present in both NAc subregions [11–13]. Furthermore, we pharmacologically inhibited dopamine receptors to determine if conditioned responding requires dopamine in the NAc core and/or medial NAc shell.

Methods

Subjects and surgery

The University of Texas at San Antonio Institutional Animal Care and Use Committee approved all procedures. Male CD IGS Sprague Dawley rats (Charles River Laboratories, RRID:RGD_734476) were pair-housed upon arrival, allowed ad libitum access to water and chow, and maintained on a 12 h light/dark cycle. Voltammetry electrodes were constructed by threading a 7 μm diameter carbon fiber through polyimide-coated silica tubing and sealing it with epoxy [14]. The sensing end of the electrode was cut to a length of ∼150 μm. Voltammetry electrodes were surgically implanted under isoflurane anesthesia in rats weighing 300 – 400 g. Electrodes were implanted bilaterally and targeted the NAc core (relative to bregma: 1.3 mm anterior; ± 1.3 mm lateral; 7.0 mm ventral) or the medial NAc shell (1.5 mm anterior; ± 0.6 mm lateral; 7.3 mm ventral). Rats were also implanted with a Ag/AgCl reference electrode that was placed under the skull at a convenient location. Bilateral stainless-steel guide cannulae (InVivo One) were implanted 1 mm dorsal to the NAc core or medial NAc shell. Following surgery, rats were single-housed for the duration of the experiment and allowed to recover for 1–3 weeks before behavioral procedures.

Behavioral procedures

At ≥ 7 days post-surgery, rats were placed on mild dietary restriction to 90% of their free feeding weight, allowing for a weekly increase of 1.5%. Rats were handled regularly before behavioral testing commenced. All behavioral sessions occurred during the light cycle in operant boxes (Med Associates) with a grid floor, a house light, a recessed food tray equipped with an infrared beam-break detector, and auditory stimulus generators (white noise and 4.5 kHz tone). To familiarize the animals with the operant chamber and food retrieval from the tray, rats first received 1-2 magazine training sessions in which 20 unsignaled food pellets (45 mg, BioServ) were delivered at a 90 ± 15 s variable interval. Rats underwent 6 Pavlovian reward conditioning sessions as described previously [8]. Pavlovian sessions consisted of 50 trials where the termination of a 5 s audio CS (tone or white noise, counterbalanced across animals) resulted in the delivery of a single food pellet and illumination of the food port light for 4.5 s. Each session contained 25 Short Wait trials and 25 Long Wait trials delivered in pseudorandom order. The Short Wait CS was presented after a 20 ± 5 s ITI, and the Long Wait CS was presented after a 70 ± 5 s ITI. We monitored head entries into the food tray across training sessions. Conditioned responding was quantified as the change in the rate of head entries during the 5 s CS relative to the 5 s preceding the CS delivery [8, 15]. We also quantified the latency to initiate a head entry during the CS.
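The behavioral measures described above can be sketched in a few lines of Python. This is a minimal illustration, not the analysis code used in the study: it assumes head entries are available as beam-break onset timestamps in seconds, that windows are half-open (start included, end excluded), and that trials without a head entry during the CS return no latency. The function names and the example timestamps are hypothetical.

```python
from bisect import bisect_left


def count_in_window(sorted_times, start, end):
    """Count events with start <= t < end in a sorted list of timestamps (s)."""
    return bisect_left(sorted_times, end) - bisect_left(sorted_times, start)


def conditioned_responding(head_entry_times, cs_onset, cs_duration=5.0):
    """Change in head-entry rate (entries/s) during the CS relative to the
    equally long window immediately preceding CS onset."""
    entries = sorted(head_entry_times)
    cs_count = count_in_window(entries, cs_onset, cs_onset + cs_duration)
    pre_count = count_in_window(entries, cs_onset - cs_duration, cs_onset)
    return (cs_count - pre_count) / cs_duration


def latency_to_first_entry(head_entry_times, cs_onset, cs_duration=5.0):
    """Latency (s) from CS onset to the first head entry during the CS.
    Returns None when no entry occurs (an assumption of this sketch)."""
    cs_end = cs_onset + cs_duration
    in_cs = [t - cs_onset for t in sorted(head_entry_times) if cs_onset <= t < cs_end]
    return in_cs[0] if in_cs else None


# Hypothetical head-entry timestamps (s) around a CS that starts at 100 s.
entries = [96.2, 101.0, 102.5, 103.1, 104.8]
cr = conditioned_responding(entries, cs_onset=100.0)   # 4 entries in CS, 1 before
lat = latency_to_first_entry(entries, cs_onset=100.0)  # first entry at 101.0 s
```

A positive value indicates more head entries during the CS than during the preceding 5 s; a value near zero indicates no cue-driven change in responding.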

Pharmacology

Flupenthixol dihydrochloride (Tocris) was dissolved in sterile 0.9% NaCl. Rats received bilateral 0.5 μl microinjections of flupenthixol (10 μg/side) or vehicle into the nucleus accumbens core or shell at 0.25 μl/min. The injectors were removed 1 min after the infusion ended. Behavioral sessions commenced 30 min after the microinjections [15, 16].

Voltammetry recordings and analysis

Indwelling carbon fiber microelectrodes were connected to a head-mounted amplifier to monitor dopamine release in behaving rats using fast-scan cyclic voltammetry [8,14,15,17–19]. During voltammetric scans, the potential applied to the carbon fiber was ramped in a triangular waveform from −0.4 V (vs. Ag/AgCl) to +1.3 V and back at a rate of 400 V/s. Scans occurred at 10 Hz with the electrode potential held at −0.4 V between scans. Dopamine was chemically verified by obtaining high correlation of the cyclic voltammogram during a reward-related event with that of a dopamine standard (correlation coefficient r² ≥ 0.75 by linear regression). Voltammetry data for a session were excluded from analysis if the detected voltammetry signal did not satisfy the chemical verification criteria [8,15,17]. Dopamine was isolated from the voltammetry signal using chemometric analysis [20] with a standard training set accounting for dopamine, pH and drift. The background for voltammetry recording analysis was set at 0.5 s before the CS onset. Trials were excluded if chemometric analysis failed to identify dopamine on > 25% of the data points. The change in dopamine concentration was estimated based on the average post-implantation electrode sensitivity (34 nA/μM)[14].
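The recording parameters above imply some simple arithmetic that can be checked with a short sketch. The constants below are taken from the text; the function names are hypothetical, and the r² helper is a plain Pearson correlation squared, which is only one way to implement the stated verification criterion and is not claimed to be the authors' routine.

```python
V_HOLD = -0.4       # holding potential (V vs. Ag/AgCl), from the text
V_PEAK = 1.3        # switching potential (V), from the text
SCAN_RATE = 400.0   # V/s, from the text
SCAN_FREQ = 10.0    # scans per second, from the text
SENSITIVITY = 34.0  # nA per uM, average post-implantation value from the text


def scan_duration_s():
    """Duration of one triangular sweep (up and back down), in seconds."""
    return 2.0 * (V_PEAK - V_HOLD) / SCAN_RATE


def waveform_potential(t):
    """Applied potential (V) at time t (s), with a scan starting at t = 0."""
    period = 1.0 / SCAN_FREQ
    t_in = t % period
    half = scan_duration_s() / 2.0
    if t_in < half:                      # anodic ramp
        return V_HOLD + SCAN_RATE * t_in
    if t_in < 2.0 * half:                # cathodic ramp
        return V_PEAK - SCAN_RATE * (t_in - half)
    return V_HOLD                        # held between scans


def current_to_nM(current_nA):
    """Convert an oxidation current (nA) to an estimated concentration (nM)."""
    return current_nA / SENSITIVITY * 1000.0


def r_squared(x, y):
    """Squared Pearson correlation between two equally long sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy * sxy / (sxx * syy)


def passes_verification(voltammogram, standard, threshold=0.75):
    """True when the voltammogram correlates with the standard at r^2 >= threshold."""
    return r_squared(voltammogram, standard) >= threshold
```

Under these parameters each sweep lasts 8.5 ms, so the electrode sits at the holding potential for most of each 100 ms cycle, and a 3.4 nA change in current corresponds to an estimated change of about 100 nM.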

All quantification of dopamine responses was performed on signals that had been smoothed via a 3-point moving average. CS-evoked dopamine release was quantified as the mean dopamine level during the 5 s CS relative to the 5 s prior to the CS delivery [8, 15]. The slope of the CS response was calculated by taking the difference in the mean dopamine levels during the peak (1.5 – 2 s) and the end of the CS (4.5 – 5 s). The change in dopamine release to the US was quantified as the mean dopamine response during the 2 s following the pellet delivery relative to the mean dopamine response during the 0.5 s preceding the pellet delivery.
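The three response measures defined above can be illustrated with a short sketch. It assumes a single-trial dopamine trace sampled at 10 Hz with time expressed relative to CS onset, half-open analysis windows, a centered moving average that leaves the two endpoints unsmoothed, and a US delivered at CS offset (5 s). None of these implementation details are specified in the text, and the function names and the synthetic trace are hypothetical.

```python
def moving_average_3(values):
    """Centered 3-point moving average; endpoints are left unsmoothed."""
    smoothed = list(values)
    for i in range(1, len(values) - 1):
        smoothed[i] = (values[i - 1] + values[i] + values[i + 1]) / 3.0
    return smoothed


def mean_in_window(times, values, start, end):
    """Mean of values whose timestamps satisfy start <= t < end."""
    selected = [v for t, v in zip(times, values) if start <= t < end]
    return sum(selected) / len(selected)


def cs_response(times, da):
    """Mean dopamine during the 5 s CS minus the mean during the 5 s before it."""
    return mean_in_window(times, da, 0.0, 5.0) - mean_in_window(times, da, -5.0, 0.0)


def cs_slope(times, da):
    """Mean dopamine at the peak (1.5-2 s) minus the end of the CS (4.5-5 s)."""
    return mean_in_window(times, da, 1.5, 2.0) - mean_in_window(times, da, 4.5, 5.0)


def us_response(times, da, us_time=5.0):
    """Mean dopamine in the 2 s after the pellet minus the 0.5 s before it."""
    after = mean_in_window(times, da, us_time, us_time + 2.0)
    before = mean_in_window(times, da, us_time - 0.5, us_time)
    return after - before


# Synthetic piecewise-constant trace (nM), 10 Hz, from -5 s to 6.9 s around CS onset.
times = [(i - 50) / 10 for i in range(120)]
trace = [0.0 if t < 0 else 20.0 if t < 2 else 10.0 if t < 5 else 30.0 for t in times]

cs_amp = cs_response(times, trace)
slope = cs_slope(times, trace)
us_amp = us_response(times, trace)
```

With this sign convention a larger positive slope value means a stronger decay of the dopamine signal from its peak to the end of the CS. In practice `moving_average_3` would be applied to the trace before the three measures are computed; the synthetic trace is left unsmoothed here so the values are easy to verify by hand.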

Experimental design and statistical analysis

We performed all statistical analyses in GraphPad Prism 8. All data are plotted as mean ± SEM. A mixed-effects model fit (restricted maximum likelihood method) was used to analyze effects on behavioral measures and dopamine responses. Data were analyzed in 5 trial bins for within-session analyses or averaged within session for full training analyses. The significance level was set to α = 0.05 for all tests. The full list of statistical analyses is presented in Supplementary Table 1.
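The binning and summary steps can be sketched as follows. This covers only the 5-trial binning and the mean ± SEM summary, not the mixed-effects model itself, which was fit in Prism. The function names are hypothetical, and the SEM uses the sample standard deviation (n − 1 denominator), which is an assumption.

```python
from math import sqrt
from statistics import mean, stdev


def bin_trials(values, bin_size=5):
    """Average consecutive trials into bins of bin_size trials each."""
    return [
        sum(values[i:i + bin_size]) / len(values[i:i + bin_size])
        for i in range(0, len(values), bin_size)
    ]


def mean_sem(values):
    """Return (mean, standard error of the mean) across subjects."""
    return mean(values), stdev(values) / sqrt(len(values))


# 25 trials of one trial type in one session give 5 bins per session.
session_bins = bin_trials(list(range(25)))

# 6 sessions x 25 trials per trial type give 30 bins per trial type.
n_bins_all_sessions = len(bin_trials([0.0] * 150))
```

Thirty bins per trial type across six sessions would give 29 degrees of freedom for a trial effect, which appears to match the numerator degrees of freedom reported for the behavioral analysis in the Results.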

Histology

Rats were anesthetized, electrically lesioned via the voltammetry electrodes, and perfused intracardially with 4% paraformaldehyde. Brains were extracted and post-fixed in the paraformaldehyde solution for a minimum of 24 h, then were transferred to 15% and 30% sucrose in phosphate-buffered saline. Tissue was cryosectioned and stained with cresyl violet. Implant locations were mapped to a standardized rat brain atlas [21].

Results

Dopamine in the NAc does not encode reward rate in early training sessions

Rats were trained on a Pavlovian delay conditioning task in which 5 s audio conditioned stimuli (CSs) signaled the delivery of a food reward (US). This task involved two trial types with distinct CSs. Both CSs resulted in the identical outcome (a single food pellet) but differed in the time elapsed since the previous reward (Fig. 1A). In Short Wait trials, the CS was presented 15 – 25 s following the previous reward delivery (high reward rate). In Long Wait trials, the CS was presented 65 – 75 s following the previous reward delivery (low reward rate). Training sessions consisted of 25 Short Wait trials and 25 Long Wait trials presented in a pseudorandom pattern so that the identity of the upcoming trial could not be predicted. Rats with extensive training on this task (> 24 sessions) exhibit a larger NAc core dopamine response to the Short Wait CS relative to the Long Wait CS [8]. While dopamine encodes reward rate in well-trained rats, it is unclear when this signal first emerges. Furthermore, it is unknown whether dopamine signaling uniformly reflects reward rate throughout the medial NAc.
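The trial structure described above can be made concrete with a small schedule generator. This is a sketch under stated assumptions: delays are drawn uniformly from the 15 – 25 s and 65 – 75 s ranges, the trial order is a plain shuffle (the constraints of the pseudorandom ordering actually used are not given in the text), and the reward is delivered at CS offset. The function name and the tuple layout are hypothetical.

```python
import random

CS_DURATION = 5.0  # s, from the text
DELAY_RANGE = {"short": (15.0, 25.0), "long": (65.0, 75.0)}  # s since previous reward


def make_session(n_per_type=25, seed=0):
    """Build one session as a list of (trial_type, delay, cs_onset, reward_time)."""
    rng = random.Random(seed)
    trial_types = ["short"] * n_per_type + ["long"] * n_per_type
    rng.shuffle(trial_types)  # plain shuffle; an assumption of this sketch

    schedule = []
    previous_reward = 0.0  # session start stands in for the first "previous reward"
    for trial_type in trial_types:
        low, high = DELAY_RANGE[trial_type]
        delay = rng.uniform(low, high)
        cs_onset = previous_reward + delay
        reward_time = cs_onset + CS_DURATION
        schedule.append((trial_type, delay, cs_onset, reward_time))
        previous_reward = reward_time
    return schedule


session = make_session()
session_length_min = session[-1][3] / 60.0
```

Because both trial types end in the same single pellet, the only difference between them is the delay since the previous reward. Under these assumptions the expected session length is 25 × (20 + 5) + 25 × (70 + 5) = 2500 s, or roughly 42 min.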

Figure 1. Dopamine release in the NAc core during early training sessions.

(A) Pavlovian task. Short and Long Wait trials were presented in a pseudorandom pattern. (B) Conditioned responding across sessions in 5 trial bins. Conditioned responding was quantified as the change in the rate of head entries during the 5 s CS relative to the rate of head entries during the 5 s preceding the CS. (C) Representative two-dimensional pseudocolor plots of the resulting current from voltage sweeps (y-axis) as a function of time (x-axis) of voltammetry recordings in the NAc core. (D) Location of voltammetry electrodes. (E) Average dopamine signals across training sessions. (F) Dopamine response to the CS across sessions in 5 trial bins. (G) Dopamine response to the US across sessions in 5 trial bins. (H) Relationship between CS-evoked dopamine release and conditioned responding. (I) Relationship between US-evoked dopamine release and conditioned responding.

To address these questions, we performed voltammetry recordings of dopamine levels in the NAc core and medial NAc shell during the first 6 Pavlovian training sessions. Conditioned responding in this task was quantified as the change in the rate of head entries during the 5 s CS relative to the rate of head entries during the 5 s preceding the CS [8]. Rats significantly increased conditioned responding to both CSs over all conditioning trials (two-way mixed effects analysis: trial effect F_{(29, 493)}=14.1, p<0.0001, n = 18 rats; Fig. 1B and Supplementary Fig. 1). The magnitude of conditioned responding did not differ between the Short and Long Wait CSs (reward rate effect F_{(1, 17)}=0.281, p=0.603), consistent with prior work [8]. Additionally, there was no difference in the latency to initiate a head entry between the trial types (Supplementary Fig. 2). We first assessed dopamine signals from electrodes in the NAc core (n = 10 electrodes; Fig. 1C). Short and Long Wait CS presentation and US delivery evoked time-locked phasic dopamine responses (Fig. 1D-E). The CS response increased over conditioning trials but did not differ between trial types (two-way mixed effects analysis: trial effect F_{(5.0, 44.9)}=5.54, p=0.0005; reward rate effect F_{(1.0, 9.0)}=1.71, p=0.223; Fig. 1F). The US response decreased over all conditioning trials but likewise did not differ between trial types (trial effect F_{(3.6, 32.0)}=10.5, p<0.0001; reward rate effect F_{(1.0, 9.0)}=0.208, p=0.659; Fig. 1G). While the overall pattern of CS- and US-evoked dopamine release is consistent with prior studies [10, 15], we found no evidence of reward rate encoding during early training sessions in the NAc core.

While there was no difference in NAc core dopamine release between trial types, we next examined how these dopamine signals related to behavioral outcomes. Conditioned responding was not correlated with CS-evoked dopamine release (Fig. 1H). Rather, the increase in conditioned responding across sessions was related to the decrease in US-evoked dopamine release in the NAc core (Fig. 1I). These results are consistent with research linking conditioned responding to reward-evoked dopamine signaling [15,22,23].

We next examined dopamine signals from electrodes in the medial NAc shell (n = 14 electrodes; Fig. 2A). Similar to the NAc core, the CS and US evoked time-locked phasic dopamine responses (Fig. 2B). The CS dopamine response increased over conditioning trials but did not differ between trial types (two-way mixed effects analysis: trial effect F_{(5.0, 65.1)}=4.93, p=0.0007; reward rate effect F_{(1.0, 13.0)}=0.000192, p=0.989; Fig. 2C). The US dopamine response decreased over all conditioning trials but did not differ between trial types (trial effect F_{(3.8, 49.0)}=9.83, p<0.0001; reward rate effect F_{(1.0, 13.0)}=2.33, p=0.151; Fig. 2D). Conditioned responding was correlated with dopamine release to the US, but not with dopamine release to the CS (Figs. 2E-F). These results demonstrate that reward rate is not encoded by CS-evoked dopamine signals in either the NAc core or the medial NAc shell in early training sessions. In contrast, the CS-evoked dopamine response encodes the reward rate in rats with extensive Pavlovian training [8]. Collectively, these data indicate that value-related dopamine signals to reward-predictive cues emerge via a multi-step process.

Figure 2. Dopamine release in the medial NAc shell during early training sessions.

(A) Location of voltammetry electrodes. (B) Average dopamine signals across training sessions. (C) Dopamine response to the CS across sessions in 5 trial bins. (D) Dopamine response to the US across sessions in 5 trial bins. (E) Relationship between CS-evoked dopamine release and conditioned responding. (F) Relationship between US-evoked dopamine release and conditioned responding.

Transient changes in the slope of the CS-evoked dopamine response

While there were no gross differences in the amplitude of CS-evoked dopamine release between Short and Long Wait trial types, the temporal dynamics of the response varied between trial types and across training sessions in the NAc core (Fig. 1E). To quantify these dynamics, we calculated the difference in dopamine levels between the time of the average peak CS response and the end of the CS presentation (Fig. 3A). In the NAc core, the slope of the CS response was significantly decreased in Long Wait trials in sessions 3, 4, and 5 (two-way mixed effects analysis: reward rate effect in session 3 F_{(1.0, 9.0)}=8.31, p=0.018; session 4 F_{(1.0, 8.0)}=11.0, p=0.0086; session 5 F_{(1.0, 9.0)}=6.80, p=0.028; Fig. 3B). This transient effect was no longer observed by session 6 (reward rate effect: F_{(1.0, 9.0)}=0.131, p=0.725). The difference in the slope of the CS dopamine response between trial types was not accompanied by a corresponding difference in conditioned responding or the latency to respond for either trial type (Supplementary Fig. 3). In contrast to the NAc core, the slope of the CS-evoked dopamine response did not differ between Short and Long Wait trial types in the medial NAc shell (reward rate effect for all trials: F_{(1.0, 13.0)}=0.00984, p=0.923; Fig. 3C). Together, these results highlight trial-type and region-specific changes in the dynamics of the CS-evoked dopamine response during early Pavlovian training sessions.

Figure 3. Differential dynamics of the CS dopamine response between trial types in the NAc core.

(A) Slope of the CS dopamine response is calculated as the difference between the regions of interest denoted by the teal overlay. (B) Slope of the CS dopamine response in the NAc core. (C) Slope of the CS dopamine response in the medial NAc shell. * p < 0.05, ** p < 0.01 main effect of trial type.

Regional differences in CS-evoked dopamine release

Prior studies have noted differences in dopamine release between the NAc core and NAc shell in reward-based tasks [11–13,24,25]. As such, we directly compared CS- and US-evoked dopamine release between the NAc core and medial NAc shell during early Pavlovian training sessions. This analysis identified a significant interaction of brain region and training session on the CS-evoked dopamine response (three-way mixed effects analysis: region effect F_{(1, 84)}=2.075, p=0.154; session effect F_{(2.0, 43.3)}=12.15, p<0.0001; region x session F_{(5, 84)}=2.93, p=0.0174; reward rate effect F_{(1.0, 22.0)}=0.989, p=0.331; Fig. 4A). This interaction effect was driven by a smaller dopamine response in the NAc shell during later training sessions. In contrast, US-evoked dopamine release did not differ between the NAc subregions (three-way mixed effects analysis: region effect F_{(1, 84)}=0.00396, p=0.950; session effect F_{(1.60, 35.2)}=28.3, p<0.0001; reward rate effect F_{(1.0, 22.0)}=1.41, p=0.248; Fig. 4B). Therefore, the trajectories of CS-evoked dopamine release in the NAc core and medial NAc shell diverge over the course of associative learning.

Figure 4. Comparing the dopamine response between the NAc core and medial NAc shell.

(A) CS dopamine response. (B) US dopamine response. * p < 0.05 interaction effect of session x brain region.

Emergence of dopamine-sensitive conditioned responding in the NAc core and medial NAc shell

NAc dopamine signaling is necessary for conditioned behavioral responses to rewarding cues [16, 26]. However, the differential development of the CS-evoked dopamine responses between the NAc core and medial NAc shell suggests that these regions may not contribute equally to behavior throughout training. To address this, rats were implanted with bilateral cannulae targeting the NAc core or the medial NAc shell for local pharmacological manipulations (Fig. 5A). The D1/D2 dopamine receptor antagonist flupenthixol (10 μg/side) or vehicle was infused 30 min before the first 5 sessions. Rats were trained without microinjections for an additional session to differentiate acute versus sustained behavioral effects of the drug treatment (Fig. 5B).

Figure 5. Emergence of dopamine-dependent conditioned responding in the NAc core and medial NAc shell.

(A) Training paradigm. Flupenthixol or vehicle was infused into the NAc core or NAc shell before the first five training sessions. Training continued for an additional session without injections. (B) Location of the injector tips and infusion area. (C) Conditioned responding in rats receiving injections into the NAc core. (D) Conditioned responding in rats receiving injections into the NAc shell. Note that the data are plotted separately by trial type for visual clarity. * p < 0.05, ** p < 0.01 main effect of drug injection.

Flupenthixol microinjections in the NAc core significantly disrupted conditioned responding (drug effect F_{(1, 456)}=6.85, p=0.00910; reward rate effect F_{(1.0, 19.0)}=1.85, p=0.190; drug x reward rate interaction F_{(1, 456)}=0.666, p=0.796; n = 10 vehicle, 11 flupenthixol; note that data are plotted separately by trial type for visual clarity in Fig. 5C). Within-session analyses identified lower levels of conditioned responding in flupenthixol-treated rats during the third and fifth training sessions (three-way mixed effects analysis: drug effect session 1 F_{(1, 76)}=1.97, p=0.165; session 2 F_{(1, 76)}=0.878, p=0.352; session 3 F_{(1, 76)}=4.16, p=0.0449; session 4 F_{(1, 76)}=3.95, p=0.0503; session 5 F_{(1, 76)}=10.3, p=0.00190). Furthermore, impairments in conditioned responding following flupenthixol treatment persisted during the sixth session in which no drug was administered (F_{(1, 76)}=11.5, p=0.00110). Flupenthixol application in the NAc core selectively reduced the number of CS-evoked head entries without altering head entries during the ITI or the latency to approach the food tray (Supplementary Fig. 4). Disruption of NAc core dopamine transmission therefore selectively impairs cue-driven appetitive behavior without altering motor function or response initiation.

In rats with cannulae in the medial NAc shell, conditioned responding was not acutely affected by flupenthixol treatment during sessions 1-5 (drug effect F_{(1, 408)}=1.16, p=0.283; reward rate effect F_{(1.0, 17.0)}=0.190, p=0.669; drug x reward rate interaction F_{(1, 408)}=0.0211, p=0.885; n = 10 vehicle, 9 flupenthixol; note that data are plotted separately by trial type for visual clarity in Fig. 5D). However, a behavioral deficit was evident on the sixth session in which no injection was given (three-way mixed effects analysis: F_{(1, 68)}=5.06, p=0.0277). Flupenthixol treatment in the NAc shell caused a subsequent non-significant reduction of cue-evoked head entries and significantly increased latency to approach the food tray during the sixth training session (Supplementary Fig. 5). This effect on response latency suggests that dopamine transmission in the NAc shell regulates the speed of response initiation in addition to the magnitude of the conditioned response. Collectively, these results demonstrate that dopamine signals in the NAc core and medial shell contribute to Pavlovian appetitive behavior at distinct phases of training.

Discussion

Dopamine neurons respond to cues to convey reward-related information in Pavlovian tasks [3,7,8]. In particular, cue-evoked dopamine release in the NAc core signals the relative reward rate in rats with extensive Pavlovian training (>24 sessions) [8]. However, it was not known if reward rate dopamine signals rapidly emerged during early learning. To address this, we recorded dopamine release during the first 6 Pavlovian training sessions. Our data demonstrate that the reward rate is not reflected in cue-evoked dopamine responses in the NAc core or the medial NAc shell during early training. Coupled with our prior work, these findings suggest that NAc dopamine encodes value-related information via a sequential, multi-step process [8]. Specifically, cue-evoked dopamine release initially signals that a reward is forthcoming. Over training, these cue-evoked dopamine responses then come to signal the relative value of the reward.

The acquisition of conditioned responding depends on the temporal relationship between the cue, reward, and inter-trial interval (ITI). In particular, the ratio of the ITI duration relative to the cue duration can impact the rate of learning [27]. For example, an increase in the ITI (i.e. lower reward rate) facilitates acquisition in tasks involving a single cue-reward association [27–30]. However, we found no difference in the acquisition of conditioned responding between Short and Long Wait trials in our task. One potential explanation for this apparent discrepancy is that the difference in the ITI/cue ratio is not sufficiently large to observe differences in learning rates between the trial types. Alternatively, if rats cannot distinguish the difference in the ITI/cue ratios between the trial types in early training sessions, one would not anticipate a difference in the learning rate between Short and Long Wait trials.

Dopamine neurons projecting to the striatum are genetically and functionally diverse [22,31,32]. While rewards and reward-predictive cues elicit dopamine release in the NAc core and medial NAc shell, regional differences in the dynamics of the dopamine response are evident in some tasks [11–13,24,25]. Our data demonstrate that dopamine release to cues initially increased in both regions, though there was a selective attenuation in the cue-evoked dopamine response in the NAc shell during later sessions. In contrast, there were no regional differences in the dopamine response to the reward delivery across sessions. Our results demonstrate that the increase in conditioned responding across sessions was related to the decrease in reward-evoked dopamine release in both the NAc core and medial NAc shell. These findings are consistent with research linking conditioned responding to reward-evoked dopamine signaling [15,22,23]. Furthermore, the selective relationship between conditioned responding and reward-evoked dopamine release (and not cue-evoked dopamine release) agrees with recent research demonstrating that the dopamine responses to cues and rewards evolve independently of one another during early learning [33].

Although there was no difference in the average dopamine response between Long and Short Wait cues, we identified transient differences in the dynamics of the dopamine response between trial types in the NAc core. Specifically, the decay of the cue-evoked dopamine response was more pronounced for Long Wait trials relative to Short Wait trials during sessions 3-5, though this effect was absent in session 6. The dynamics of the cue-evoked dopamine response could be controlled by dopamine neuron firing patterns and/or regulation of dopamine release at the terminals [34–36]. We speculate that the slope of the dopamine response during the cue could be controlled by cholinergic signaling within the NAc. Striatal cholinergic neurons exhibit a reduction in firing to reward-predictive cues [37, 38]. A decrease in cholinergic signaling can facilitate dopamine release evoked by high frequency stimulations [39, 40]. The transient difference in the slope of the CS dopamine response between trial types could therefore arise from differences in striatal cholinergic signaling. Regardless, future studies will be needed to determine how striatal cholinergic neurons regulate the dynamics of cue-evoked dopamine release during early Pavlovian learning.

Conditioned responding can be modulated by the dopamine response to cues and rewards, though this can depend on task parameters and prior training [15,22,23,41]. We previously found that conditioned responding updates with cue-specific changes in dopamine release in well-trained animals. Rats trained to experience Short and Long Wait trials in separate sessions exhibited a selective elevation in dopamine release and conditioned responding to the Short Wait cue upon experiencing both trials together for the first time [8]. Based on these findings, we anticipated that the emergence of cue-evoked dopamine release would parallel the emergence of dopamine-sensitive conditioned responding during early training sessions. Our voltammetry recordings and dopamine receptor antagonist experiments instead illustrate that the increase in cue-evoked dopamine release precedes the emergence of dopamine-mediated conditioned responding. In the NAc core, flupenthixol treatment reduced conditioned responding starting in the third training session. These impairments persisted during the sixth training session with no drug treatment, which demonstrates that NAc core dopamine is involved with Pavlovian learning, as reported previously [10, 16]. In contrast, flupenthixol injections into the medial NAc shell failed to alter conditioned responding during the first five training sessions. However, this prior treatment with flupenthixol impaired conditioned responding during the sixth training session with no drug treatment. These results highlight a potential role for NAc shell dopamine in consolidation. In support, local injections of amphetamine into the NAc shell after the Pavlovian training sessions elevated conditioned responses [42]. Future studies will be needed to identify the specific temporal window during the trial or after the session when dopamine signaling contributes to conditioned responding. Collectively, our results highlight region-specific critical periods during training when dopamine signaling regulates conditioned responding.

Dopamine is thought to mediate distinct functions in the NAc core and NAc shell, with NAc core dopamine primarily involved with reward learning and NAc shell dopamine regulating learned behavioral actions [10,12,16,22]. However, it is important to note that these general roles may not be applicable to all behavioral tasks [43]. Indeed, our results highlight that dopamine in both the NAc core and medial NAc shell contributes to Pavlovian learning when cues convey distinct reward rates, albeit at different points during training. Future studies are needed to determine if dopamine in the NAc core and shell is similarly required for Pavlovian learning when cues signal differences in other reward-related parameters, such as reward size or probability. Research on dopamine’s role in behavior has largely focused on the ‘what’ and the ‘where’: what task elements increase dopamine release and where in the brain dopamine is released. Our results collectively highlight that it is also important to consider ‘when’ during training dopamine is capable of regulating behavioral actions.

Funding & Disclosure

This work was supported by National Institutes of Health grants DA033386 and DA042362 to M.J.W. The authors declare no competing interests.

Author Contributions

CES, KSG, and MJW designed the experiments. CES, KSG, MJL, KMF, and MJW performed the experiments and analyzed the data. CES and MJW wrote the manuscript.

References

1. Phillips PE, Walton ME, Jhou TC. Calculating utility: preclinical evidence for cost-benefit analysis by mesolimbic dopamine. Psychopharmacology (Berl). 2007;191(3):483–95.
2. Salamone JD, Correa M. The mysterious motivational functions of mesolimbic dopamine. Neuron. 2012;76(3):470–85.
3. Fiorillo CD, Tobler PN, Schultz W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science. 2003;299(5614):1898–902.
4. Gan JO, Walton ME, Phillips PE. Dissociable cost and benefit encoding of future rewards by mesolimbic dopamine. Nat Neurosci. 2010;13(1):25–7.
5. Lak A, Stauffer WR, Schultz W. Dopamine prediction error responses integrate subjective value from different reward dimensions. Proc Natl Acad Sci U S A. 2014;111(6):2343–8.
6. Roesch MR, Calu DJ, Schoenbaum G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nat Neurosci. 2007;10(12):1615–24.
7. Tobler PN, Fiorillo CD, Schultz W. Adaptive coding of reward value by dopamine neurons. Science. 2005;307(5715):1642–5.
8. Fonzi KM, Lefner MJ, Phillips PEM, Wanat MJ. Dopamine Encodes Retrospective Temporal Information in a Context-Independent Manner. Cell Rep. 2017;20(8):1765–74.
9. Day JJ, Roitman MF, Wightman RM, Carelli RM. Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nat Neurosci. 2007;10(8):1020–8.
10. Flagel SB, Clark JJ, Robinson TE, Mayo L, Czuj A, Willuhn I, et al. A selective role for dopamine in stimulus-reward learning. Nature. 2011;469(7328):53–7.
11. Cacciapaglia F, Saddoris MP, Wightman RM, Carelli RM. Differential dopamine release dynamics in the nucleus accumbens core and shell track distinct aspects of goal-directed behavior for sucrose. Neuropharmacology. 2012;62(5-6):2050–6.
12. Saddoris MP, Cacciapaglia F, Wightman RM, Carelli RM. Differential Dopamine Release Dynamics in the Nucleus Accumbens Core and Shell Reveal Complementary Signals for Error Prediction and Incentive Motivation. J Neurosci. 2015;35(33):11572–82.
13. Wanat MJ, Kuhnen CM, Phillips PE. Delays conferred by escalating costs modulate dopamine release to rewards but not their predictors. J Neurosci. 2010;30(36):12020–7.
14. Clark JJ, Sandberg SG, Wanat MJ, Gan JO, Horne EA, Hart AS, et al. Chronic microsensors for longitudinal, subsecond dopamine detection in behaving animals. Nat Methods. 2010;7(2):126–9.
15. Stelly CE, Tritley SC, Rafati Y, Wanat MJ. Acute Stress Enhances Associative Learning via Dopamine Signaling in the Ventral Lateral Striatum. J Neurosci. 2020;40(22):4391–400.
16. Saunders BT, Robinson TE. The role of dopamine in the accumbens core in the expression of Pavlovian-conditioned responses. Eur J Neurosci. 2012;36(4):2521–32.
17. Stelly CE, Haug GC, Fonzi KM, Garcia MA, Tritley SC, Magnon AP, et al. Pattern of dopamine signaling during aversive events predicts active avoidance learning. Proc Natl Acad Sci U S A. 2019;116(27):13641–50.
18. Oliva I, Donate MM, Lefner MJ, Wanat MJ. Cocaine experience abolishes the motivation suppressing effect of CRF in the ventral midbrain. Addict Biol. 2019:e12837.
19. Oliva I, Wanat MJ. Operant Costs Modulate Dopamine Release to Self-Administered Cocaine. J Neurosci. 2019;39(7):1249–60.
20. Heien ML, Khan AS, Ariansen JL, Cheer JF, Phillips PE, Wassum KM, et al. Real-time measurement of dopamine fluctuations after cocaine in the brain of behaving rats. Proc Natl Acad Sci U S A. 2005;102(29):10023–8.
21. Paxinos G, Watson C. The Rat Brain in Stereotaxic Coordinates: The New Coronal Set. Elsevier Science & Technology: London; 2004.
22. Heymann G, Jo YS, Reichard KL, McFarland N, Chavkin C, Palmiter RD, et al. Synergy of Distinct Dopamine Projection Populations in Behavioral Reinforcement. Neuron. 2020;105(5):909–20 e5.
23. Lee K, Claar LD, Hachisuka A, Bakhurin KI, Nguyen J, Trott JM, et al. Temporally restricted dopaminergic control of reward-conditioned movements. Nat Neurosci. 2020;23(2):209–16.
24. Ko D, Wanat MJ. Phasic Dopamine Transmission Reflects Initiation Vigor and Exerted Effort in an Action- and Region-Specific Manner. J Neurosci. 2016;36(7):2202–11.
25. Saddoris MP, Wang X, Sugam JA, Carelli RM. Cocaine Self-Administration Experience Induces Pathological Phasic Accumbens Dopamine Signals and Abnormal Incentive Behaviors in Drug-Abstinent Rats. J Neurosci. 2016;36(1):235–50.
26. Darvas M, Wunsch AM, Gibbs JT, Palmiter RD. Dopamine dependency for acquisition and performance of Pavlovian conditioned response. Proc Natl Acad Sci U S A. 2014;111(7):2764–9.
27. Gallistel CR, Gibbon J. Time, rate, and conditioning. Psychol Rev. 2000;107(2):289–344.
28. Gibbon J, Baldock MD, Locurto C, Gold L, Terrace HS. Trial and intertrial durations in autoshaping. Journal of Experimental Psychology: Animal Behavior Processes. 1977;3(3):264–84.
29. Holland PC. Intertrial interval effects in pavlovian serial feature positive discriminations. Animal Learning & Behavior. 1999;27(2):127–39.
30. Terrace HS, Gibbon J, Farrell L, Baldock MD. Temporal factors influencing the acquisition and maintenance of an autoshaped keypeck. Animal Learning & Behavior. 1975;3(1):53–62.
31. de Jong JW, Afjei SA, Pollak Dorocic I, Peck JR, Liu C, Kim CK, et al. A Neural Circuit Mechanism for Encoding Aversive Stimuli in the Mesolimbic Dopamine System. Neuron. 2019;101(1):133–51 e7.
32. Morales M, Margolis EB. Ventral tegmental area: cellular heterogeneity, connectivity and behaviour. Nat Rev Neurosci. 2017;18(2):73–85.
33. Coddington LT, Dudman JT. The timing of action determines reward prediction signals in identified midbrain dopamine neurons. Nat Neurosci. 2018;21(11):1563–73.
34. Berke JD. What does dopamine mean? Nat Neurosci. 2018;21(6):787–93.
35. Collins AL, Saunders BT. Heterogeneity in striatal dopamine circuits: Form and function in dynamic reward seeking. J Neurosci Res. 2020;98(6):1046–69.
36. Mohebi A, Pettibone JR, Hamid AA, Wong JT, Vinson LT, Patriarchi T, et al. Dissociable dopamine dynamics for learning and motivation. Nature. 2019;570(7759):65–70.
37. Apicella P. The role of the intrinsic cholinergic system of the striatum: What have we learned from TAN recordings in behaving animals? Neuroscience. 2017;360:81–94.
38. Ravel S, Sardo P, Legallet E, Apicella P. Reward unpredictability inside and outside of a task context as a determinant of the responses of tonically active neurons in the monkey striatum. J Neurosci. 2001;21(15):5730–9.
39. Cragg SJ. Meaningful silences: how dopamine listens to the ACh pause. Trends Neurosci. 2006;29(3):125–31.
40. Rice ME, Cragg SJ. Nicotine amplifies reward-related dopamine signals in striatum. Nat Neurosci. 2004;7(6):583–4.
41. Morrens J, Aydin C, Janse van Rensburg A, Esquivelzeta Rabell J, Haesler S. Cue-Evoked Dopamine Promotes Conditioned Responding during Learning. Neuron. 2020;106(1):142–53 e7.
42. Phillips GD, Setzu E, Hitchcott PK. Facilitation of appetitive pavlovian conditioning by d-amphetamine in the shell, but not the core, of the nucleus accumbens. Behav Neurosci. 2003;117(4):675–84.
43. Saddoris MP, Sugam JA, Cacciapaglia F, Carelli RM. Rapid dopamine dynamics in the accumbens core and shell: learning and action. Front Biosci (Elite Ed). 2013;5:273–88.

Posted October 20, 2020.

Subject Area

Neuroscience

[1] 1.↵
Phillips PE, Walton ME, Jhou TC. Calculating utility: preclinical evidence for cost-benefit analysis by mesolimbic dopamine. Psychopharmacology (Berl). 2007;191(3):483–95.
OpenUrl CrossRef PubMed

[2] 2.↵
Salamone JD, Correa M. The mysterious motivational functions of mesolimbic dopamine. Neuron. 2012;76(3):470–85.
OpenUrl CrossRef PubMed Web of Science

[3] 3.↵
Fiorillo CD, Tobler PN, Schultz W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science. 2003;299(5614):1898–902.
OpenUrl Abstract/FREE Full Text

[4] 4.↵
Gan JO, Walton ME, Phillips PE. Dissociable cost and benefit encoding of future rewards by mesolimbic dopamine. Nat Neurosci. 2010;13(1):25–7.
OpenUrl CrossRef PubMed Web of Science

[5] 5.
Lak A, Stauffer WR, Schultz W. Dopamine prediction error responses integrate subjective value from different reward dimensions. Proc Natl Acad Sci U S A. 2014;111(6):2343–8.
OpenUrl Abstract/FREE Full Text

[6] 6.↵
Roesch MR, Calu DJ, Schoenbaum G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nat Neurosci. 2007;10(12):1615–24.
OpenUrl CrossRef PubMed Web of Science

[7] 7.↵
Tobler PN, Fiorillo CD, Schultz W. Adaptive coding of reward value by dopamine neurons. Science. 2005;307(5715):1642–5.
OpenUrl Abstract/FREE Full Text

[8] 8.↵
Fonzi KM, Lefner MJ, Phillips PEM, Wanat MJ. Dopamine Encodes Retrospective Temporal Information in a Context-Independent Manner. Cell Rep. 2017;20(8):1765–74.
OpenUrl CrossRef PubMed

[9] 9.↵
Day JJ, Roitman MF, Wightman RM, Carelli RM. Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nature Neuroscience. 2007;10(8):1020–28.
OpenUrl CrossRef PubMed Web of Science

[10] 10.↵
Flagel SB, Clark JJ, Robinson TE, Mayo L, Czuj A, Willuhn I, et al. A selective role for dopamine in stimulus-reward learning. Nature. 2011;469(7328):53–7.
OpenUrl CrossRef PubMed Web of Science

[11] 11.↵
Cacciapaglia F, Saddoris MP, Wightman RM, Carelli RM. Differential dopamine release dynamics in the nucleus accumbens core and shell track distinct aspects of goal-directed behavior for sucrose. Neuropharmacology. 2012;62(5-6):2050–6.
OpenUrl CrossRef PubMed Web of Science

[12] 12.↵
Saddoris MP, Cacciapaglia F, Wightman RM, Carelli RM. Differential Dopamine Release Dynamics in the Nucleus Accumbens Core and Shell Reveal Complementary Signals for Error Prediction and Incentive Motivation. J Neurosci. 2015;35(33):11572–82.
OpenUrl Abstract/FREE Full Text

[13] 13.↵
Wanat MJ, Kuhnen CM, Phillips PE. Delays conferred by escalating costs modulate dopamine release to rewards but not their predictors. J Neurosci. 2010;30(36):12020–7.
OpenUrl Abstract/FREE Full Text

[14] 14.↵
Clark JJ, Sandberg SG, Wanat MJ, Gan JO, Horne EA, Hart AS, et al. Chronic microsensors for longitudinal, subsecond dopamine detection in behaving animals. Nat Methods. 2010;7(2):126–9.
OpenUrl CrossRef PubMed Web of Science

[15] 15.↵
Stelly CE, Tritley SC, Rafati Y, Wanat MJ. Acute Stress Enhances Associative Learning via Dopamine Signaling in the Ventral Lateral Striatum. J Neurosci. 2020;40(22):4391–400.
OpenUrl Abstract/FREE Full Text

[16] 16.↵
Saunders BT, Robinson TE. The role of dopamine in the accumbens core in the expression of Pavlovian-conditioned responses. Eur J Neurosci. 2012;36(4):2521–32.
OpenUrl CrossRef PubMed

[17] 17.↵
Stelly CE, Haug GC, Fonzi KM, Garcia MA, Tritley SC, Magnon AP, et al. Pattern of dopamine signaling during aversive events predicts active avoidance learning. Proc Natl Acad Sci U S A. 2019;116(27):13641–50.
OpenUrl Abstract/FREE Full Text

[18] 18.
Oliva I, Donate MM, Lefner MJ, Wanat MJ. Cocaine experience abolishes the motivation suppressing effect of CRF in the ventral midbrain. Addict Biol. 2019:e12837.

[19] 19.↵
Oliva I, Wanat MJ. Operant Costs Modulate Dopamine Release to Self-Administered Cocaine. J Neurosci. 2019;39(7):1249–60.
OpenUrl Abstract/FREE Full Text

[20] 20.↵
Heien ML, Khan AS, Ariansen JL, Cheer JF, Phillips PE, Wassum KM, et al. Real-time measurement of dopamine fluctuations after cocaine in the brain of behaving rats. Proc Natl Acad Sci U S A. 2005;102(29):10023–8.
OpenUrl Abstract/FREE Full Text

[21] 21.↵
Paxinos G, Watson C. The Rat Brain in Stereotaxic Coordinates: The New Coronal Set. Elsevier Science & Technology: London; 2004.

[22] 22.↵
Heymann G, Jo YS, Reichard KL, McFarland N, Chavkin C, Palmiter RD, et al. Synergy of Distinct Dopamine Projection Populations in Behavioral Reinforcement. Neuron. 2020;105(5):909–20 e5.
OpenUrl

[23] 23.↵
Lee K, Claar LD, Hachisuka A, Bakhurin KI, Nguyen J, Trott JM, et al. Temporally restricted dopaminergic control of reward-conditioned movements. Nat Neurosci. 2020;23(2):209–16.
OpenUrl CrossRef PubMed

[24] 24.↵
Ko D, Wanat MJ. Phasic Dopamine Transmission Reflects Initiation Vigor and Exerted Effort in an Action-and Region-Specific Manner. J Neurosci. 2016;36(7):2202–11.
OpenUrl Abstract/FREE Full Text

[25] 25.↵
Saddoris MP, Wang X, Sugam JA, Carelli RM. Cocaine Self-Administration Experience Induces Pathological Phasic Accumbens Dopamine Signals and Abnormal Incentive Behaviors in Drug-Abstinent Rats. J Neurosci. 2016;36(1):235–50.
OpenUrl Abstract/FREE Full Text

[26] 26.↵
Darvas M, Wunsch AM, Gibbs JT, Palmiter RD. Dopamine dependency for acquisition and performance of Pavlovian conditioned response. Proc Natl Acad Sci U S A. 2014;111(7):2764–9.
OpenUrl Abstract/FREE Full Text

[27] 27.↵
Gallistel CR, Gibbon J. Time, rate, and conditioning. Psychol Rev. 2000;107(2):289–344.
OpenUrl CrossRef PubMed Web of Science

[28] 28.
Gibbon J, Baldock MD, Locurto C, Gold L, Terrace HS. Trial and intertrial durations in autoshaping. Journal of Experimental Psychology: Animal Behavior Processes. 1977;3(3):264–84.
OpenUrl CrossRef Web of Science

[29] 29.
Holland PC. Intertrial interval effects in pavlovian serial feature positive discriminations. Animal Learning & Behavior. 1999;27(2):127–39.
OpenUrl CrossRef Web of Science

[30] 30.↵
Terrace HS, Gibbon J, Farrell L, Baldock MD. Temporal factors influencing the acquisition and maintenance of an autoshaped keypeck. Animal Learning & Behavior. 1975;3(1):53–62.
OpenUrl CrossRef Web of Science

[31] 31.↵
de Jong JW, Afjei SA, Pollak Dorocic I, Peck JR, Liu C, Kim CK, et al. A Neural Circuit Mechanism for Encoding Aversive Stimuli in the Mesolimbic Dopamine System. Neuron. 2019;101(1):133–51 e7.
OpenUrl CrossRef PubMed

[32] 32.↵
Morales M, Margolis EB. Ventral tegmental area: cellular heterogeneity, connectivity and behaviour. Nat Rev Neurosci. 2017;18(2):73–85.
OpenUrl CrossRef PubMed

[33] 33.↵
Coddington LT, Dudman JT. The timing of action determines reward prediction signals in identified midbrain dopamine neurons. Nat Neurosci. 2018;21(11):1563–73.
OpenUrl CrossRef PubMed

[34] 34.↵
Berke JD. What does dopamine mean? Nat Neurosci. 2018;21(6):787–93.
OpenUrl CrossRef PubMed

[35] 35.
Collins AL, Saunders BT. Heterogeneity in striatal dopamine circuits: Form and function in dynamic reward seeking. J Neurosci Res. 2020;98(6):1046–69.
OpenUrl

[36] 36.↵
Mohebi A, Pettibone JR, Hamid AA, Wong JT, Vinson LT, Patriarchi T, et al. Dissociable dopamine dynamics for learning and motivation. Nature. 2019;570(7759):65–70.
OpenUrl CrossRef PubMed

[37] 37.↵
Apicella P. The role of the intrinsic cholinergic system of the striatum: What have we learned from TAN recordings in behaving animals? Neuroscience. 2017;360:81–94.
OpenUrl CrossRef PubMed

[38] 38.↵
Ravel S, Sardo P, Legallet E, Apicella P. Reward unpredictability inside and outside of a task context as a determinant of the responses of tonically active neurons in the monkey striatum. J Neurosci. 2001;21(15):5730–9.
OpenUrl Abstract/FREE Full Text

[39] 39.↵
Cragg SJ. Meaningful silences: how dopamine listens to the ACh pause. Trends Neurosci. 2006;29(3):125–31.
OpenUrl CrossRef PubMed Web of Science

[40] 40.↵
Rice ME, Cragg SJ. Nicotine amplifies reward-related dopamine signals in striatum. Nat Neurosci. 2004;7(6):583–4.
OpenUrl CrossRef PubMed Web of Science

[41] 41.↵
Morrens J, Aydin C, Janse van Rensburg A, Esquivelzeta Rabell J, Haesler S. Cue-Evoked Dopamine Promotes Conditioned Responding during Learning. Neuron. 2020;106(1):142–53 e7.
OpenUrl

[42] 42.↵
Phillips GD, Setzu E, Hitchcott PK. Facilitation of appetitive pavlovian conditioning by d-amphetamine in the shell, but not the core, of the nucleus accumbens. Behav Neurosci. 2003;117(4):675–84.
OpenUrl CrossRef PubMed Web of Science

[43] 43.↵
Saddoris MP, Sugam JA, Cacciapaglia F, Carelli RM. Rapid dopamine dynamics in the accumbens core and shell: learning and action. Front Biosci (Elite Ed). 2013;5:273–88.
OpenUrl CrossRef PubMed