Basal Ganglia role in learning rewarded actions and executing previously learned choices: healthy and diseased states

Garrett Mulcahy; Brady Atwood; Alexey Kuznetsov

doi:10.1101/616854

Abstract

The basal ganglia (BG) is a collection of nuclei located deep beneath the cerebral cortex that is involved in learning and selection of rewarded actions. Here, we analyzed BG mechanisms that enable these functions. We implemented a rate model of a BG-thalamo-cortical loop and simulated its performance in a standard action selection task. We have shown that potentiation of corticostriatal synapses enables learning of a rewarded option. However, these synapses became redundant later as direct connections between prefrontal and premotor cortices (PFC-PMC) were potentiated by Hebbian learning. After we switched the reward to the previously unrewarded option (reversal), the BG was again responsible for switching to the new option. Due to the potentiated direct cortical connections, the system was biased to the previously rewarded choice, and establishing the new choice required a greater number of trials. Guided by physiological research, we then modified our model to reproduce pathological states of mild Parkinson’s and Huntington’s diseases. We found that in the Parkinsonian state PMC activity levels become extremely variable, which is caused by oscillations arising in the BG-thalamo-cortical loop. The model reproduced severe impairment of learning and predicted that this is caused by these oscillations as well as a reduced reward prediction signal. In the Huntington state, the potentiation of the PFC-PMC connections produced better learning, but altered BG output disrupted expression of the rewarded choices. This resulted in random switching between rewarded and unrewarded choices resembling an exploratory phase that never ended. Our results reconcile the apparent contradiction between the critical involvement of the BG in execution of previously learned actions and yet no impairment of these actions after BG output is ablated by lesions or deep brain stimulation. We predict that the cortico-BG-thalamo-cortical loop conforms to previously learned choice in healthy conditions, but impedes those choices in disease states.

Author summary Learning and selection of a rewarded action, as well as avoiding punishments, are known to involve interaction of cortical and subcortical structures in the brain. The subcortical structure that is included in this interaction is called Basal Ganglia (BG). Accordingly, diseases that damage BG, such as Parkinson and Huntington, disrupt action selection functions. A long-standing puzzle is that abolition of the BG output that disconnects the BG-cortical interaction does not disrupt execution of previously learned actions. This is the principle that is suggested to underlie standard Parkinsonian treatments, such as deep brain stimulation. We model the BG-cortical interaction and reconcile this apparent contradiction. Our simulations show that, while BG is necessary for learning of new rewarded choices, it is not necessary for the expression of previously learned actions. Our model predicts that the BG conforms to previously learned choice in healthy conditions, but impedes those choices in disease states.

Introduction

The basal ganglia (BG) is a complex network of excitatory and inhibitory neurons located in the deep brain of most vertebrates that controls action selection (see e.g. (1)). The BG is comprised of the dorsal striatum, external and internal portions of the globus pallidus (GPe, GPi), subthalamic nucleus (STN) and substantia nigra (2). It is traditionally implicated in motor control since BG lesions are associated with movement disorders (3–5). The BG is a shared processing center involved in a broad spectrum of motor and cognitive control (2,6). A cortico-BG-thalamo-cortical neurocircuit loop is suggested to be the structure that provides this control (7,8). However, understanding how this loop functions remains far from complete and requires more experimental and theoretical studies.

The BG is also widely recognized for its involvement in learning (9–11). Reinforcement learning is recognized as the mechanism that establishes behavioral responses for rewards, such as food or drugs of abuse and is altered in numerous disorders and disease states including Parkinson’s disease (12–14). Reinforcement learning is based on communication between midbrain dopamine neurons and the striatum (15,13), specifically ventral tegmental area projections to ventral striatum and substantia nigra pars compacta (SNc) projections to dorsal striatum (16–18). Dopamine (DA) released by dopaminergic VTA and SNc inputs to striatum signals the difference between received and expected rewards – the reward prediction error (RPE) (14). RPE encoding in VTA-NAc neurocircuits involves prediction of reward value which in turn feeds back to both VTA and SNc dopamine neurons (15). Given its role in motor control, the SNc-dorsal striatum component of the BG translates RPE into action: the hypothesized critic-actor roles of these two dopaminergic projections (15,14,19,20). If the RPE is positive, additional DA release leads to positive reinforcement of the preceding action; if the error is negative (expected more than received), a pause in DA release leads to negative reinforcement and blocks the action. As a mechanism for this control, DA modulates plasticity of synaptic projections from the cortex to striatal medium spiny neurons (MSNs) (21,22). As a reflection of the bidirectional DA modulation, there are two types of MSNs. Those that are responsible for promoting movement are part of the BG direct pathway and express D1-type dopamine receptors (GO, D1-MSNs) and those that inhibit movement are part of the BG indirect pathway and express D2 dopamine receptors (NO-GO, D2-MSNs) (23–25). Indirect and direct BG pathways respectively inhibit or disinhibit the thalamocortical relay neurons responsible for producing particular movements (26–28). The coordination of activity within the two types of MSNs determines action (29–31). Within the BG loops, synaptic plasticity of corticostriatal projections is a key node in the learning of rewarded choices (9–11,22).

The BG is suggested to remain involved in action selection after the action-reward association is learned and control the transition from goal-directed to habitual choices (8,32). On the other hand, clinical interventions for Parkinson disease (PD) do not cause impairments in goal-directed or habitual movements (33–35). Specifically, GPi lesions and deep brain stimulation, which disrupt the main output of the BG, are used to improve motor functions. This observation gave rise to a hypothesis that the BG play a critical role in learning, but not in the expression of already learned actions or choices (36,37). These choices are suggested to instead be stored in synaptic connections within cortex. This hypothesis apparently contradicts the suggested involvement of the BG in executing actions learned previously. Therefore, it is essential to fill in this knowledge gap by further investigating the role of the BG in goal-directed and habitual choices.

This paper presents a computational model of the cortico-BG-thalamo-cortical loop involved in a two-choice instrumental conditioning task (32). This task is standard for assessing action-reward association in animals and humans. Our model design is similar to a previously published design (37,38), but focused on choice selection. We implemented two synaptic mechanisms that can mediate learning: reward-related plasticity of corticostriatal synapses (39) and activity-dependent Hebbian plasticity (40,41) of cortico-cortical synapses. To elucidate the role of the BG in Parkinson’s and Huntington diseases, we calibrate the model to reflect the altered BG connectivity documented for these diseases and simulate these changes in BG activity.

Results

We simulated the same standard two-choice IC and reversal task in three conditions: Healthy, Parkinsonian, and Huntington’s BG. Fig. 1 presents a schematic diagram of nuclei and connections within the BG and their connections with cortices. The model is described in detail in Materials and Methods. The models received a stimulus (CS) that activates prefrontal cortical (PFC) neurons for all 500 trials. We say that the network chooses action 1 if the premotor cortical (PMC) neural group 1 displays greater activity than the PMC group 2. For reversal training, after action 1 is rewarded in trials 1 through 199, for trials 200 through 500, action 2 was rewarded instead. We analyze and compare the learning and reversal performance in the three model states below.

Figure 1:

The structure of the cortico-basal ganglia-thalamo-cortical loop model. The BG receives inputs from the prefrontal cortex (PFC) signaling the conditioning stimulus (CS) as well as reward inputs via substantia nigra pars compacta (SNc). The SNc forms a dopamine reward prediction error (RPE) signal, which governs plasticity of the connections from the PFC (DA LTP/LTD; green). The BG input structure, striatum, contains medium spiny neurons (MSNs), which cluster in 2 subtypes: D1 and D2 dopamine receptor-containing (direct and indirect pathways respectively). The rest of the nuclei are the globus pallidus external (GPe), subthalamic nucleus (STN), and the output structures: substantia nigra pars reticulata and globus pallidus internal (SNr/GPi). The loop is completed by connections from and to premotor cortices/thalamus (PMC/Thal). The two channels of the loop are colored purple/blue.

Healthy BG facilitates learning of rewarded choices

Fig. 2A shows choices made in the simulations: a higher activity of the PMC1 manifests choice 1 and vice versa. The graph shows the activity at the end of each trial, which is taken to be 750 msec long. On early trials, the choice is made randomly due to random initial conditions in the PMC network and mutual inhibition of PMC1 and PMC2. This reproduces the exploration phase, where the information about reward is collected (42,43). The modeled animal receives an unexpected reward every time it chooses action 1 (PMC1 on top). Within 20 trials, the system starts to consistently choose the rewarded action, and only a few exploratory deviations are made after that. On trial 200, we switch the simulated task to reversal: action 2 is rewarded instead. This quickly leads to reestablished exploratory behavior, and then locks the system to the rewarded choice, with occasional exploratory returns to choice 1. As explained below, our model allows for detailed analysis of the mechanism of this learning.

Figure 2:

Healthy BG facilitates learning of the initial task and reversal. Trial-by-trial dynamics of the PFC activity and underlying modulation of synaptic weights in the Healthy BG model. Trials 1-199:initial learning; trials 200-500: reversal (A) A higher activity of PMC1 (blue) manifests choice 1, whereas higher activity of PMC2 manifests choice 2. (B) Synaptic weights of the PFC to striatum connections. (C) Synaptic weights of the PFC to PMC connections.

Two mechanisms facilitate learning of the rewarded choice – one fast and one slow. The first mechanism is the potentiation of the PFC-to-striatum synaptic connections (Fig. 2B). The unexpected reward creates a positive RPE encoded by SNc DA signaling and potentiates PFC connections to all D1-MSNs (Fig. 2B). Note that the connections to D2-MSNs are potentiated much later (see below). Importantly, whereas the DA signal itself is not selective for MSNs specific to the rewarded action, DA-mediated potentiation of PFC-MSN synapses is selective. What makes potentiation selective is the level of activation of the corresponding MSN: in the initial trials the reward is granted only if choice 1 is selected, that is when PMC1 activity is greater, and this happens only when the corresponding D1-MSN is active (due to static synaptic connections from PMC to MSNs specific for each choice). Since synaptic plasticity explicitly depends on the activity of the postsynaptic neuron, PFC-to-MSN1 connections are potentiated much more strongly than other MSN connections (Fig. 2B). This further selectively activates D1-MSN1s. Thus, excitation of D1-MSNs neurons associated with choice 1 increases due to direct excitation from the PFC associated with the stimulus. The increased activity level of D1-MSN1s inhibits downstream GPi1 neurons and, consequently, disinhibits the PMC1 neural group (Fig. 3).

Figure 3:

Within-trial dynamics of neural activity in the model with healthy BG. The network is biased towards option 1 as the PFC-D1-MSN1 and PFC-D2MSN2 connection weights are both set at 0.7, which corresponds to a trial in late initial learning phase (∼100). Activation of the D1-MSN1 group inhibits GPi1 neurons, and thus disinhibits PMC1. GPi2 neurons remain excited and inhibit PMC2.

The PFC to D2-MSN connections are potentiated much later in the process (Fig. 2B, purple) and further reinforce the activity of PMC1. The potentiation delay is because a negative RPE is required for activation of the D2 MSNs, which is formed after the expected reward builds up and a nonrewarded action is selected by chance. Then, every choice that is not followed by the expected reward activates the corresponding indirect pathway (i.e. D2-MSN2), which excites the downstream GPi2 neurons, and consequently inhibits the PMC2 activity (Fig. 3). This blocks the nonrewarded action and helps to lock the choice to the rewarded action. Co-activation of the two mechanisms is sufficient to lock the choice to the rewarded action.

During subsequent repetitions of the same trial, the PFC-MSN connection strength starts to decrease and approaches zero (Fig. 2B trials 80 to 200). However, the persistence of the rewarded choice remains intact (Fig. 2A). The mechanism for this is the growth of direct PFC-PMC1 connections (Fig. 2C) via classical reward-independent Hebbian synaptic plasticity: the two neural groups are co-active most of the time. This transition from PFC-MSN to PFC-PMC connections as a supporting mechanism for the rewarded choice occurs after the number of repetitions is in the order of a hundred (Fig. 2). Therefore, the model shows that direct cortico-cortical connections are responsible for the choice of the rewarded action after long training.

We next analyzed the behavior of the model when we began rewarding a choice different from the choice the model had been previously conditioned to make; this learning task is called reversal learning (44). Beginning at trial 200, we rewarded the model for selecting the other action (choice 2). Thus, starting at trial 200 the model mimics omission of a reward, which acts as an unexpected punishment (negative reward) for selecting action 1. This punishment potentiates synaptic connections from the PFC to D2-MSNs associated with action 1 (D2-MSN1, Fig. 2B yellow), and, slightly later, to D1-MSNs associated with action 2 (D1-MSN2, Fig 2B red). This engagement of both direct and indirect pathways offsets the model bias for action 1 and quickly sends the model into another exploratory phase. As Fig. 2A demonstrates, between trials 200 and 300 the model is randomly choosing between the two actions. It is important to note that, in accordance with others’ findings (45,46), this second exploratory phase lasts longer than the initial exploratory phase. During reversal, the new potentiation of PFC-MSN connections is not enough to effectively overcome the bias for the initially learned choice and ensure choosing the newly rewarded option. The reversal exploratory phase ends only when the PFC-PMC2 connections become as strong as PFC-PMC1 and remove the bias (Fig. 2). Thus, the longer exploratory phase during reversal occurs because the model must first overcome its bias for the previously learned choice and then develop a new stimulus-choice 2 association.

The reversal mechanism relies more on the D2-MSN, indirect pathway and less on the D1-MSN, direct pathway than the initial learning. Due to the potentiated PFC-PMC1 connection, the system continues choosing option 1, even though it’s not rewarded. This generates a negative reward prediction error (Fig. 4) and potentiates PFC connections to the D2-type neurons associated with action 1 (D2-MSN1; Fig. 2B yellow). The connection of PFC to D1-MSN2, which conducts the GO signal for the choice 2 lags by several trials (Fig. 2B, red), during which the exploratory phase begins and allows finding the new rewarded option. The connections to the D1 MSNs do not potentiate as strong during reversal as those during initial learning. Their temporal profile closely matches the positive RPE signal, which also stays significantly lower during reversal compared to the initial learning (Fig. 2B blue and red, Fig. 4 RPE). As a result, the reversal learning engages direct and indirect pathways at a comparable strength (Fig. 2B, yellow and red), whereas during initial learning the direct pathway is engaged much stronger than the indirect (Fig. 2B, blue and purple).

Figure 4:

Reward, expected reward (A), and the RPE (B) during initial learning and reversal trials in the model with healthy BG. As before, reversal starts at trial 200 (vertical black line). Note a greater RPE at the beginning of the initial learning compared to the reversal.

Mild Parkinsonian BG: impeded learning and spontaneous oscillations

Our simulations (Fig. 5) show drastic difference in dynamics of the PMC neurons during initial learning and reversal in the model with mid-parkinsonian BG. During both phases, learning is severely impaired. First, the choice remains random for approximately the first 120 trials. Second, the model does not reliably choose the rewarded option even after this period, although the rewarded option is chosen on a much greater number of trials (Fig. 5A blue above red in the initial learning and vice versa in the reversal). Third, the activity of the PMC neurons is overall reduced compared to that in the model with healthy BG, and the trial-to-trial variations of this activity are drastically increased, even when only trials with the same choice are considered.

Figure 5:

Decreased learning performance and increased variability of PMC activity in the model with mild-parkinsonian BG. Trial-by-trial dynamics of PMC activity (A) and underlying modulation of synaptic weights (B,C) in the model with mild-parkinsonian BG state. Notation is the same as in Fig. 2. Note the difference in scale in panels (B) and (C) compared to Fig. 2

The underlying dynamic of the synaptic weights is also significantly altered. During initial learning, both direct pathway for choice 1 and indirect pathway for choice 2 are activated at a similar level (Fig. 5B), and this level is much lower than in the model with healthy BG (Fig. 2B). The latter follows directly from the reduced SNC signaling (by 70%), which decreases the RPE and, thus, impedes potentiation of PFC-MSN connections. Since both PMC neural groups are active at a similar level, both connections from PFC are potentiated (Fig. 5C), and the system does not develop a preference for the rewarded choice. After trial 80, the rewarded choice starts to prevail as the PFC-PMC connections reflect the preference for choice 1. However, the PFC-PMC1 connection does not achieve the level reached in the model with healthy BG (Fig. 2C) within the 200 trials designated for initial learning. Hence, exploration between the choices persists for all 200 trials, and the prevalence of the rewarded choice requires the persistent activation of PFC-MSN connections. Therefore, the model with mild parkinsonian BG is capable of learning the choices, but the effective learning rate is much lower.

The low levels of PFC-PMC connections persist into the reversal phase too and never reach the levels shown by the model with healthy BG even though plasticity rules of the PFC-PMC connections remain the same in both models. Therefore, our modeling predicts that the mild-parkinsonian BG does not allow for the proper potentiation of the PFC-PMC connections, and this leads to impaired learning. Interestingly, the reversal phase starts with activation of both indirect pathways simultaneously (Fig. 5B, purple and yellow). This suppresses the activity of both PMC neural groups, blocks any choice and blocks changes in the PFC-PMC synaptic weights. Only after some 40 trials, the NO-GO signal for choice 2 is replaced by a GO (Fig. 5 purple and red). Thus, the model with mild-parkinsonian BG predicts that the exploratory phase at the beginning of the reversal learning is replaced by blockade of any choice, and this further impedes learning.

Perhaps the most interesting change in the model with parkinsonian BG is the drastic increase in the trial-to-trial variability of the PMC neurons (Fig. 5A). To explain the mechanism of this variability, we considered within-trial dynamics of activity for all neural groups in the model. Fig. 6 shows these dynamics for the PMC neurons and MSNs in the healthy vs. parkinsonian BG models. In the healthy case activity levels come to an equilibrium, while in the parkinsonian case, they engage in persistent oscillations. The anti-phase for the oscillations in the neural groups corresponding to the choice 1 and 2 is due to mutual competition (inhibition) between PMC1 and PMC2 groups. The oscillations arise from the negative feedback loop that the BG, and in particular its indirect pathway, provides for the activity of each PMC neural group. Indeed, the static PMC to D2 MSN connections, which constitute this negative feedback, are stronger in the parkinsonian case (W_PMC–D₂, in Table 2). The period of these oscillations is approximately 210 ms, which is 4.7 Hz. No potentiation in the PFC-PMC and PFC-MSN connections within the ranges in Fig. 5 B and C suppress the oscillations (data not shown). Therefore, the simulations predict that the trial-to-trial variability of the PMC neurons in the model with parkinsonian BG is caused by robust within-trial oscillations in the activity of all neuron groups in the model.

Figure 6:

Within-trial dynamics of neural activity in the model with healthy (left) and parkinsonian (right) BG. Panels A, B, and C show firing rates for PMC,D1 MSNs and D2 MSNs respectively. In the healthy case, the firing rates equilibrate within 500 ms. In the parkinsonian case, oscillations in the firing rate emerge and persist. All plastic synaptic connections are set to zero to simulate the state of no bios towards any choice.

View this table:

Table 1:

Parameters of the healthy BG model state

View this table:

Table 2:

Changes in the parameters of the model that reproduce Parkinsoninan BG state.

In order to model the impact of BG DBS or surgical interventions on performance and learning in PD, we performed additional simulations of the PD model in which the BG signal to PMC was ablated from trial 150 till the end (Fig. 7). In this period, the variability of the PMC activity vanishes completely. Furthermore, the PFC-striatal connections no longer exert any influence on the choices, but the PFC-PMC connections are strong enough to lock the choice to the rewarded option, and the cortical connections increase further at a greater rate. After the reversal on trial 200, however, the changed values of the choices remain unnoticed by the system, the choice remains locked on the now unrewarded option, and the cortical connections supporting this choice keep rising. In this state, behavior improves, but learning is impaired.

Figure 7:

In the PD state model, the variability of PMC activity and switching between choice 1 and 2 cease at the DBS onset. Trial-by-trial dynamics of the PMC activity and underlying modulation of synaptic weights in the PD BG model with simulated DBS starting at trial 150. Same notation as in Fig. 2. (A) The levels of PMC1 and PMC2 activity (choice 1 vs. 2) at the end of each trial (B) Synaptic weights of the PFC to striatum connections reflect rewarded choices. (C) Synaptic weight of the PFC to PMC1 connection keep growing after DBS onset, and during reversal.

Grade 2 Huntington’s Disease BG state: persistent exploratory behavior

If the above case of Parkinson’s disease is associated with strengthening the indirect pathway, in the case of Huntington’s disease the connections in the indirect pathway become weaker (Table 3). The major difference with the healthy BG model is that the trial-to-trial dynamics of the PMC neural groups looks like the exploratory phase never ends (Fig. 8A). At the same time, we see from the synaptic weights (Fig. 8B and C) that choice-reward contingencies are learned almost as effectively as in the healthy case (Fig. 2), although the synaptic weights are somewhat lower. The differences are the activation of the indirect pathway for choice 2 lingering at the beginning of the reversal phase (Fig. 8B purple) and the persistence of the PFC-MSN connections similar to the parkinsonian case. The latter, however, is not a cause but rather a consequences of the continuous exploratory choices that bring no reward. Therefore, despite the efficacious learning (Fig. 8C), choice behavior is impaired relative to control (Fig. 8A).

Figure 8:

Random switches between rewarded and unrewarded options persist in the model with Huntington state BG. Trial-to-trial dynamics of PFC neural activity (A) and underlying dynamics of synaptic weights (B,C). The notation is the same as in Fig. 2.

View this table:

Table 3:

Changes in the parameters of the model that reproduce Huntington disease state.

The cause for the persistent exploratory phase is the positive PMC-BG feedback loop through D1 MSNs, which is not balanced by the D2 MSN pathway. Indeed, an occasional increase in the activity of the PMC2 neural group, which represents a non-rewarded action, excites the corresponding D1 MSN group, and through disinhibition by GPi2 activity, further increases the PMC2 activity (Fig. 9). The reduced connectivity in the D2 MSN pathway makes the STN neural activity the same for choices 1 and 2 (data not shown) and excludes the indirect pathway from the competition between the choices. This leads to occasional choices of the non-rewarded option, and our simulations show that this behavior is robust with respect to growing PFC-PMC and PFC-MSN connections (Fig. 8). Therefore, the lack of balance between direct and indirect pathways in the model of Huntington’s disease causes persistent random switching from rewarded to non-rewarded choice after both initial learning and reversal.

Figure 9:

Occasional choice of the nonrewarded option made in the model with Huntington state BG. Within-trial dynamics of PMC, D1 MSN, and GPi neural activity is shown. The greater activity of PMC2 groups signifies that the action 2 is chosen, even though choice 1 is made preferable in the model by potentiating PFC-PMC1, PFC-D1 MSN1 and PFC-D2 MSN2 connections: W_PFC1–PMC1 = 0.04, W_{PFC1–D1MSN1} = 1, W_{PFC1–D2MSN2} = 1

In order to model the impact of BG DBS or surgical interventions on performance and learning in HD, we also performed additional simulations of the HD model in which the BG signal to PMC was ablated from trial 100 till the end (Fig. 10). The random switches between the choices cease shortly after, but not at the onset of DBS. The response to DBS is very similar to that in the PD case (Fig. 7). In this period, the PFC-striatal connections no longer exert any influence on the choices, but the PFC-PMC connections are strong enough to lock the choice to the rewarded option. After the reversal on trial 200, however, the changed values of the choices remain unnoticed by the system, the choice remains locked on the now unrewarded option, and the cortical connections supporting this choice keep rising. Therefore, during DBS, or after surgical interventions ablating BG output, behavior improves, but learning is impaired in HD as well as in the PD state.

Figure 10:

In the HD state, the random switches between choice 1 and 2 cease shortly after, but not at the DBS onset. Trial-by-trial dynamics of the PMC activity and underlying modulation of synaptic weights in the Huntington BG model with simulated DBS starting at trial 100. Same notation as in Fig. 2. (A) The levels of PMC1 and PMC2 activity (choice 1 vs. 2) at the end of each trial (B) Synaptic weights of the PFC to striatum connections reflect rewarded choices. (C) Synaptic weight of the PFC to PMC1 connection keep growing after DBS onset, and during reversal.

Discussion

Our model implements the cortico-BG-thalamo-cortical loop function in a standard 2-choice instrumental conditioning task. We have shown that potentiation of corticostriatal synapses enables learning of rewarded options. However, later these synapses become redundant as direct connections between prefrontal and premotor cortices (PFC-PMC) potentiate by Hebbian learning. The model shows that disease-related imbalances of the direct and indirect pathways in the BG impairs learning and suggests that these imbalances may also impede choices that have been learned previously, in spite of BG redundancy for those choices.

Our model of the parkinsonian state reproduces several major behavioral and electrophysiological features documented experimentally: First, initial learning is much slower, but reversal takes about as many trials in the mild PD state as it does in the healthy state (∼100 trails). As initial learning is associated with an unpredicted reward (positive RPE) and reversal with reward omission (negative RPE), which is similar to punishment, this is consistent with experimental findings, in which reward, but not punishment learning is impeded in PD patients (47,48). Second, the overall PMC activity is diminished in the PD state, consistent with PD studies (49). Further, the model predicts that this activity is lowest at the beginning of the initial learning and reversal due to aberrant engagement of the indirect pathway, which can be displayed as stronger bradykinesia. Third, the model shows robust oscillations in the activity of the cortico-BG-thalamo-cortical loop in the PD state. The oscillations are generated by a negative feedback branch of the loop through the indirect pathway as suggested before (50,51). The frequency of these oscillations is about 5 Hz, which is in the theta band. An increase in the EEG theta band is a marker of PD-related cognitive decline (52,53). Our simulations show that the oscillations cause multiple choice errors and, consequently, impede task performance and learning.

In the HD state, our model displays persistent randomly occurring choices of the unrewarded option, especially frequent after the reversal. This would register as impaired learning in behavioral tests, which is consistent with experimental results for cognitive (54,55) and motor tasks (56,57) in HD patients in the early stages of the disease. Furthermore, the model suggests that performance for previously learned tasks is also affected.

Therefore, our model reproduces impairments of the previously learned actions documented in BG-affecting diseases like PD and HD as well as after certain BG lesions (8,32,58). However, surgical and deep brain stimulation (DBS) interventions in PD and HD patients do not impair, but rather restore motor function (33–35,59). This raises the question: how can these two lines of evidence therefore be reconciled?

Learning in the model consists of two phases: BG-based and cortex-based. In a faster BG-based phase, the connections from PFC to MSNs are potentiated according to the RPE signal. The BG output inhibits choices with negative RPE and disinhibits those with positive RPE. Once the behavior is learned, the RPE becomes zero, and the PFC-MSN connections decay to zero. The future choices are supported by the slower cortex-based learning phase: The connections from PFC directly to PMC are potentiated based on the Hebbian mechanism. Our simulations show that, even after the cortico-cortical connections increase to the levels ensuring robust choice of the rewarded option in the healthy state, both of the disease models are unable to make robust choices. Thus, behaviors that no longer need the BG are impaired. The model shows that it is an abnormal BG output that impairs the choices. Indeed, the BG output to the PMC does not vanish even when the behavior is learned and the BG no longer receives any RPE signal. In this case, due to the inputs from the PMC, the healthy BG disinhibits the previously learned choice, i.e. it conforms with the PFC-PMC associations. This disinhibitory function is impaired in both PD and HD, as well as after striatal lesions (8,32,58). According to this prediction, disruption of the BG output by GPi lesions or DBS, which was successfully used in PD (33–35) and tested in HD patients (59), would improve performance on previously learned tasks. Indeed, our model of a lesion of BG output demonstrates strengthening of performance on previously learned choices. Therefore our model reconciles how specific GPi lesions or DBS that abolish BG output, restore previously learned behaviors that were lost due to disrupted BG function, however this comes at the expense of decreased cognitive flexibility.

Altogether, we have modeled the function of the cortico-BG-thalamo-cortical loop in a 2 choice instrumental conditioning task and shown the mechanism by which this function is disrupted in HD and PD conditions. Further, we have shown how DBS or GPi lesions restore previously learned choices, but completely disrupts learning of new behavior. Our results reconcile the apparent contradiction between the critical involvement of the BG in execution of previously learned actions and yet no impairment of these actions after BG output is ablated by lesions or DBS.

Materials and Methods

We adopt rate model formalism extensively used to reproduce activity and function of numerous brain structures (60). In particular, we follow a validated model of motor control (38) and modify it for action selection.

Structure of the basal ganglia

Fig. 1 presents a schematic diagram of nuclei and connections within the BG and their connections with cortices. The cortico-BG-thalamo-cortical loop is separated into channels selective for each of the two actions of the model (see below). First, the striatum, the primary input structure of the BG, receives excitatory inputs from the prefrontal cortex (PFC) and premotor cortex (PMC) in the cerebrum as well as the thalamus. From the striatum, two competing pathways are activated: a direct pathway (striatum-SNr/GPi) and an indirect pathway (striatum-GPe-STN-SNr/GPi). These two pathways converge at the BG output nuclei, the SNr and GPi, and serve to modulate their activites. In the model SNr and GPi activity are treated as one unit. SNr/GPi activity inhibits a corresponding neural group in the thalamus and PMC and blocks the corresponding action. In the model thalamus and PMC activity is treated as a single unit (PMC/Thal). To execute the action, SNr/GPi activity must decrease and disinhibit the PMC/Thal neurons. In addition, DA neurons in the SNc signal a reward prediction error (RPE), which change synaptic weights of PFC-striatum connections via DA-dependent long-term synaptic potentiation (LTP) and long-term synaptic depression (LTD) to allow for reward-based learning.

Behavioral task

Our model implements a standard design for intertemporal choice tasks (32). The circuitry shown in Fig. 1 is built to reproduce selection between two actions, one of which is rewarded. A typical task is to learn that, for instance, action 1 is rewarded if a conditioning stimulus (CS) is presented. Then, this task is “reversed”: after learning this contingency, the reward following the same CS is shifted to action 2. Thus, the cortico-BG-thalamo-cortical loop has 2 channels: for choice 1 and 2, except for the PFC that represents the CS and the SNc that represents the unexpected reward. Activation of neural groups 1 and 2 in the PMC/thalamus correspond to execution of action 1 and 2 respectively. Thus, in the model, an action is considered selected if the activity level of the corresponding PMC neural group at the end of a simulated trial is higher than that of the other group. The behavioral readout is if the stimulus-reward contingencies can be learned, and how many trials learning takes.

Firing rate equations

The activity of every neuron (except the dopaminergic neurons in the SNc) is governed by the following differential equation (38): where A is the instantaneous activity level of the neuron. Here, τ is a time constant taken to equal 15 msec based on previous models and experimental studies (61). I is the synaptic input to the neuron. The expressions for synaptic input to each neuron group, and the formula are compiled in Table 1. σ(I) is a normalized response function defined as:

View this table:

Table 1:

Synaptic inputs

We have adapted the following notation: X_m to denote the activity (firing rate) of neural group X in the pathway for the m^th action. Since our model contains only two actions, the only possible values for m are 1 and 2. The index n in the formula for X_m refers to the other of the two channels, e.g. . Further, w_{X_Y} denotes the synaptic weight (strength of connection) from group X to group Y and dr_X denotes a tonic drive to group X. Many of these weights are assumed constant throughout our trials, but several of them are plastic as described below.

Synaptic plasticity

The synaptic weights from PFC to PMC neurons and from PFC to MSNs are plastic, which means that they change depending on the activity of these nuclei and behavioral outcome (reward received) respectively (40,41,39). In simulations, the synaptic weights are updated at the beginning of every trial depending on the behavior of the model in previous trials. Before we discuss the specific mechanisms by which we updated these plastic synaptic weights, we will first discuss how we calculated the activity of the dopaminergic neurons in the SNc, which essentially mediate reward-based learning.

The activity of the SNc neurons is associated with a reward prediction error (RPE) (62). Following previous models (e.g. (38)), we assume that the activity of the SNc neural group reflects the difference between the expected reward and the actual reward: where R is the actual reward given based on the action selected, and is the expected reward at the j^th trial. The expected reward on the first trial, , is equal to 0 and is then subsequently updated according to the following scheme: where α is a constant (set equal to 0.15) and R_j denotes the actual reward received by the model on the j^th trial.

The actual reward received in simulations, R, is determined by the following: where we determined which action is selected by comparing the activities of the PMC neurons at the end of each trial as described above.

Altogether, after each trial, the PFC-striatal synaptic connections are updated according to the following rules: where λ is a learning rate constant and d is the decay rate constant. Here, PFC,D1_m, and D2_m denote the activity of the respective neural group at the end of the trial (m = 1,2).

Lastly, we describe the mechanism by which we updated the connections between the PFC and PMC neurons. Here, we let W_PFC–PMCm denote the synaptic weight of the connection between the PFC neural group and the m^th PMC neural group. After each trial, the synaptic weights are updated according to the following Hebbian Learning Rule: where λ_CM is the learning rate and d_CM is the decay rate of the cortical connections. Here, PFC and PMC_m denote the activity of the PFC neurons and m^th PMC neuron group at the end of the trial.

Now, we will outline our methodologies for calibrating our three different BG model states: healthy, Parkinsonian, and Huntington’s disease.

Healthy BG state

We target to reproduce rodent behavior in instrumental conditioning (IC) tasks (32). Thus, an animal will learn contingencies between a conditioning signal and a rewarded action— pressing one of two levers. We reduce the model by (38) and focus our model on the interaction of the thalamocortical and BG networks (Fig. 1) and reproduce the function of the cortico-BG-thalamo-cortical loop in the above two-choice task. The parameter values are shown in Table 1. The values were taken from previous studies (38) with a few minor modifications that allow for both robust instrumental conditioning as well as reversal learning.

Parkinsonian BG state

To create disease models from our healthy BG model, we reviewed physiological data. The neuropathology of Parkinson’s Disease (PD) is incredibly well-understood: it begins with the destruction of the dopaminergic neurons in the SNc (63,64). Further, the disease is accompanied by a decreased firing rate of the D1 MSNs (65,66), GPe (67–69), and PMC (70) as well as increased firing rates in the D2 MSNs (65,66), STN (71,72), and GPi (73,67,74). We induced an in silico mild Parkinsonian state in our model by suppressing SNc output by 70% and changing synaptic weights along with tonic drives (49,64) as outlined in Table 2.

Huntington’s BG state

The pathology of Huntington’s Disease (HD) is less well-understood; however, it is clear that there is a progression of the disease from chorea (involuntary, jerky movement) at its onset to akinesia (loss of the power of voluntary movement) at its conclusion (75). We modeled the chorea phase (Grade 2 HD) by weakening the D2 MSN-GPe connection by 90%, weakening the GPe-STN connection by 40%, and decreasing the PFC input to account for destruction of the PFC (75,76). These percentages are gathered from the physiological observations of Reiner et al. (75). The resulting parameters are shown in Table 3.

Numerical Simulations

Our model was coded in MATLAB. We considered a trial to last 750 msec, and at the end we register the activity of each neuron in the circuit. We chose to cutoff trials at this point because it was sufficient to guarantee that the neural activity converges to a steady state. An exception is a case when neural activity does not approach a steady state and remains oscillatory, which we also found in this study. We update strengths for the plastic synapses after each trial. Finally, we reset the initial activity of the neurons to be at randomized levels at the beginning of each subsequent trial. We ran simulations consisting of 500 such trials.

References

1.↵
Redgrave P, Prescott TJ, Gurney K. The basal ganglia: a vertebrate solution to the selection problem? Neuroscience. 1999 Apr;89(4):1009–23.
OpenUrl CrossRef PubMed Web of Science
2.↵
Nelson AB, Kreitzer AC. Reassessing models of basal ganglia function and dysfunction. Annu Rev Neurosci. 2014;37:117–35.
OpenUrl CrossRef PubMed
3.↵
Bhatia KP, Marsden CD. The behavioural and motor consequences of focal lesions of the basal ganglia in man. Brain. 1994;117(4):859–76.
OpenUrl CrossRef PubMed Web of Science
4.
DeLong MR. Primate models of movement disorders of basal ganglia origin. Trends Neurosci. 1990 Jul;13(7):281–5.
OpenUrl CrossRef PubMed Web of Science
5.↵
Denny-Brown. The basal ganglia and their relation to disorders of movement. Br J Surg. 1962 Jul;50(219):117–8.
OpenUrl
6.↵
Aron AR, Durston S, Eagle DM, Logan GD, Stinear CM, Stuphorn V. Converging Evidence for a Fronto-Basal-Ganglia Network for Inhibitory Control of Action and Cognition. J Neurosci. 2007 Oct 31;27(44):11860–4.
OpenUrl FREE Full Text
7.↵
Draganski B, Kherif F, Klöppel S, Cook PA, Alexander DC, Parker GJM, et al. Evidence for segregated and integrative connectivity patterns in the human Basal Ganglia. J Neurosci Off J Soc Neurosci. 2008 Jul 9;28(28):7143–52.
OpenUrl Abstract/FREE Full Text
8.↵
Redgrave P, Rodriguez M, Smith Y, Rodriguez-Oroz MC, Lehericy S, Bergman H, et al. Goal-directed and habitual control in the basal ganglia: implications for Parkinson’s disease. Nat Rev Neurosci. 2010 Nov;11(11):760–72.
OpenUrl CrossRef PubMed Web of Science
9.↵
Packard MG, Knowlton BJ. Learning and Memory Functions of the Basal Ganglia. Annu Rev Neurosci. 2002 Mar;25(1):563–93.
OpenUrl CrossRef PubMed Web of Science
10.
Foerde K, Shohamy D. The role of the basal ganglia in learning and memory: insight from Parkinson’s disease. Neurobiol Learn Mem. 2011 Nov;96(4):624–36.
OpenUrl CrossRef PubMed
11.↵
Lanciego JL, Luquin N, Obeso JA. Functional neuroanatomy of the basal ganglia. Cold Spring Harb Perspect Med. 2012 Dec 1;2(12):a009621.
OpenUrl Abstract/FREE Full Text
12.↵
Graybiel AM. Habits, Rituals, and the Evaluative Brain. Annu Rev Neurosci. 2008 Jul;31(1):359–87.
OpenUrl CrossRef PubMed Web of Science
13.↵
Maia TV, Frank MJ. From reinforcement learning models to psychiatric and neurological disorders. Nat Neurosci. 2011 Feb;14(2):154–62.
OpenUrl CrossRef PubMed Web of Science
14.↵
Keiflin R, Janak PH. Dopamine Prediction Errors in Reward Learning and Addiction: From Theory to Neural Circuitry. Neuron. 2015 Oct 21;88(2):247–63.
OpenUrl
15.↵
Takahashi Y, Schoenbaum G, Niv Y. Silencing the critics: understanding the effects of cocaine sensitization on dorsolateral and ventral striatum in the context of an actor/critic model. Front Neurosci. 2008 Jul;2(1):86–99.
OpenUrl CrossRef PubMed
16.↵
Frank MJ. Dynamic Dopamine Modulation in the Basal Ganglia: A Neurocomputational Account of Cognitive Deficits in Medicated and Nonmedicated Parkinsonism. J Cogn Neurosci. 2005 Jan;17(1):51–72.
OpenUrl CrossRef PubMed Web of Science
17.
Schonberg T, Daw ND, Joel D, O’Doherty JP. Reinforcement Learning Signals in the Human Striatum Distinguish Learners from Nonlearners during Reward-Based Decision Making. J Neurosci. 2007 Nov 21;27(47):12860–7.
OpenUrl Abstract/FREE Full Text
18.↵
Schultz W. Updating dopamine reward signals. Curr Opin Neurobiol. 2013 Apr;23(2):229–38.
OpenUrl CrossRef PubMed
19.↵
Saunders BT, Richard JM, Margolis EB, Janak PH. Dopamine neurons create Pavlovian conditioned stimuli with circuit-defined motivational properties. Nat Neurosci. 2018 Aug;21(8):1072–83.
OpenUrl CrossRef PubMed
20.↵
Keiflin R, Pribut HJ, Shah NB, Janak PH. Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions. Curr Biol CB. 2019 Jan 7;29(1):93–103.e3.
OpenUrl
21.↵
Surmeier DJ, Plotkin J, Shen W. Dopamine and synaptic plasticity in dorsal striatal circuits controlling action selection. Curr Opin Neurobiol. 2009 Dec;19(6):621–8.
OpenUrl CrossRef PubMed Web of Science
22.↵
Bamford NS, Wightman RM, Sulzer D. Dopamine’s Effects on Corticostriatal Synapses during Reward-Based Behaviors. Neuron. 2018 Feb 7;97(3):494–510.
OpenUrl
23.↵
Gerfen CR. The Neostriatal Mosaic: Multiple Levels of Compartmental Organization in the Basal Ganglia. Annu Rev Neurosci. 1992 Mar;15(1):285–320.
OpenUrl CrossRef PubMed Web of Science
24.
Surmeier DJ, Song WJ, Yan Z. Coordinated expression of dopamine receptors in neostriatal medium spiny neurons. J Neurosci Off J Soc Neurosci. 1996 Oct 15;16(20):6579–91.
OpenUrl Abstract/FREE Full Text
25.↵
Kreitzer AC, Malenka RC. Striatal Plasticity and Basal Ganglia Circuit Function. Neuron. 2008 Nov;60(4):543–54.
OpenUrl CrossRef PubMed Web of Science
26.↵
Mannella F, Baldassarre G. Selection of cortical dynamics for motor behaviour by the basal ganglia. Biol Cybern. 2015 Dec;109(6):575–95.
OpenUrl
27.
Alexander GE, Crutcher MD. Functional architecture of basal ganglia circuits: neural substrates of parallel processing. Trends Neurosci. 1990 Jul;13(7):266–71.
OpenUrl CrossRef PubMed Web of Science
28.↵
Gurney K, Prescott TJ, Redgrave P. A computational model of action selection in the basal ganglia. I. A new functional anatomy. Biol Cybern. 2001 May 11;84(6):401–10.
OpenUrl
29.↵
Cui G, Jun SB, Jin X, Pham MD, Vogel SS, Lovinger DM, et al. Concurrent activation of striatal direct and indirect pathways during action initiation. Nature. 2013 Feb 14;494(7436):238–42.
OpenUrl CrossRef PubMed Web of Science
30.
Markowitz JE, Gillis WF, Beron CC, Neufeld SQ, Robertson K, Bhagat ND, et al. The Striatum Organizes 3D Behavior via Moment-to-Moment Action Selection. Cell. 2018 28;174(1):44–58.e17.
OpenUrl CrossRef PubMed
31.↵
London TD, Licholai JA, Szczot I, Ali MA, LeBlanc KH, Fobbs WC, et al. Coordinated Ramping of Dorsal Striatal Pathways preceding Food Approach and Consumption. J Neurosci Off J Soc Neurosci. 2018 Apr 4;38(14):3547–58.
OpenUrl Abstract/FREE Full Text
32.↵
Balleine BW, Liljeholm M, Ostlund SB. The integrative function of the basal ganglia in instrumental conditioning. Behav Brain Res. 2009 Apr;199(1):43–52.
OpenUrl CrossRef PubMed Web of Science
33.↵
Munhoz RP, Cerasa A, Okun MS. Surgical Treatment of Dyskinesia in Parkinsonâ€™s Disease. Front Neurol [Internet]. 2014 Apr 29 [cited 2019 Mar 8];5. Available from: http://journal.frontiersin.org/article/10.3389/fneur.2014.00065/abstract
34.
Brown P, Eusebio A. Paradoxes of functional neurosurgery: clues from basal ganglia recordings. Mov Disord Off J Mov Disord Soc. 2008 Jan;23(1):12–20; quiz 158.
OpenUrl
35.↵
Marsden CD, Obeso JA. The functions of the basal ganglia and the paradox of stereotaxic surgery in Parkinson’s disease. Brain J Neurol. 1994 Aug;117 (Pt 4):877–97.
OpenUrl CrossRef PubMed Web of Science
36.↵
Piron C, Kase D, Topalidou M, Goillandeau M, Orignac H, N’Guyen T-H, et al. The globus pallidus pars interna in goal-oriented and routine behaviors: Resolving a long-standing paradox: GPI, Goal-directed behaviors and habits. Mov Disord. 2016 Aug;31(8):1146–54.
OpenUrl CrossRef PubMed
37.↵
Topalidou M, Kase D, Boraud T, Rougier NP. A Computational Model of Dual Competition between the Basal Ganglia and the Cortex. eneuro. 2018;5(6):ENEURO.0339-17.2018.
38.↵
Kim T, Hamade KC, Todorov D, Barnett WH, Capps RA, Latash EM, et al. Reward Based Motor Adaptation Mediated by Basal Ganglia. Front Comput Neurosci [Internet]. 2017 Mar 31 [cited 2019 Mar 8];11. Available from: http://journal.frontiersin.org/article/10.3389/fncom.2017.00019/full
39.↵
Reynolds JNJ, Hyland BI, Wickens JR. A cellular mechanism of reward-related learning. Nature. 2001 Sep 6;413(6851):67–70.
OpenUrl CrossRef PubMed Web of Science
40.↵
Buonomano DV, Merzenich MM. CORTICAL PLASTICITY: From Synapses to Maps. Annu Rev Neurosci. 1998 Mar;21(1):149–86.
OpenUrl CrossRef PubMed Web of Science
41.↵
Wolters A, Sandbrink F, Schlottmann A, Kunesch E, Stefan K, Cohen LG, et al. A Temporally Asymmetric Hebbian Rule Governing Plasticity in the Human Motor Cortex. J Neurophysiol. 2003 May;89(5):2339–45.
OpenUrl CrossRef PubMed Web of Science
42.↵
Cohen JD, McClure SM, Yu AJ. Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philos Trans R Soc B Biol Sci. 2007 May 29;362(1481):933–42.
OpenUrl CrossRef PubMed
43.↵
Prescott TJ, Bryson JJ, Seth AK. Introduction. Modelling natural action selection. Philos Trans R Soc B Biol Sci. 2007 Sep 29;362(1485):1521–9.
OpenUrl CrossRef PubMed
44.↵
Hulse SH, Egeth H, Deese J. The psychology of learning. 5th ed. New York: McGraw-Hill; 1980. 478 p. (McGraw-Hill series in psychology).
45.↵
Izquierdo A, Wiedholz LM, Millstein RA, Yang RJ, Bussey TJ, Saksida LM, et al. Genetic and dopaminergic modulation of reversal learning in a touchscreen-based operant procedure for mice. Behav Brain Res. 2006 Aug 10;171(2):181–8.
OpenUrl CrossRef PubMed Web of Science
46.↵
Linden J, James AS, McDaniel C, Jentsch JD. Dopamine D2 Receptors in Dopaminergic Neurons Modulate Performance in a Reversal Learning Task in Mice. eNeuro. 2018 Feb;5(1).
47.↵
Bodi N, Keri S, Nagy H, Moustafa A, Myers CE, Daw N, et al. Reward-learning and the novelty-seeking personality: a between- and within-subjects study of the effects of dopamine agonists on young Parkinson’s patients. Brain. 2009 Sep 1;132(9):2385–95.
OpenUrl CrossRef PubMed Web of Science
48.↵
Cools R, Altamirano L, D’Esposito M. Reversal learning in Parkinson’s disease depends on medication status and outcome valence. Neuropsychologia. 2006 Jan;44(10):1663–73.
OpenUrl CrossRef PubMed Web of Science
49.↵
Galvan A, Devergnas A, Wichmann T. Alterations in neuronal activity in basal ganglia-thalamocortical circuits in the parkinsonian state. Front Neuroanat [Internet]. 2015 Feb 5 [cited 2019 Mar 11];9. Available from: http://journal.frontiersin.org/Article/10.3389/fnana.2015.00005/abstract
50.↵
Humphries MD, Obeso JA, Dreyer JK. Insights into Parkinson’s disease from computational models of the basal ganglia. J Neurol Neurosurg Psychiatry. 2018 Nov;89(11):1181–8.
OpenUrl Abstract/FREE Full Text
51.↵
van Albada SJ, Gray RT, Drysdale PM, Robinson PA. Mean-field modeling of the basal ganglia-thalamocortical system. II. J Theor Biol. 2009 Apr;257(4):664–88.
OpenUrl CrossRef PubMed Web of Science
52.↵
Soikkeli R, Partanen J, Soininen H, Pääkkönen A, Riekkinen P. Slowing of EEG in Parkinson’s disease. Electroencephalogr Clin Neurophysiol. 1991 Sep;79(3):159–65.
OpenUrl CrossRef PubMed Web of Science
53.↵
Cozac VV, Gschwandtner U, Hatz F, Hardmeier M, Rüegg S, Fuhr P. Quantitative EEG and Cognitive Decline in Parkinson’s Disease. Park Dis. 2016;2016:1–14.
OpenUrl
54.↵
Grahn JA, Parkinson JA, Owen AM. The role of the basal ganglia in learning and memory: Neuropsychological studies. Behav Brain Res. 2009 Apr;199(1):53–60.
OpenUrl CrossRef PubMed Web of Science
55.↵
Lawrence AD, Hodges JR, Rosser AE, Kershaw A, ffrench-Constant C, Rubinsztein DC, et al. Evidence for specific cognitive deficits in preclinical Huntington’s disease. Brain J Neurol. 1998 Jul;121 (Pt 7):1329–41.
OpenUrl CrossRef PubMed Web of Science
56.↵
Gutierrez-Garralda JM, Moreno-Briseño P, Boll M-C, Morgado-Valle C, Campos-Romo A, Diaz R, et al. The effect of Parkinson’s disease and Huntington’s disease on human visuomotor learning. Eur J Neurosci. 2013 Jun;n/a-n/a.
57.↵
Heindel WC, Butters N, Salmon DP. Impaired learning of a motor skill in patients with Huntington’s disease. Behav Neurosci. 1988 Feb;102(1):141–7.
OpenUrl CrossRef PubMed Web of Science
58.↵
Yin HH, Knowlton BJ, Balleine BW. Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur J Neurosci. 2004 Jan;19(1):181–9.
OpenUrl CrossRef PubMed Web of Science
59.↵
Wojtecki L, Groiss SJ, Hartmann CJ, Elben S, Omlor S, Schnitzler A, et al. Deep Brain Stimulation in Huntington’s Disease-Preliminary Evidence on Pathophysiology, Efficacy and Safety. Brain Sci. 2016 Aug 30;6(3).
60.↵
Koch C, Segev I, editors. Methods in neuronal modeling: from ions to networks. 2. ed., 3. printing. Cambridge, Mass.: MIT Press; 2001. 671 p. (Computational neuroscience).
61.↵
1. Graham LJ
Pavlides A, Hogan SJ, Bogacz R. Computational Models Describing Possible Mechanisms for Generation of Excessive Beta Oscillations in Parkinson’s Disease. Graham LJ, editor. PLOS Comput Biol. 2015 Dec 18;11(12):e1004609.
OpenUrl CrossRef PubMed
62.↵
Schultz W. Behavioral dopamine signals. Trends Neurosci. 2007 May;30(5):203–10.
OpenUrl CrossRef PubMed Web of Science
63.↵
Kalia LV, Lang AE. Parkinson’s disease. The Lancet. 2015 Aug;386(9996):896–912.
OpenUrl
64.↵
Schapira AHV. Neurobiology and treatment of Parkinson’s disease. Trends Pharmacol Sci. 2009 Jan;30(1):41–7.
OpenUrl CrossRef PubMed
65.↵
Kita H, Kita T. Role of Striatum in the Pause and Burst Generation in the Globus Pallidus of 6-OHDA-Treated Rats. Front Syst Neurosci [Internet]. 2011 [cited 2019 Mar 11];5. Available from: http://journal.frontiersin.org/article/10.3389/fnsys.2011.00042/abstract
66.↵
Mallet N, Ballion B, Le Moine C, Gonon F. Cortical inputs and GABA interneurons imbalance projection neurons in the striatum of parkinsonian rats. J Neurosci Off J Soc Neurosci. 2006 Apr 5;26(14):3875–84.
OpenUrl Abstract/FREE Full Text
67.↵
1. Carpenter MB,
2. Jayaraman A
Miller WC, DeLong MR. Altered Tonic Activity of Neurons in the Globus Pallidus and Subthalamic Nucleus in the Primate MPTP Model of Parkinsonism. In: Carpenter MB, Jayaraman A, editors. The Basal Ganglia II [Internet]. Boston, MA: Springer US; 1987 [cited 2019 Mar 11]. p. 415–27. Available from: http://link.springer.com/10.1007/978-1-4684-5347-8_29
68.
Pan HS, Walters JR. Unilateral lesion of the nigrostriatal pathway decreases the firing rate and alters the firing pattern of globus pallidus neurons in the rat. Synapse. 1988;2(6):650–6.
OpenUrl CrossRef PubMed Web of Science
69.↵
Sterio D, Beri? A, Dogali M, Fazzini E, Alfaro G, Devinsky O. Neurophysiological properties of pallidal neurons in Parkinson’s disease. Ann Neurol. 1994 May;35(5):586–91.
OpenUrl CrossRef PubMed Web of Science
70.↵
Doudet DJ, Gross C, Arluison M, Bioulac B. Modifications of precentral cortex discharge and EMG activity in monkeys with MPTP-induced lesions of DA nigral neurons. Exp Brain Res. 1990;80(1):177–88.
OpenUrl PubMed Web of Science
71.↵
Bergman H, Wichmann T, Karmon B, DeLong MR. The primate subthalamic nucleus. II. Neuronal activity in the MPTP model of parkinsonism. J Neurophysiol. 1994 Aug;72(2):507–20.
OpenUrl CrossRef PubMed Web of Science
72.↵
Hassani OK, Mouroux M, Féger J. Increased subthalamic neuronal activity after nigral dopaminergic lesion independent of disinhibition via the globus pallidus. Neuroscience. 1996 May;72(1):105–15.
OpenUrl CrossRef PubMed Web of Science
73.↵
Hutchison WD, Lozano AM, Davis KD, Saint-Cyr JA, Lang AE, Dostrovsky JO. Differential neuronal activity in segments of globus pallidus in Parkinson’s disease patients. Neuroreport. 1994 Jul 21;5(12):1533–7.
OpenUrl CrossRef PubMed Web of Science
74.↵
Wichmann T, Bergman H, Starr PA, Subramanian T, Watts RL, DeLong MR. Comparison of MPTP-induced changes in spontaneous neuronal discharge in the internal pallidal segment and in the substantia nigra pars reticulata in primates. Exp Brain Res. 1999 Apr;125(4):397–409.
OpenUrl CrossRef PubMed Web of Science
75.↵
Reiner A, Dragatsis I, Dietrich P. Genetics and Neuropathology of Huntington’s Disease. In: International Review of Neurobiology [Internet]. Elsevier; 2011 [cited 2019 Mar 11]. p. 325–72. Available from: https://linkinghub.elsevier.com/retrieve/pii/B9780123813282000146
76.↵
Sharp AH, Ross CA. Neurobiology of Huntington’s Disease. Neurobiol Dis. 1996 Feb;3(1):3–15.
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted April 23, 2019.

Download PDF

Citation Tools

Subject Area

Neuroscience

Subject Areas

All Articles

Animal Behavior and Cognition (5201)
Biochemistry (11718)
Bioengineering (8724)
Bioinformatics (29132)
Biophysics (14936)
Cancer Biology (12051)
Cell Biology (17360)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14146)
Epidemiology (2067)
Evolutionary Biology (18269)
Genetics (12223)
Genomics (16768)
Immunology (11844)
Microbiology (28016)
Molecular Biology (11560)
Neuroscience (60822)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10401)
Scientific Communication and Education (1680)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] 1.↵
Redgrave P, Prescott TJ, Gurney K. The basal ganglia: a vertebrate solution to the selection problem? Neuroscience. 1999 Apr;89(4):1009–23.
OpenUrl CrossRef PubMed Web of Science

[2] 2.↵
Nelson AB, Kreitzer AC. Reassessing models of basal ganglia function and dysfunction. Annu Rev Neurosci. 2014;37:117–35.
OpenUrl CrossRef PubMed

[3] 3.↵
Bhatia KP, Marsden CD. The behavioural and motor consequences of focal lesions of the basal ganglia in man. Brain. 1994;117(4):859–76.
OpenUrl CrossRef PubMed Web of Science

[4] 4.
DeLong MR. Primate models of movement disorders of basal ganglia origin. Trends Neurosci. 1990 Jul;13(7):281–5.
OpenUrl CrossRef PubMed Web of Science

[5] 5.↵
Denny-Brown. The basal ganglia and their relation to disorders of movement. Br J Surg. 1962 Jul;50(219):117–8.
OpenUrl

[6] 6.↵
Aron AR, Durston S, Eagle DM, Logan GD, Stinear CM, Stuphorn V. Converging Evidence for a Fronto-Basal-Ganglia Network for Inhibitory Control of Action and Cognition. J Neurosci. 2007 Oct 31;27(44):11860–4.
OpenUrl FREE Full Text

[7] 7.↵
Draganski B, Kherif F, Klöppel S, Cook PA, Alexander DC, Parker GJM, et al. Evidence for segregated and integrative connectivity patterns in the human Basal Ganglia. J Neurosci Off J Soc Neurosci. 2008 Jul 9;28(28):7143–52.
OpenUrl Abstract/FREE Full Text

[8] 8.↵
Redgrave P, Rodriguez M, Smith Y, Rodriguez-Oroz MC, Lehericy S, Bergman H, et al. Goal-directed and habitual control in the basal ganglia: implications for Parkinson’s disease. Nat Rev Neurosci. 2010 Nov;11(11):760–72.
OpenUrl CrossRef PubMed Web of Science

[9] 9.↵
Packard MG, Knowlton BJ. Learning and Memory Functions of the Basal Ganglia. Annu Rev Neurosci. 2002 Mar;25(1):563–93.
OpenUrl CrossRef PubMed Web of Science

[10] 10.
Foerde K, Shohamy D. The role of the basal ganglia in learning and memory: insight from Parkinson’s disease. Neurobiol Learn Mem. 2011 Nov;96(4):624–36.
OpenUrl CrossRef PubMed

[11] 11.↵
Lanciego JL, Luquin N, Obeso JA. Functional neuroanatomy of the basal ganglia. Cold Spring Harb Perspect Med. 2012 Dec 1;2(12):a009621.
OpenUrl Abstract/FREE Full Text

[12] 12.↵
Graybiel AM. Habits, Rituals, and the Evaluative Brain. Annu Rev Neurosci. 2008 Jul;31(1):359–87.
OpenUrl CrossRef PubMed Web of Science

[13] 13.↵
Maia TV, Frank MJ. From reinforcement learning models to psychiatric and neurological disorders. Nat Neurosci. 2011 Feb;14(2):154–62.
OpenUrl CrossRef PubMed Web of Science

[14] 14.↵
Keiflin R, Janak PH. Dopamine Prediction Errors in Reward Learning and Addiction: From Theory to Neural Circuitry. Neuron. 2015 Oct 21;88(2):247–63.
OpenUrl

[15] 15.↵
Takahashi Y, Schoenbaum G, Niv Y. Silencing the critics: understanding the effects of cocaine sensitization on dorsolateral and ventral striatum in the context of an actor/critic model. Front Neurosci. 2008 Jul;2(1):86–99.
OpenUrl CrossRef PubMed

[16] 16.↵
Frank MJ. Dynamic Dopamine Modulation in the Basal Ganglia: A Neurocomputational Account of Cognitive Deficits in Medicated and Nonmedicated Parkinsonism. J Cogn Neurosci. 2005 Jan;17(1):51–72.
OpenUrl CrossRef PubMed Web of Science

[17] 17.
Schonberg T, Daw ND, Joel D, O’Doherty JP. Reinforcement Learning Signals in the Human Striatum Distinguish Learners from Nonlearners during Reward-Based Decision Making. J Neurosci. 2007 Nov 21;27(47):12860–7.
OpenUrl Abstract/FREE Full Text

[18] 18.↵
Schultz W. Updating dopamine reward signals. Curr Opin Neurobiol. 2013 Apr;23(2):229–38.
OpenUrl CrossRef PubMed

[19] 19.↵
Saunders BT, Richard JM, Margolis EB, Janak PH. Dopamine neurons create Pavlovian conditioned stimuli with circuit-defined motivational properties. Nat Neurosci. 2018 Aug;21(8):1072–83.
OpenUrl CrossRef PubMed

[20] 20.↵
Keiflin R, Pribut HJ, Shah NB, Janak PH. Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions. Curr Biol CB. 2019 Jan 7;29(1):93–103.e3.
OpenUrl

[21] 21.↵
Surmeier DJ, Plotkin J, Shen W. Dopamine and synaptic plasticity in dorsal striatal circuits controlling action selection. Curr Opin Neurobiol. 2009 Dec;19(6):621–8.
OpenUrl CrossRef PubMed Web of Science

[22] 22.↵
Bamford NS, Wightman RM, Sulzer D. Dopamine’s Effects on Corticostriatal Synapses during Reward-Based Behaviors. Neuron. 2018 Feb 7;97(3):494–510.
OpenUrl

[23] 23.↵
Gerfen CR. The Neostriatal Mosaic: Multiple Levels of Compartmental Organization in the Basal Ganglia. Annu Rev Neurosci. 1992 Mar;15(1):285–320.
OpenUrl CrossRef PubMed Web of Science

[24] 24.
Surmeier DJ, Song WJ, Yan Z. Coordinated expression of dopamine receptors in neostriatal medium spiny neurons. J Neurosci Off J Soc Neurosci. 1996 Oct 15;16(20):6579–91.
OpenUrl Abstract/FREE Full Text

[25] 25.↵
Kreitzer AC, Malenka RC. Striatal Plasticity and Basal Ganglia Circuit Function. Neuron. 2008 Nov;60(4):543–54.
OpenUrl CrossRef PubMed Web of Science

[26] 26.↵
Mannella F, Baldassarre G. Selection of cortical dynamics for motor behaviour by the basal ganglia. Biol Cybern. 2015 Dec;109(6):575–95.
OpenUrl

[27] 27.
Alexander GE, Crutcher MD. Functional architecture of basal ganglia circuits: neural substrates of parallel processing. Trends Neurosci. 1990 Jul;13(7):266–71.
OpenUrl CrossRef PubMed Web of Science

[28] 28.↵
Gurney K, Prescott TJ, Redgrave P. A computational model of action selection in the basal ganglia. I. A new functional anatomy. Biol Cybern. 2001 May 11;84(6):401–10.
OpenUrl

[29] 29.↵
Cui G, Jun SB, Jin X, Pham MD, Vogel SS, Lovinger DM, et al. Concurrent activation of striatal direct and indirect pathways during action initiation. Nature. 2013 Feb 14;494(7436):238–42.
OpenUrl CrossRef PubMed Web of Science

[30] 30.
Markowitz JE, Gillis WF, Beron CC, Neufeld SQ, Robertson K, Bhagat ND, et al. The Striatum Organizes 3D Behavior via Moment-to-Moment Action Selection. Cell. 2018 28;174(1):44–58.e17.
OpenUrl CrossRef PubMed

[31] 31.↵
London TD, Licholai JA, Szczot I, Ali MA, LeBlanc KH, Fobbs WC, et al. Coordinated Ramping of Dorsal Striatal Pathways preceding Food Approach and Consumption. J Neurosci Off J Soc Neurosci. 2018 Apr 4;38(14):3547–58.
OpenUrl Abstract/FREE Full Text

[32] 32.↵
Balleine BW, Liljeholm M, Ostlund SB. The integrative function of the basal ganglia in instrumental conditioning. Behav Brain Res. 2009 Apr;199(1):43–52.
OpenUrl CrossRef PubMed Web of Science

[33] 33.↵
Munhoz RP, Cerasa A, Okun MS. Surgical Treatment of Dyskinesia in Parkinsonâ€™s Disease. Front Neurol [Internet]. 2014 Apr 29 [cited 2019 Mar 8];5. Available from: http://journal.frontiersin.org/article/10.3389/fneur.2014.00065/abstract

[34] 34.
Brown P, Eusebio A. Paradoxes of functional neurosurgery: clues from basal ganglia recordings. Mov Disord Off J Mov Disord Soc. 2008 Jan;23(1):12–20; quiz 158.
OpenUrl

[35] 35.↵
Marsden CD, Obeso JA. The functions of the basal ganglia and the paradox of stereotaxic surgery in Parkinson’s disease. Brain J Neurol. 1994 Aug;117 (Pt 4):877–97.
OpenUrl CrossRef PubMed Web of Science

[36] 36.↵
Piron C, Kase D, Topalidou M, Goillandeau M, Orignac H, N’Guyen T-H, et al. The globus pallidus pars interna in goal-oriented and routine behaviors: Resolving a long-standing paradox: GPI, Goal-directed behaviors and habits. Mov Disord. 2016 Aug;31(8):1146–54.
OpenUrl CrossRef PubMed

[37] 37.↵
Topalidou M, Kase D, Boraud T, Rougier NP. A Computational Model of Dual Competition between the Basal Ganglia and the Cortex. eneuro. 2018;5(6):ENEURO.0339-17.2018.

[38] 38.↵
Kim T, Hamade KC, Todorov D, Barnett WH, Capps RA, Latash EM, et al. Reward Based Motor Adaptation Mediated by Basal Ganglia. Front Comput Neurosci [Internet]. 2017 Mar 31 [cited 2019 Mar 8];11. Available from: http://journal.frontiersin.org/article/10.3389/fncom.2017.00019/full

[39] 39.↵
Reynolds JNJ, Hyland BI, Wickens JR. A cellular mechanism of reward-related learning. Nature. 2001 Sep 6;413(6851):67–70.
OpenUrl CrossRef PubMed Web of Science

[40] 40.↵
Buonomano DV, Merzenich MM. CORTICAL PLASTICITY: From Synapses to Maps. Annu Rev Neurosci. 1998 Mar;21(1):149–86.
OpenUrl CrossRef PubMed Web of Science

[41] 41.↵
Wolters A, Sandbrink F, Schlottmann A, Kunesch E, Stefan K, Cohen LG, et al. A Temporally Asymmetric Hebbian Rule Governing Plasticity in the Human Motor Cortex. J Neurophysiol. 2003 May;89(5):2339–45.
OpenUrl CrossRef PubMed Web of Science

[42] 42.↵
Cohen JD, McClure SM, Yu AJ. Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philos Trans R Soc B Biol Sci. 2007 May 29;362(1481):933–42.
OpenUrl CrossRef PubMed

[43] 43.↵
Prescott TJ, Bryson JJ, Seth AK. Introduction. Modelling natural action selection. Philos Trans R Soc B Biol Sci. 2007 Sep 29;362(1485):1521–9.
OpenUrl CrossRef PubMed

[44] 44.↵
Hulse SH, Egeth H, Deese J. The psychology of learning. 5th ed. New York: McGraw-Hill; 1980. 478 p. (McGraw-Hill series in psychology).

[45] 45.↵
Izquierdo A, Wiedholz LM, Millstein RA, Yang RJ, Bussey TJ, Saksida LM, et al. Genetic and dopaminergic modulation of reversal learning in a touchscreen-based operant procedure for mice. Behav Brain Res. 2006 Aug 10;171(2):181–8.
OpenUrl CrossRef PubMed Web of Science

[46] 46.↵
Linden J, James AS, McDaniel C, Jentsch JD. Dopamine D2 Receptors in Dopaminergic Neurons Modulate Performance in a Reversal Learning Task in Mice. eNeuro. 2018 Feb;5(1).

[47] 47.↵
Bodi N, Keri S, Nagy H, Moustafa A, Myers CE, Daw N, et al. Reward-learning and the novelty-seeking personality: a between- and within-subjects study of the effects of dopamine agonists on young Parkinson’s patients. Brain. 2009 Sep 1;132(9):2385–95.
OpenUrl CrossRef PubMed Web of Science

[48] 48.↵
Cools R, Altamirano L, D’Esposito M. Reversal learning in Parkinson’s disease depends on medication status and outcome valence. Neuropsychologia. 2006 Jan;44(10):1663–73.
OpenUrl CrossRef PubMed Web of Science

[49] 49.↵
Galvan A, Devergnas A, Wichmann T. Alterations in neuronal activity in basal ganglia-thalamocortical circuits in the parkinsonian state. Front Neuroanat [Internet]. 2015 Feb 5 [cited 2019 Mar 11];9. Available from: http://journal.frontiersin.org/Article/10.3389/fnana.2015.00005/abstract

[50] 50.↵
Humphries MD, Obeso JA, Dreyer JK. Insights into Parkinson’s disease from computational models of the basal ganglia. J Neurol Neurosurg Psychiatry. 2018 Nov;89(11):1181–8.
OpenUrl Abstract/FREE Full Text

[51] 51.↵
van Albada SJ, Gray RT, Drysdale PM, Robinson PA. Mean-field modeling of the basal ganglia-thalamocortical system. II. J Theor Biol. 2009 Apr;257(4):664–88.
OpenUrl CrossRef PubMed Web of Science

[52] 52.↵
Soikkeli R, Partanen J, Soininen H, Pääkkönen A, Riekkinen P. Slowing of EEG in Parkinson’s disease. Electroencephalogr Clin Neurophysiol. 1991 Sep;79(3):159–65.
OpenUrl CrossRef PubMed Web of Science

[53] 53.↵
Cozac VV, Gschwandtner U, Hatz F, Hardmeier M, Rüegg S, Fuhr P. Quantitative EEG and Cognitive Decline in Parkinson’s Disease. Park Dis. 2016;2016:1–14.
OpenUrl

[54] 54.↵
Grahn JA, Parkinson JA, Owen AM. The role of the basal ganglia in learning and memory: Neuropsychological studies. Behav Brain Res. 2009 Apr;199(1):53–60.
OpenUrl CrossRef PubMed Web of Science

[55] 55.↵
Lawrence AD, Hodges JR, Rosser AE, Kershaw A, ffrench-Constant C, Rubinsztein DC, et al. Evidence for specific cognitive deficits in preclinical Huntington’s disease. Brain J Neurol. 1998 Jul;121 (Pt 7):1329–41.
OpenUrl CrossRef PubMed Web of Science

[56] 56.↵
Gutierrez-Garralda JM, Moreno-Briseño P, Boll M-C, Morgado-Valle C, Campos-Romo A, Diaz R, et al. The effect of Parkinson’s disease and Huntington’s disease on human visuomotor learning. Eur J Neurosci. 2013 Jun;n/a-n/a.

[57] 57.↵
Heindel WC, Butters N, Salmon DP. Impaired learning of a motor skill in patients with Huntington’s disease. Behav Neurosci. 1988 Feb;102(1):141–7.
OpenUrl CrossRef PubMed Web of Science

[58] 58.↵
Yin HH, Knowlton BJ, Balleine BW. Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur J Neurosci. 2004 Jan;19(1):181–9.
OpenUrl CrossRef PubMed Web of Science

[59] 59.↵
Wojtecki L, Groiss SJ, Hartmann CJ, Elben S, Omlor S, Schnitzler A, et al. Deep Brain Stimulation in Huntington’s Disease-Preliminary Evidence on Pathophysiology, Efficacy and Safety. Brain Sci. 2016 Aug 30;6(3).

[60] 60.↵
Koch C, Segev I, editors. Methods in neuronal modeling: from ions to networks. 2. ed., 3. printing. Cambridge, Mass.: MIT Press; 2001. 671 p. (Computational neuroscience).

[61] 61.↵
Graham LJ
Pavlides A, Hogan SJ, Bogacz R. Computational Models Describing Possible Mechanisms for Generation of Excessive Beta Oscillations in Parkinson’s Disease. Graham LJ, editor. PLOS Comput Biol. 2015 Dec 18;11(12):e1004609.
OpenUrl CrossRef PubMed

[62] Graham LJ

[63] 62.↵
Schultz W. Behavioral dopamine signals. Trends Neurosci. 2007 May;30(5):203–10.
OpenUrl CrossRef PubMed Web of Science

[64] 63.↵
Kalia LV, Lang AE. Parkinson’s disease. The Lancet. 2015 Aug;386(9996):896–912.
OpenUrl

[65] 64.↵
Schapira AHV. Neurobiology and treatment of Parkinson’s disease. Trends Pharmacol Sci. 2009 Jan;30(1):41–7.
OpenUrl CrossRef PubMed

[66] 65.↵
Kita H, Kita T. Role of Striatum in the Pause and Burst Generation in the Globus Pallidus of 6-OHDA-Treated Rats. Front Syst Neurosci [Internet]. 2011 [cited 2019 Mar 11];5. Available from: http://journal.frontiersin.org/article/10.3389/fnsys.2011.00042/abstract

[67] 66.↵
Mallet N, Ballion B, Le Moine C, Gonon F. Cortical inputs and GABA interneurons imbalance projection neurons in the striatum of parkinsonian rats. J Neurosci Off J Soc Neurosci. 2006 Apr 5;26(14):3875–84.
OpenUrl Abstract/FREE Full Text

[68] 67.↵
Carpenter MB,
Jayaraman A
Miller WC, DeLong MR. Altered Tonic Activity of Neurons in the Globus Pallidus and Subthalamic Nucleus in the Primate MPTP Model of Parkinsonism. In: Carpenter MB, Jayaraman A, editors. The Basal Ganglia II [Internet]. Boston, MA: Springer US; 1987 [cited 2019 Mar 11]. p. 415–27. Available from: http://link.springer.com/10.1007/978-1-4684-5347-8_29

[69] Carpenter MB,

[70] Jayaraman A

[71] 68.
Pan HS, Walters JR. Unilateral lesion of the nigrostriatal pathway decreases the firing rate and alters the firing pattern of globus pallidus neurons in the rat. Synapse. 1988;2(6):650–6.
OpenUrl CrossRef PubMed Web of Science

[72] 69.↵
Sterio D, Beri? A, Dogali M, Fazzini E, Alfaro G, Devinsky O. Neurophysiological properties of pallidal neurons in Parkinson’s disease. Ann Neurol. 1994 May;35(5):586–91.
OpenUrl CrossRef PubMed Web of Science

[73] 70.↵
Doudet DJ, Gross C, Arluison M, Bioulac B. Modifications of precentral cortex discharge and EMG activity in monkeys with MPTP-induced lesions of DA nigral neurons. Exp Brain Res. 1990;80(1):177–88.
OpenUrl PubMed Web of Science

[74] 71.↵
Bergman H, Wichmann T, Karmon B, DeLong MR. The primate subthalamic nucleus. II. Neuronal activity in the MPTP model of parkinsonism. J Neurophysiol. 1994 Aug;72(2):507–20.
OpenUrl CrossRef PubMed Web of Science

[75] 72.↵
Hassani OK, Mouroux M, Féger J. Increased subthalamic neuronal activity after nigral dopaminergic lesion independent of disinhibition via the globus pallidus. Neuroscience. 1996 May;72(1):105–15.
OpenUrl CrossRef PubMed Web of Science

[76] 73.↵
Hutchison WD, Lozano AM, Davis KD, Saint-Cyr JA, Lang AE, Dostrovsky JO. Differential neuronal activity in segments of globus pallidus in Parkinson’s disease patients. Neuroreport. 1994 Jul 21;5(12):1533–7.
OpenUrl CrossRef PubMed Web of Science

[77] 74.↵
Wichmann T, Bergman H, Starr PA, Subramanian T, Watts RL, DeLong MR. Comparison of MPTP-induced changes in spontaneous neuronal discharge in the internal pallidal segment and in the substantia nigra pars reticulata in primates. Exp Brain Res. 1999 Apr;125(4):397–409.
OpenUrl CrossRef PubMed Web of Science

[78] 75.↵
Reiner A, Dragatsis I, Dietrich P. Genetics and Neuropathology of Huntington’s Disease. In: International Review of Neurobiology [Internet]. Elsevier; 2011 [cited 2019 Mar 11]. p. 325–72. Available from: https://linkinghub.elsevier.com/retrieve/pii/B9780123813282000146

[79] 76.↵
Sharp AH, Ross CA. Neurobiology of Huntington’s Disease. Neurobiol Dis. 1996 Feb;3(1):3–15.
OpenUrl CrossRef PubMed Web of Science

Basal Ganglia role in learning rewarded actions and executing previously learned choices: healthy and diseased states

Abstract

Introduction

Results

Healthy BG facilitates learning of rewarded choices

Mild Parkinsonian BG: impeded learning and spontaneous oscillations

Grade 2 Huntington’s Disease BG state: persistent exploratory behavior

Discussion

Materials and Methods

Structure of the basal ganglia

Behavioral task

Firing rate equations

Synaptic plasticity

Healthy BG state

Parkinsonian BG state

Huntington’s BG state

Numerical Simulations

References

Citation Manager Formats

Subject Area