A mathematical framework for inferring connectivity in probabilistic neuronal networks

doi:10.1016/j.mbs.2006.08.020

Mathematical Biosciences

Volume 205, Issue 2, February 2007, Pages 204-251

https://doi.org/10.1016/j.mbs.2006.08.020 Get rights and content

Abstract

We describe an approach for determining causal connections among nodes of a probabilistic network even when many nodes remain unobservable. The unobservable nodes introduce ambiguity into the estimate of the causal structure. However, in some experimental contexts, such as those commonly used in neuroscience, this ambiguity is present even without unobservable nodes. The analysis is presented in terms of a point process model of a neuronal network, though the approach can be generalized to other contexts. The analysis depends on the existence of a model that captures the relationship between nodal activity and a set of measurable external variables. The mathematical framework is sufficiently general to allow a large class of such models. The results are modestly robust to deviations from model assumptions, though additional validation methods are needed to assess the success of the results.

Introduction

In many cases where one wishes to infer the connectivity among nodes in a network, only a small fraction of the nodes can be sampled simultaneously. For example, neuroscientists can measure the individual activity of only tens, or possibly hundreds, of neurons simultaneously, despite the fact that the neural networks of even small areas of the brain contain millions of neurons. Similar situations exist when studying, for example, gene networks, communication networks, or social networks. The recurring feature is the presence of large numbers of unmeasured nodes. The effects of connections involving these unmeasured nodes can confound attempts to infer the “subnetwork” of connections among the measured nodes. For example, connections between a single unmeasured node and multiple measured nodes could cause the measured nodes to appear connected even if no direct connection among the measured nodes exists. The focus of this work is a mathematical framework that can, under suitable assumptions, provide a method to control for the effects of unmeasured nodes.

We restrict our attention to networks that can be represented as directed graphs (digraphs), where the direction of a connection indicates a causal connection from one node to another. We view each node as having some measurable “activity” (we will be more specific below). A causal connection means the activity in one node affects the future activity of another node (but not the reverse, unless there also happens to exist a reciprocal connection). We can hence refine our goal to inferring the causal subnetwork of connections among the measured nodes (see Fig. 1). We wish to identify the causal influences among measured nodes, distinguishing such causal network connections from those connections involving unmeasured nodes where no such causal influence is present. For example, in the case where a single unmeasured node has connections onto multiple measured nodes (a “common input” configuration, illustrated with dashed thick lines in Fig. 1), there is no causal connection among the measured nodes. Hence, to reconstruct the causal subnetwork among measured nodes, we must control for the possibility of common input connections from unmeasured nodes.

If measured node 1 has a causal connection onto measured node 2 (as in Fig. 1), one would expect the connection to introduce correlations into the activity of the nodes. The activity of node 2 would be correlated with a delayed version of the activity of node 1, where the delay corresponds to the time required for the influence of node 1’s activity to affect node 2. However, common input connections from an unmeasured node could induce similar correlation between the measured nodes’ activity. Consider the example from Fig. 1 where an unmeasured node has a connection onto measured nodes 3 and 4. If the connection onto node 4 has a shorter delay than the connection onto node 3, then the activity of node 3 will be correlated with a delayed version of the activity of node 4. From observing this correlation in the activity of nodes 3 and 4, one might naively and incorrectly conclude that node 3 received a causal connection from node 4.

This paper describes a method that can distinguish the causal subnetwork from common input connections, subject to a certain form of ambiguity in the identity of nodes that we call “subpopulation” ambiguity. We argue that, at least in a large number of neuroscience experiments, this subpopulation ambiguity is already present in how nodes are identified. Hence, even without the presence of unmeasured nodes, a determination of connectivity will be subject to this subpopulation ambiguity. When our analysis can be applied, the presence of unmeasured nodes adds little additional ambiguity to the determination of connectivity.

This research is motivated by neuroscience applications, and this initial formulation is designed to be immediately applicable to neuroscience experiments. For clarity and to emphasize that some elements of the analysis are specialized to neuroscience, we will primarily use the language of neuroscience, referring to the network nodes as neurons. Nonetheless, we believe the basic approach can be generalized to have wider application.

In Section 2, we detail the subpopulation ambiguity that is present in our results. In Section 3, we describe the model framework and the assumptions of the analysis. We present the analysis of the model in Section 4 and derive our estimates of causal network structure. We demonstrate the results applied to simulations in Section 5 and discuss the results in Section 6.

Section snippets

The subpopulation ambiguity

The definition of a subpopulation of neurons is based on the relationship between neural activity and any measured external variables. (A group of neurons whose activity has a similar relationship to the external variables will be considered part of the same subpopulation.) We first describe the external variables before discussing the subpopulation ambiguity.

The model network

Our results are based on a fairly generic class of probabilistic causal network models in discrete time. Since the measured activity of a neuron is the sequence of spike times, we model the activity as a point process. Rather than introduce standard point process notation (see, e.g., Refs. [1], [2]), we jump immediately to the formulation in discrete time, which is all we need for our analysis.

Initially, imagine that we have discretized time sufficiently finely so that a neuron can have at most

Analysis of model network

We seek to develop a method to determine the connectivity among measured neurons under the assumption that the activity of all neurons was generated according to the network model (3.3). The presence of unmeasured neurons will prevent us from completely succeeding, as our connectivity estimates will be subject to the subpopulation ambiguity discussed in Section 2.

We divide the set of all neural indices into two non-overlapping sets: $Q$ containing the indices of measured neurons and $P$ containing

Simulation results

To illustrate our approach, we simulated several small networks. We designated two neurons as measured neurons and recorded the spike times of only those two neurons. The remaining neurons were unmeasured, and we ignored their spike times in the analysis. From the spike times of the two measured neurons and the external variables, we attempted to determine the connectivity between the two measured neurons.

Discussion

We have developed a model-based modular approach to identifying causal connections in a neural network where many neurons remain unmeasured. The approach is modular because the analysis works with a large class of models (which determine the f (w, X; θ) of Eq. (3.3)). The network analysis can use models, and algorithms to determine their parameters, that have been developed independently. When a model captures the neurons’ behavior sufficiently well, we can distinguish causal connections from

References (20)

D.H. Perkel et al.
Neuronal spike trains and stochastic point processes. II. Simultaneous spike trains
Biophys. J.
(1967)
J.R. Rosenberg et al.
The Fourier approach to the identification of functional coupling between neuronal spike trains
Prog. Biophys. Mol. Biol.
(1989)
D.R. Cox et al.
Point Processes
(1980)
D.J. Daley et al.
An Introduction to the Theory of Point Processes
(1988)
D.Q. Nykamp
Revealing pairwise coupling in linear–nonlinear networks
SIAM J. Appl. Math.
(2005)
L. Paninski
Maximum likelihood estimation of cascade point-process neural encoding models
Network: Comput. Neural Syst.
(2004)
M. Galassi, J. Davies, J. Theiler, B. Gough, G. Jungman, M. Booth, F. Rossi, Gnu Scientific Library Reference Manual,...
S. Marcelja
Mathematical description of the responses of simple cortical cells
J. Opt. Soc. Am.
(1980)
E.H. Adelson et al.
Spatiotemporal energy models for the perception of motion
J. Opt. Soc. Am. A
(1985)
D.Q. Nykamp
Measuring linear and quadratic contributions to neuronal response
Network: Comput. Neural Syst.
(2003)

There are more references available in the full text version of this article.

Cited by (45)

A Simplified model of mutually inhibitory sleep-active and wake-active neuronal populations employing a noise-based switching mechanism
2016, Journal of Theoretical Biology
Citation Excerpt :
Furthermore, in this study I do not examine the effects of specific network architectures (rather than all-to-all or random connectivity) on bout behavior. The impact of network architecture on bouts in a two population system will be carried out using the mathematical and statistical theory developed in Zhao et al. (2011); Nykamp (2007, 2005) and Nykamp (2009) and presented elsewhere. The purpose of the current study is simply to elucidate the basic principles of noise-induced bout switching in a biophysical two population system and the relation of these principles to the behavior of sleep-active and wake-active populations during early infancy.
Infant rats switch randomly between the sleeping and waking states; during early infancy (up to postnatal day 8), sleep and wake bouts are random, brief (with means on the order of several seconds) and exponentially distributed, with the length of a particular bout independent of the length of prior bouts. As the rat ages during this early period, mean sleep and wake bout lengths gradually increase, though sleep and wake bouts remain exponentially distributed. Additionally, sleep and wake bouts are regulated independently of each other – alterations in the development of sleep (wake) bouts has no impact on the regulation wake (sleep) bouts. Sleep and wake bout behavior is associated with the activity of mutually inhibitory sleep-active and wake-active brainstem populations. In this work, I employ a simplified biophysical model of two mutually inhibitory populations consisting of ten integrate-and-fire neurons each and a noise-based switching mechanism. I show that such a noise-based switching mechanism naturally accounts for the experimentally observed features of sleep–wake switching during early infancy – random alternating activity bouts occur as a consequence of noise (provided inhibition is strong relative to excitation), bout durations are exponential (due to a lack of memory within the system), and cross-population inhibition or intrapopulation excitatory coupling provide mechanisms for changing and independently regulated sleep and wake bout means.
Visualizing whole-brain activity and development at the single-cell level using light-sheet microscopy
2015, Neuron
Citation Excerpt :
One conceptual advantage is that, by increasing coverage, large-scale imaging reduces the number of hidden variables in a system. For instance, when making inferences about causal connections in neural networks or other systems, the common input problem often appears: if activity in two nodes is strongly related, it can be hard to differentiate the causal connections between them from a common, hidden, input to both (Nykamp, 2007). However, the more neurons recorded from, ideally, the entire brain, the larger the chance that this common input is included in the data and possibly can be identified through, for example, machine learning and post hoc anatomical tracing methods.
The nature of nervous system function and development is inherently global, since all components eventually influence one another. Networks communicate through dense synaptic, electric, and modulatory connections and develop through concurrent growth and interlinking of their neurons, processes, glia, and blood vessels. These factors drive the development of techniques capable of imaging neural signaling, anatomy, and developmental processes at ever-larger scales. Here, we discuss the nature of questions benefitting from large-scale imaging techniques and introduce recent applications. We focus on emerging light-sheet microscopy approaches, which are well suited for live imaging of large systems with high spatiotemporal resolution and over long periods of time. We also discuss computational methods suitable for extracting biological information from the resulting system-level image data sets. Together with new tools for reporting and manipulating neuronal activity and gene expression, these techniques promise new insights into the large-scale function and development of neural systems.
Coupling time decoding and trajectory decoding using a target-included model in the motor cortex
2012, Neurocomputing
Citation Excerpt :
These methods aim to reconstruct continuous arm movements of human or non-human primates using measurements of observed neural activity. Commonly used decoding methods include various linear Gaussian models [19,24,34,12] and GLM (generalized linear model)-based models [22,33,21,31]. Other approaches include neural networks [25,13], nonparametric models [32], general-purpose filters [29], common-input (or hidden-state) models [16,4,36,18], and approximate methods for state-space models [8,15,23].
Significant progress has been made within the last decade in motor cortical decoding that predicts movement behaviors from population neuronal activity in the motor cortex. A majority of these decoding methods have focused on estimating a subject's hand trajectory in a continuous movement. We recently proposed a time identification decoding approach and showed that if a stereotyped movement is well represented by a sequence of targets (or landmarks), then the main structure of the movement can be reconstructed by detecting the reaching times at those targets. Both trajectory decoding and landmark-time decoding have their particular advantages, whereas a coupling of these two different strategies has not been examined. In this article we propose a synergy that comes from combining these two approaches for a stereotyped movement under a linear state-space framework. We develop a new decoding procedure based on a forward–backward propagation where the target is used in the initial stage in the backward step. Experimental results show that the new method significantly improves decoding accuracy over the non-target-included models. Furthermore, the coupling based on the new target-included method effectively combines the time decoding and trajectory decoding and further improves the decoding accuracy.
Population decoding of motor cortical activity using a generalized linear model with hidden states
2010, Journal of Neuroscience Methods
Citation Excerpt :
While all of these non-linear models have attractive theoretical and computational properties, they do not take into account other internal or external variables that may affect spiking activity, such as muscular activation, the subject’s level of attention, or other factors in the subject’s environment. Collectively, we call these unobserved (or unobservable) variables hidden states, or common inputs, using the terminology from Kulkarni and Paninski (2007a); see also Yu et al. (2006, 2009), Nykamp (2007) and Brockwell et al. (2007) for related discussion. Similarly, recent studies in the nonstationary relationship between neural activity and motor behaviors indicate that such non-stationarity may be accounted for by the fact that the spike trains also encode other states such as muscle fatigue, satiation, and decreased motivation (Carmena et al., 2005; Chestek et al., 2007).
Generalized linear models (GLMs) have been developed for modeling and decoding population neuronal spiking activity in the motor cortex. These models provide reasonable characterizations between neural activity and motor behavior. However, they lack a description of movement-related terms which are not observed directly in these experiments, such as muscular activation, the subject’s level of attention, and other internal or external states. Here we propose to include a multi-dimensional hidden state to address these states in a GLM framework where the spike count at each time is described as a function of the hand state (position, velocity, and acceleration), truncated spike history, and the hidden state. The model can be identified by an Expectation–Maximization algorithm. We tested this new method in two datasets where spikes were simultaneously recorded using a multi-electrode array in the primary motor cortex of two monkeys. It was found that this method significantly improves the model-fitting over the classical GLM, for hidden dimensions varying from 1 to 4. This method also provides more accurate decoding of hand state (reducing the mean square error by up to 29% in some cases), while retaining real-time computational efficiency. These improvements on representation and decoding over the classical GLM model suggest that this new approach could contribute as a useful tool to motor cortical decoding and prosthetic applications.
Statistically inferred neuronal connections in subsampled neural networks strongly correlate with spike train covariances
2024, Physical Review E
Circumstantial evidence and explanatory models for synapses in large-scale spike recordings
2023, arXiv

View all citing articles on Scopus

¹: This research was supported in part by the National Science Foundation Grant DMS-0415409.

View full text

A mathematical framework for inferring connectivity in probabilistic neuronal networks

Abstract

Introduction

Section snippets

The subpopulation ambiguity

The model network

Analysis of model network

Simulation results

Discussion

Biophys. J.

Prog. Biophys. Mol. Biol.

Point Processes

An Introduction to the Theory of Point Processes

Revealing pairwise coupling in linear–nonlinear networks

SIAM J. Appl. Math.

Maximum likelihood estimation of cascade point-process neural encoding models

Network: Comput. Neural Syst.

Mathematical description of the responses of simple cortical cells

J. Opt. Soc. Am.

Spatiotemporal energy models for the perception of motion

J. Opt. Soc. Am. A

Measuring linear and quadratic contributions to neuronal response

Network: Comput. Neural Syst.