Causal effects in microbiomes using interventional calculus

Sazal, Musfiqur; Stebliankin, Vitalii; Mathee, Kalai; Yoo, Changwon; Narasimhan, Giri

doi:10.1038/s41598-021-84905-3

Download PDF

Article
Open access
Published: 11 March 2021

Causal effects in microbiomes using interventional calculus

Musfiqur Sazal¹,
Vitalii Stebliankin¹,
Kalai Mathee^2,3,
Changwon Yoo⁴ &
…
Giri Narasimhan^1,3

Scientific Reports volume 11, Article number: 5724 (2021) Cite this article

3114 Accesses
8 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Causal inference in biomedical research allows us to shift the paradigm from investigating associational relationships to causal ones. Inferring causal relationships can help in understanding the inner workings of biological processes. Association patterns can be coincidental and may lead to wrong conclusions about causality in complex systems. Microbiomes are highly complex, diverse, and dynamic environments. Microbes are key players in human health and disease. Hence knowledge of critical causal relationships among the entities in a microbiome, and the impact of internal and external factors on microbial abundance and their interactions are essential for understanding disease mechanisms and making appropriate treatment recommendations. In this paper, we employ causal inference techniques to understand causal relationships between various entities in a microbiome, and to use the resulting causal network to make useful computations. We introduce a novel pipeline for microbiome analysis, which includes adding an outcome or “disease” variable, and then computing the causal network, referred to as a “disease network”, with the goal of identifying disease-relevant causal factors from the microbiome. Internventional techniques are then applied to the resulting network, allowing us to compute a measure called the causal effect of one or more microbial taxa on the outcome variable or the condition of interest. Finally, we propose a measure called causal influence that quantifies the total influence exerted by a microbial taxon on the rest of the microiome. Our pipeline is robust, sensitive, different from traditional approaches, and able to predict interventional effects without any controlled experiments. The pipeline can be used to identify potential eubiotic and dysbiotic microbial taxa in a microbiome. We validate our results using synthetic data sets and using results on real data sets that were previously published.

A host–microbiota interactome reveals extensive transkingdom connectivity

Article 20 March 2024

Nicole D. Sonnert, Connor E. Rosen, … Noah W. Palm

Genome-wide association studies

Article 26 August 2021

Emil Uffelmann, Qin Qin Huang, … Danielle Posthuma

Microbiota in health and diseases

Article Open access 23 April 2022

Kaijian Hou, Zhuo-Xun Wu, … Zhe-Sheng Chen

Introduction

This term microbiome refers to a microbial habitat, and includes the microorganisms (bacteria, archaea, microbial eurkaryotes, and viruses), their genomes, and the surrounding environmental conditions¹. The microbes in a microbiome are involved in complex, dynamic interactions among themselves as well as with the host environment. Balanced compositions and harmonious relationships in the microbiomes are associated with healthy environments. However, a dysbiosis (i.e., imbalance) can disrupt these relationships and are associated with human disease and environmental ills. A deeper understanding of microbial interactions within the microbiome is the overarching aim of our work. We hypothesize that many of the microbial relationships are a result of complex biological processes and are therefore causal in nature. While the etiology of a handful of infectious diseases can be traced back to a single species or strain of some pathogen, most diseases are complex and multifactorial. Uncovering causal relationships is thus an important first step toward understanding disease and also predicting the course of future treatments.

Great strides have also been made in causal inferencing from data. Starting from strong theoretical foundations, the notion of conditional independence², the theory of Bayesian networks^3,4, the notion of d-separation⁵, the development of efficient inferencing algorithms^6,7, and the development of the do-calculus^8,9, have made it possible to go from good experimental data sets to useful causal relationships and predictive capability for interventions.

With ground-breaking advancements in high-throughput sequencing technologies, it is now possible to examine microbial diversity in microbiomes with increased precision, and has led to a large number of research investigations on the associations between the microbiome and phenotypes such as obesity, neurological disorders, inflammation, immune disorders, metabolic diseases, and more^10,11,12,13. Interest in constructing causal networks for microbiomes is recent^14,15. Focused experiments in the laboratory to elicit causal relationships within microbiomes do exist^16,17, but do not employ computational causal inferencing approaches. Sazal et al. were among the first to construct causal networks for microbiomes¹⁸. They have showed that directed edges in causal networks inferred from metagenomics data using the R-based tool bnlearn¹⁹ are consistent with known colonization order²⁰. Kitsios et al. investigated data from 56 patients with bacterial pneumonia and constructed a network of relationships between microbial taxa and other clinical variables²¹. Although they used the web-based inferencing tool, CausalMGM²², to construct a probabilistic graphical model, their work falls short of doing causal inference and shows an undirected network of associational relationships for lung microbiomes. Mainali et al. used Granger causality to infer causality, but their work requires microbiome data from longitudinal studies²³. Literature on interventional studies of microbiomes are limited to laboratory experiments. Causal impact on the gut microbiome by nutrients²⁴ and diet^25,26 have been studied.

A significant advantage of constructing causal networks is that it allow us to study interventions, thus making it possible to measure the impact of a hypothetical action, i.e., the effect of “doing/intervening”. It helps us to answer interventional questions of the type: “if a person consumes a specific antibiotic, how will the abundance of taxon A in his/her gut change?” or “what is the expected abundance of B. longum if the relative abundance of C. difficile is fixed at 0.1?” We apply the interventional do-calculus designed by Pearl and others²⁷ to data from microbiome studies. In particular, we apply the techniques to reanalyze the extensive gut microbiome data available for Inflammatory Bowel Disease (IBD). Dysbiosis of the gut microbiome is associated with IBD, colorectal cancer, obesity, and much more. However, the relationships between microbial taxa are complex and the experiments required to understand the causal mechanisms are expensive and time-consuming, and therefore remain poorly understood. This work attempts to tease out some of these relationships.

While causal networks describe inferred causal relationships between entities, the question we ask is: How to quantify the causal effect of one entity on another in a microbiome? In this work, causal networks were constructed and intervention calculus was applied to the resulting network to estimate the pairwise causal effects of covariates on each other and on specific response (outcome) variables. A scoring method is proposed to measure causal effects between pairs of entities and the causal influence of individual entities on all others. By augmenting the data with disease information, we construct networks called disease networks, which were used to identify the taxa playing key roles in the healthy and disease states. The pairwise effects provides useful information on the magnitude of the interaction (direct or indirect) between two specific taxa. However, in the context of microbiomes, dysbiosis is a community phenomenon. A pathogen can impact the whole community, not just a specific taxon. The concept of causal influence is an attempt to quantify the contribution of a microbial taxon to the dysbiosis of the microbial community. Finally, the concept of disease networks provide a framework to quantify the causal effect of individual taxon on the disease variable (or a health outcome). In summary, the results presented here suggest a way to identify “dysbiotic” and “eubiotic” microbes in microbiomes.

The paper is organized as follows. Section “Algorithm” discusses the Algorithms used in this paper; the “Results” section includes the details about data, findings from the experiments, and validation; the “Discussion” section summarizes the conclusions from the analysis, the arguments about the hypothesis, future directions, and concluding remarks; the “Methods” section describes the problem formulation and how the algorithms are applied to reach conclusions; and finally the Data Availability section gives the source of the data used in this paper.

Algorithms

In this paper we primarily used three algorithms: a modified version of PC-stable for constructing causal networks²⁸, an interventional calculus method for computing causal effects²⁹, and an algorithm to identify Y-structures³⁰. All the abovementioned algorithms are discussed here briefly.

Causal structure

A Bayesian network (BN) is defined as a directed acyclic graph (DAG), $G = (V,E)$, where the n vertices from V represent n random variables from the set $X = \{X_1, \ldots , X_n\}$, each with its own probability distribution. Also, the m directed edges in E represent probabilistic relationships between the variables from X. If variables $X_i$ and $X_j$ are either marginally or conditionally independent, then the construction of the BN eliminates the edge between $X_i$ and $X_j$. Consequently, the DAG G can be associated with P, a joint probability distribution factorized as shown in Eq. (1)³¹.

$$\begin{aligned} P(\mathbf{V} ) = \prod _{j=1}^{n}P(X_i | Pa(X_i)). \end{aligned}$$

(1)

A causal structure or network (CN) is a BN with additional properties and interpretations. In a CN, an edge $X_i \rightarrow X_j$ means that $X_i$ was inferred to have a direct causal effect on $X_j$, while the lack of edge between $X_i$ and $X_j$ means that they are either marginally or conditionally independent. As with BNs, it is possible to compute any marginal probability on the variables in X in a CN. However, the edges of a CN clearly have great significance from a causal perspective.

In the causal structure, the DAG encodes conditional dependence and independence relationships in the edges and in the joint probability function P. We discuss three important local substructures within causal structures that impact the independence relationships, and these include: chains, forks, and v-structures as shown in Fig. 1. In a chain, X and Y are connected by a directed path through node Z. The important consequence of a chain is that if no other paths exist from X to Y, then the two variables X and Y are conditionally independent given the intermediate Z. Note that the above property would hold even if Z is a set of nodes that intercepts every chain from X to Y. In a fork, variable Z is a “common cause” for variables X and Y. An important consequence of a fork is that if there are no directed paths between X and Y, then they are independent conditional on Z. Again Z could be a set of nodes that commonly cause X and Y. Finally, set Z is a “collider” node between X and Y, if it is the “common-effect” forming a v-structure (also called inverted fork). An important consequence of the v-structure is that if X and Y are unconditionally independent, then they become dependent when conditioned on Z and the descendants of Z.

Several DAGs can encode the exact same joint probability function. These DAGs are called Markov equivalent networks. Such DAGs form a Markov equivalence class and can be uniquely represented by a CPDAG, with the same skeleton and v-structures. CPDAGs allow both directed ($\rightarrow$) and undirected (−) edges. CPDAGs are specialized causal structures, but allow interventional calculus to measure causal effects.

Construction of causal networks

To construct the CNs²⁸, the network constructed by PC-stable algorithm was enhanced by incorporating correlational patterns (sign of correlation coefficients) on the edges and that help us to interpret the results biologically. The main steps of PC-stable algorithm we used to construct causal networks are as follows.

In Step 1, the algorithm starts with a complete undirected graph and then performs a series of conditional independence tests to eliminate as many edges as possible. The remaining undirected graph is referred to as the skeleton.

Step 2 is key to inferring a causal structure, and uses the concept of v-structures, which are defined as follows. For any three nodes representing variables $X_i,X_j,X_k$ in a skeleton S, if $\{X_i,X_j\}$ and $\{X_j, X_k\}$ are edges in S, but $\{X_i, X_k\}$ is not, and if edges are oriented as $X_i \rightarrow X_j \leftarrow X_k$ then the triple $(X_i,X_j,X_k)$ is called a v-structure. Triples satisfying the v-structure property can be identified in the skeletons using conditional dependency tests, following which edges are appropriately directed to form a v-structure. The variable $X_j$ in the triple forming the v-structure represents a “common effect” of $X_i$ and $X_k$. These v-structures are critical in assigning directions to some of the edges of the skeleton.

In Step 3, three rules²⁸ are applied repeatedly to orient edges not already in v-structures.

Rule 1::: Orient $X_j - X_k$ as $X_j \rightarrow X_k$ whenever (a) there is a directed edge $X_i \rightarrow X_j$ and (b) $X_i$ and $X_k$ are not adjacent.
Rule 2::: Orient $X_j - X_k$ as $X_j \rightarrow X_k$ whenever there is a chain $X_j \rightarrow X_i \rightarrow X_k$.
Rule 3::: Orient $X_j - X_k$ as $X_j \rightarrow X_k$ whenever there are two chains $X_j - X_i \rightarrow X_k$ and $X_j - X_l \rightarrow X_k$ given that $X_i$ and $X_l$ are not adjacent.

Intervention

In the context of causality there are two types of data: observational and interventional. Observational data arise from observational experiments, not to be confused with randomized controlled experiments. On the other hand, interventional data are recorded after perturbations using external agents. Interventional queries can be answered using interventional data (also called experimental data), where some variables in the system are set/held to a fixed value by an external agent. However, interventional data can only answer queries when the variables are set to the specific value used in the experiment. A general need is to answer queries when the variables are set to arbitrary values for which experiments were not carried out. The challenge is to infer causal relationships, infer the result of arbitrary interventions, and to infer the magnitudes of causal relationships only from observational data. A mutilation operation at a node X in a DAG is obtained by deleting all incoming edges into X. A mutilated network with respect to node X in a DAG is derived from the original network by performing a mutilation operation at X. Figure 2 shows a network (left) and the mutilated network (middle) obtained by a mutilation operation at node X.

Intervention is expressed using Pearl’s $\mathbf {do()}$ operator³². $P(Y| \mathbf {do (X=a)})$ denotes the distribution of Y if the value of X is set to a. The post-interventional densities are expresses using the formula in Eq. (2)³³.

$$\begin{aligned} P(\mathbf {V} | \mathbf {d o(x)})=\left\{ \begin{array}{ll}\prod _{V_{i} \in \mathbf {V} \backslash \mathbf {X}} P\left( V_{i} | Pa\left( V_{i}\right) \right) , &{} \text{ if } \mathbf {X}=\mathbf {x} \\ 0, &{} \text{ otherwise } \end{array}\right. \end{aligned}$$

(2)

Note that a controlled experiment can potentially answer interventional questions, but may be either prohibitively expensive, impossible to execute, or unethical to perform. Causal calculus allows us to answer such interventional questions using purely in silico methods. We clarify that data collected from research studies (e.g., a microbiome study) are considered as observational data, and not the result of controlled interventions, which require that variables be artificially held at specific values. Conditional expectation is given by $E[Y|X=x]$, while interventional expectation is given by $E[Y|\mathbf {do}(X=x)]$, which is the expectation of Y if every sample in the population had variable X fixed at value x³⁴. Observational probability P(y|x) is thus different from interventional probability $P(y|\mathbf {do}(x))$. Observational distribution P(Y|x) describes the distribution of Y given that the observed value of variable X is x. On the other hand, interventional distribution of Y is the distribution if we set the variable X of all samples to take value x, while other variables are held unchanged³⁵. To achieve $\mathbf {do}(X = a)$, we delete all incoming edges to node X, fix its value at a, and then perform the necessary computations on the resulting network (Figure 2 shows an original network and corresponding mutilated network).

Interventional calculus

A causal model has both probabilistic and causal interpretations. From a probabilistic perspective, as mentioned earlier, each variable $X_i \in \mathbf{X}$, is independent of all its non-descendants when conditioned on its parents, $Pa(X_i)$, a condition called the Markov condition. From a causal perspective, a directed edge $(X_i, X_j)$ in G represents a direct causal impact exerted by $X_i$ on $X_j$³⁶. The left side of Eq. (2) is the post-interventional distribution of G, while the right side is the pre-interventional distribution from the mutilated graph, $G_m$. To study the magnitude of the causal effect of $X_i$ on $X_n$, where $i \ne n$, we make $X_n$ the outcome variable and apply standard computations. The distribution of $X_n$ after an intervention $\mathbf {do} (X_i = x_i)$ can be estimated by integrating over all variables corresponding to $Pa(X_i)$. Assume that $X_i$ has at least one parent, i.e., $Pa(X_i) \ne \emptyset$, and that $X_n \notin Pa(X_i)$. Note that if $X_n \in Pa(X_i)$, then $P(X_n | \mathbf {do})X_i=x_i)) = P(X_n)$ because the causal network G is acyclic. Thus, if $X_n \notin Pa(X_i)$, then

$$\begin{aligned} P(X_n | \mathbf {do}(X_i=x_i)) = \idotsint _{Pa(X_i)} P (X_n | X_i = x_i, Pa(X_i)) P({Pa(X_i)}) d({Pa(X_i)}), \end{aligned}$$

(3)

where $P(Pa(X_i))$ is the joint distribution of the parents of $X_i$, and the integral is over all possible values that can be taken by the parents of $X_i$. Taking expectation on both sides, and assuming $X_n \notin Pa(X_i)$, gives us the following:

$$\begin{aligned} \mathbb {E}[X_n | \mathbf {do}(X_{i}=x_i)] = {\displaystyle \idotsint _{Y_i} \mathbb {E}(X_n | x_i, Y_j) P(Y_j) d(Y_j),} \end{aligned}$$

(4)

where $Pa(X_i) = \{Y_1, \ldots , Y_p\}$.

Causal effect and causal influence

The magnitude of causal effect of $X_i$ on $X_n$, upon the action $\mathbf {do}(X_i = x_i)$ is denoted by $C(X_i, X_n)$ and is given by:

$$\begin{aligned} C(X_i, X_n)=\frac{\partial }{\partial x} \mathbb {E}[X_n | \mathbf {do}(X_i=x_i)]. \end{aligned}$$

(5)

If we assume that the joint distribution of n random variables $X_1, \ldots , X_n$ (as expressed in Eq. 1) is Gaussian/normal, then the causal effect values of $X_i$ on $X_n$ as described in Eq. (5) can be computed using linear regression because the normality implies that $\mathbb {E}(X_n| Pa(X_i), X_i=x)$ is linear in $x_i$ and $Y_j \in Pa(X_i), j = 1, \ldots , p$, as shown below²⁷:

$$\begin{aligned} E\left( X_n |Y_1,\ldots ,Y_p,x_i \right) =\alpha +\gamma x_i+\sum _{j=1}^p \beta _{j}^{T} Y_j, \end{aligned}$$

(6)

for some values $\alpha , \gamma \in \mathbb {R}$ and $\beta \in \mathbb {R}^p$ represents a vector of regression coefficients of the parents of $X_i$. Thus, as shown in²⁹, the magnitude of causal effect of $X_i$ on $X_j$ is given by:

$$\begin{aligned} C(X_i, X_n)= \gamma , \end{aligned}$$

(7)

where $\gamma$ is as dictated by Eq. (6). Note that,the linear regression model is only applied in the quantification step, which comes after the structure learning step, i.e., after the structure of the DAG or partial DAG representing the causal structures (qualitative relations) are inferred. At that time, the regression is only applied to connect the distribution of a random variable with that of its immediate parents in the causal structure. Thus, by the time, regression is applied, the nodes/variable involved in the relationships are already inferred.

The notion of the quantity, causal effect, defined above is a pairwise measure of how much one variable causally impacts another. Here we define another quantity called the causal influence of a node in a causal network, defined as the sum total of absolute value of the causal effect it exerts on every other node. Let $T = \{B_1, B_2, \ldots , B_n\}$ be the set of nodes representing random variables.

Equation (5) gives the causal effect of $B_i$ on $B_j$. The causal influence of node $B_i$ is given by the quantity:

$$\begin{aligned} CI(B_i) = \sum _{j \ne i} |C(B_i, B_j)|. \end{aligned}$$

(8)

Since causal effect values can take negative values as well, the formula for causal influence involves the sum of the absolute values. This prevents individual causal effect values of highly influential nodes from canceling each other out. To avoid confusion, we note that the definition of causal influence is the sum of the causal effect of the taxon on every other taxon, regardless of whether the corresponding nodes have a direct causal link or not. This ensures that we also attribute to the causal influence of a node, all effects that it might have indirectly.

Y-structures

A v-structure over variables X, Y, Z is shown in Fig. 1. There are two directed edges $X \rightarrow Z$, $Y \rightarrow Z$ and there is no edge between X and Y. However, the v-structure is not enough for discovering that variables X or Y causes Z without the assumption that the structure is causally sufficient.

The concept of Y-structures is an extension of the concept of v-structures. As shown in Fig. 2 (right), a Y-structure contains four nodes ($\{W, X, Y, Z\}$), with 3 of the 4 vertices forming a V-structure ($\{X, Y, Z\}$). If there is an edge from Z, the center of the V-structure, to the node W, and if there are no edges from X to W or from Y to W, then the nodes X, Y, Z, W form a Y-structure in the causal network. We will refer to the edge directed from Z to W as the Y-leg. Theoretically, we know that if a Y-structure is learned from data, the Y-leg represents an unconfounded causal relationship³⁰, making the Y-leg edges valuable for biological interpretations.

Results

Synthetic data

Since networks with known causal relationships are not readily available, we first performed experiments with synthetically generated data sets. We generated random networks with variable number of nodes ($n = 9, 17, 26, 35$) with different number of edges. For each random network (ground truth), we generated $m = 1000$ samples, and then attempted to see (a) if the network that generated the data could be recovered using our inferencing tools, and (b) if the causal influence values match the values computed from the ground truth network. The procedure for generating the synthetic networks and the corresponding data set is as follows. We generated random DAGs with predefined number of nodes and edges using pcalg package²⁹. Finally we generated a specified number of (random) samples from the synthetically generated DAG using a logic sampling algorithm³⁷.

The summary statistics of inferred networks from the synthetic data are shown in Table 1. We report precision, recall, F-1 score, and accuracy. The true positive (TP) rate is defined as the number of correctly inferred directed edges in the inferred network with respect to the true network. This above performance metrics were averaged over 100 experiments. A false positive (FP) rate is defined as the number of directed edges not present in the true network, but present in the inferred network. False negative (FN) rate is defined as the number of directed edges present in the true network, but not in the inferred network.

Table 1 Network configuration (number of nodes, directed edges), the number of directed edges in each synthetic true network, precision, recall, F-1 score, and accuracy.

Full size table

For each case, we learned the causal network and computed the causal effects between every pair of nodes. We also computed the deviation of estimated effects from the true effects, measured as $true - estimated$. Similarly we computed the relative deviation of estimated effects from the true effects, measured as $(true - estimated)/true$. The distribution of deviation and relative deviation values are shown in Fig. 3 as violin plots.

Besides computing the pairwise causal effects we also calculated causal influence from the synthetic data. Figure 4 summarizes the comparison of influence values between true and inferred networks. Figure 4 compares the true causal influence values (computed from the ground truth CN) with the causal inference values from the inferred network. The figure shows that the causal influence values are reasonably close to the true values. This is seen by the difference between the bars in the bar charts. More importantly, it shows that even though the true values are sometimes different from the true values, the ordering of the nodes sorted by decreasing causal influence values is very close to the true values. In order to support a statement on the rankings, we applied Spearman correlation and showed that the correlation coefficients are high, thus showing that the sorted order of the two lists are remarkably consistent.

Real data set

We constructed the causal networks from the data sets mentioned in Table 2 obtained from the IHMP study³⁹. In each of the resulting causal structures, nodes represent random variables for one of two things—relative abundance of taxa, and disease status. In the visualized networks, the size of each node is proportional to the average value of that variable in the cohort. The color of each node represents the phylum to which the corresponding taxon belongs. Taxa from the the same phylum have the same color. Firmicutes taxa are colored with cyan, Bacteroidetes are colored blue, Proteobacteria are colored green, and Verrucomicrobia are colored purple.

Table 2 Three real data sets were used in this study.

Full size table

Edges represent the belief of direct causal relationships as inferred by the PC-stable algorithm. More importantly, the absence of an edge suggests that there is no direct causal relationship, although indirect relationships may exist. The color of the edges represents the sign of the correlation between the abundance vectors of the taxa represented by the nodes (green color stands for positive correlations, and red color for negative correlations). The transparency of each edge represents the confidence value for the predicted edge, computed by its bootstrap value. For each network, we estimated the confidence value of the predicted edges by computing the bootstrap value from 200 repetitions. An inferred causal structure may contain undirected edges if the data are not enough to support an edge orientation. Those undirected edges remain causally “uninterpretable”.

To quantify the statistical significance of the overall resulting causal structures we computed the maximum likelihood and log-likelihood scores of the networks we constructed. To obtain this measure for our networks, we randomly permuted the values in each row and created networks $N = 1000$ times and each time we calculated the log-likelihood. The fraction of the networks generated by random permutations whose likelihood is higher than that obtained for the predicted network is the p value or reported statistical significance value. The p values of the networks we used for analyses were less than 0.05.

Our experiments with the real data sets involved first inferring a causal network from the data and then computing all pairwise causal effect values. We created causal networks from the UC, CD, and non-IBD data sets separately using the PC-stable algorithm. Outcome causal networks (also called disease causal networks or simply disease networks) were also created by augmenting the data sets with a disease variable, corresponding to the categorical variable representing the disease status of the individual. Note that if disease severity were available for the subjects then this variable could also be continuous. Finally, we applied intervention techniques to measure causal effects and causal influence of each taxon.

UC data set

The causal network that resulted from the UC data set is shown in Fig. 5. Also we showed the causal network inferred from healthy cohorts in the supplementary (Figure S1).

The causal graph shown may be intuitive but is not easy to interpret precisely. In contrast, the intervention technique provides quantitative information that may lend itself more easily to interpretation. Thus, after creating causal networks, we computed causal effect values for all pairs of nodes, and causal influence values for all nodes. The distribution of pairwise causal effect values in the UC causal network is shown in Fig. 6. To identify the strongest pairwise causal relationships, we selected the top 15% (shown in green rectangle) and the bottom 15% (shown in red rectangle) to zoom in for further inspection.

Next, we computed the causal influence measures for each microbial taxon (i.e., sum of absolute values of causal effects on every other variable). We then ranked the taxa as shown in Fig. 7a,b with the expectation that this list would highlight the most influential taxa in health or disease.

It also made sense to inspect the change in causal influence in going from the healthy cohort to the diseased cohort. If we denote $CI_H(i)$ and $CI_{UC}(i)$ to be causal influence of variable i in the causal network constructed from the healthy cohort and UC cohort, respectively, then $CI_H(i) - CI_{UC}(A)$ represents the change in influence for taxon A. Figure 7c shows the ten taxa with the highest change in causal influence. Green bars indicate higher causal influence values in healthy samples, while red bars indicate higher values in UC samples, suggesting that the taxa representing the green bars on the left of the chart are potentially eubiotic, while the taxa representing the red bars on the right of the chart play a dysbiotic role in subjects with UC.

Influence subnetworks in the causal network from UC data

Based on the causal influence values computed above, the top five taxa from the UC cohort were R. torques (RUMTO), R. inulinivorans (ROSIN), S. wadsworthensis (SUTWA), B. xylanisolvens (BACXY), P. distasonis (PARDI). We discuss our methodology to analyze their influence in greater detail. We include the detailed analyses of the sub-networks associated with R. torques (RUMTO) and B. xylanisolvens. Other subnetworks are discussed in the supplementary section.

We start with the most influential taxon, R. torques, labeled RUMTO in the UC network shown in Fig. 5. R. torques is a well known pathogenic taxon for UC. In the UC network, it has five outgoing directed edges connecting to B. dorei, E. eligens, P. copri, E. rectale, and D. invisus. Additionally, a total of 19 taxa (out of 35) are reachable by a directed path from R. torques. Further discussion on the analysis of the subnetworks can be found in the “Discussion” section. Similar discussion on the impact of B. xylanisolvens (labeled BACXY), another key player in UC pathogenesis can be found in the “Discussion” section.

To further investigate the fidelity of the causal network we dive deeper into some edges where mediator variables or metabolic data are available. As mentioned earlier E. eligens potentially interacts with F. prausnitzii via the metabolite, Acetate. When we included the concentration of Acetate from the associated metabolomics data into the analysis, the resulting network shows E. eligens to be independent of F. prausnitzii conditioned on Acetate concentration (see Fig. 8a). An investigation into the link from B. xylanisolvens to B. vulgatus in the UC causal network shows a similar behavior. Since both are known to be consumers of a metabolite named d-fructose, we created a causal network by including the concentration of d-fructose in the causal inferencing. As in the above example, B. xylanisolvens is independent of B. vulgatus when conditioned on d-fructose concentration (shown in Fig. 8b).

Disease networks

Disease networks were created by combining data sets from one or more diseases (often including a data set from a healthy cohort) and producing networks with an additional node representing the outcome or disease status. For example in a disease network involving UC and healthy data sets, each sample from the UC cohort would have its disease variable set to 1 (0 for healthy samples).

Finally, we measured the causal effect of each taxon on the special disease node and sorted the list by their absolute value as shown in Table 3 for the UC disease network. We also reported the p-value of those pairwise effects of those from the bootstrapping with 100 repetition and same number of sample size.

Table 3 Sorted list of taxa (descending order) based on causal effects on ulcerative colitis “disease” node.

Full size table

.

When we queried the published literature on this topic, we discovered that barring two, all the taxa listed in Table 3 were known to be either potentially pathogenic or beneficial, again supporting the claim that our approach helps to identify pathogenic and beneficial bacteria in healthy and diseased patients. Note that, we do not find enough evidence about the beneficial behavior or pathogenicity of A. onderdonkii, B. intestinihominis and their entries are marked with ? sign.

Y-structure validation

In the causal network inferred from UC data set we have 18 Y-structures. We focus on the Y-leg edges from these Y-structures. Our experiments showed that the bootstrap values for the Y-leg edges with 100 repetitions for a given sample size ranged from 0.49 to 0.98. The mean, median, and standard deviation of bootstrap values were 0.68, 0.62, and 0.17, respectively. Theoretically, we know that the Y-leg edges cannot be confounded and bootstrap values also show high confidence on those edges. In other words, the endpoints of a Y-leg cannot have a “common cause”.

Sensitivity analysis

For sensitivity analysis, we investigated the stability of the computed causal networks with perturbations in the input data. Ideally small changes in the input data should produce small or no changes in the model. The data sets were modified repeatedly as follows. For a randomly chosen sample, we generated a new random sample with same mean and standard deviation as the chosen sample. In this manner, we added $1\%, 2\%, 3\%, \ldots$ new samples to the data set until we the resulting input caused a significant change in the network structure. A significant change in the network is defined as the deletion of any edge from the original network with bootstrap value more than 0.50. For UC network, the first structural change to the network occurred after adding $7\%$ artificially vreated samples to the data set.

Similarly, randomly chosen samples from the input were deleted. Again, the first significant change in the network occurred after the deletion of $6\%$ of the samples. Thus, the computed causal networks are robust to an average of $6.5\%$ perturbations of samples. Similarly, the Disease networks are robust to perturbation of $8\%$ of the samples. One possible reason of less sensitivity of Disease network is that, Disease network is learned from larger number of data samples in comparison to UC network.

We also conducted a “substitution” experiment, which randomly perturbs the data across samples, neither deleting nor inserting rows in the data matrix. Again we started from $1\%$ and continue until we spotted at least one significant change. From the perturbation we found that the networks are more sensitive than deleting or adding samples. For UC network we encounter significant changes after randomly perturbing $4\%$ of samples and for the Disease network we noticed significant changes at $5\%$ of data perturbation. One of the possible reasons for the increase in the sensitivity is that, when we perturb data across samples, the relative abundance values are no longer coming from the same distribution and that makes the network more unstable.

Discussion

Experiments with synthetically generated data sets (Fig. 4) shows that even though there are differences between true and inferred influence values, the relative ranking for most values remain consistent with that of true values. These experiments suggest that causal inference is a promising approach to analyzing microbiome data, especially when it comes to the identification of potentially dysbiotic or eubiotic microbes.

In the UC network (Fig. 5), E. coli and S. unclassified are isolated. It is known that E. coli is part of normal gut flora and evidence suggests that it is not playing a harmful role in the IBD gut⁴¹. The bacterial taxa D. invisus, E. eligens, S. wadsworthensis, R. inulinivorans, A. muciniphila are at the top of the network and have no incoming edges, suggesting that they exert an influence on most, if not all, of the descendant taxa in the lower part of the network. The highly abundant taxon F. prausnitzii from the Proteobacteria phylum has several incoming and outgoing edges, many colored red, suggesting that it has a strong negative influence on its descendant bacterial taxa and that its ancestors also impact it negatively.

The distribution of pairwise causal effect values in the UC causal network (see Fig. 6) is normally distributed with a peak at 0, suggesting that most pairwise causal effects are relatively small. The top 30% of the pairwise causal effects involve bacteria including R. torques, F. prausnitzii, S. wadsworthensis, B. xylanisolvens, B. uniformis, P. copri, all of which are known to be key players in UC pathogenesis.

Analysis of the data from non-IBD subjects (see supplementary Figure S1) shows the bacterial taxa B. xylanisolvens, E. eligens, B. finegoldii, A. muciniphila and some species of Oscillobacter to have the highest causal influence on the remaining taxa. These claims are supported in the literature, which show them to play a eubiotic role^42,43,44,45. Analysis of the data from the diseased state (UC) shows that the taxa R. torques, B. massiliensis, P. distasonis, and D. invisus are the most influential. Again, the published literature supports the above claims by suggesting that these are potentially pathogenic^46,47,48,49. Thus, we conclude that our methods allow us to identify potentially eubiotic and dysbiotic bacteria in cohorts of microbiome samples.

The subnetwork rooted at R. torques in the UC causal network (Fig. 5 shows a total of 19 taxa reachable from R. torques. Published work has suggested that D. invisus (DIAIN), a direct child of R. torques, is also associated with IBD⁵⁰. Evidence also suggests that R. torques has an impact on pectin-modulated bacteria such as P. copri (PRECO)⁵¹. R. torques is also connected to F. prausnitzii (FAEPR) via E. eligens (EUBEL). It has been shown that E. eligens is a producer of acetate, which in turn is consumed by F. prausnitzii.

Causal influence values have already suggested that B. xylanisolvens (labeled BACXY) is a key player in UC. The analysis of the subnetwork rooted at B. xylanisolvens, which reaches 16 other taxa, is done in the context of metabolic networks from previously published literature. B. xylanisolvens is a producer of cellobiose, which may be consumed by B. uniformis (BACUN)^52,53. B. xylanisolvens and P. merdae (PARME) both consume d-glucose^52,54, making them potential competitors for glucose. This may explain the negatively correlated causal connection from B. xylanisolvens to P. merdae. B. xylanisolvens and B. vulgatus (BACVU) are both consumers of d-fructose^52,55, making them potential competitors, although no evidence of competition is found in the network.

The analysis performed by selective addition of metabolite concentrations from associated metabolomic data (available from IHMP) was shown in Fig. 8a,b. This targeted analysis strongly suggests a role for the intermediate metabolites in the interaction between the pair of bacterial taxa mentioned. The claim is supported by the published literature on acetate and butyrate. After reaching the gut, carbohydrates resistant to digestion (commonly derived from dietary fibers) are degraded by gut microbiota to produce monosaccharides. These monosaccharides can be utilized by some bacteria including E. eligens in the gut to produce short-chain fatty acids such as acetate, butyrate, and propionate⁵⁶. Faecalibacterium prausnitzii is a commonly known acetate consuming bacteria, it consumes acetate and produce various fatty acid including butyrate by utilizing glucose⁵⁷. Interestingly, under in vitro conditions it was confirmed that the growth of F. prausnitzii is strongly stimulated in the presence of acetate⁵⁸. B.xylanisolvens produce by-products such as acetate, succinate, and propionate. These fatty acids are the by-procducts of xylose and sugar fermentation. B. xylanisolvens is able to produce acid from many sugars such as glucose, mannitol, sucrose, glyercol, fructose, galactose, and melibiose^43,52. Similarly, Faecalibacterium prausnitzii produces butyrate, formate, and lactate using fructose, oligofructose, and inulin⁵⁹. Also, from the controlled experiment it is evident that treatment with fructans led to an increase of F. prausnitzii⁶⁰. Due to the scarcity of data and knowledge-bases, many edges cannot be verified via metabolic networks. However, from the evidence it is understandable that metabolites play a huge role in the causal relationships in microbiomes.

Bacterial taxa that play an important role in the causal networks of healthy cohorts, but play a less influential role in the networks for disease cohorts are inferred as playing a eubiotic role within the microbiome. For example, B. xylanisolvens, E. eligens, B. finegoldii, A. muciniphila have the largest reduction in their causal influence values between the healthy and the diseased cohorts (Fig. 7) and their beneficial roles are confirmed by the literature^{43,52,61,62,63,64}. Bacterial taxa that play an important role in the causal networks inferred from both healthy and disease cohorts are also of interest, since they can be inferred as being important in healthy microbiomes, but likely changing their roles during dysbiosis, perhaps by an introduction of a pathogenic strain or by triggering one of its virulence factors. For example, the known pathogen R. torques has a reduction in its causal influence value between the healthy and diseased cohorts (Fig. 7)⁴⁶.

The Disease networks are a novel way of combining the information from the UC and healthy cohorts. The first obvious difference between the network for only UC data (Fig. 5) and the disease network for UC using a combination of UC and healthy data (Fig. 9) is the number of edges—the disease network has more edges than the network without the disease node. It is unclear why more dependencies between the taxa appear in the presence of disease node. One possible explanation is that due to the greater diversity in the samples, which now contains two very different cohorts, there are more dependencies among the variables. Unlike network from only UC data, there are no isolated nodes in the disease network.

More detailed analysis of the UC disease network revealed additional useful information. The taxa, S. wadsworthensis (SUTWA) and B. xylanisolvens (BACXY) are among the most influential bacteria based on causal effect values (on the special disease node) as shown in Table 3. Both taxa are directly connected by an edge to the disease node and have no other directed paths leading to the disease node. The taxon, E. eligens (EUBEL), a known beneficial bacterial taxon, has a directed edge to disease and directed paths to some other key players such as B. xylanisolvens (BACXY) and S. wadsworthensis (SUTWA) shown in the supplementary (see Figure S2). We investigated one of the outgoing edges to F. prausnitzii and we found from the existing knowledge-bases that both E. eligens and F. prausnitzii are associated with the metabolite Pectin and the metabolic activity “macromolecular degradation”^65,66. R. torques (RUMTO) is a known pathogenic taxon and has a directed path to the special disease node. R. torques is also connected to R. inulinivorans (ROSIN1) by an edge. Interestingly R. torques is an acetate producer and R. inulinivorans is an acetate consumer^67,68, suggesting a possible mode of causal interaction between the two taxa. Oscillibacter is considered an important beneficial taxon, and in the UC disease network it is directly connected to the disease node. It also has multiple paths to disease node via other known beneficial taxa S. wadsworthensis and B. xylanisolvens, suggesting other unknown modes of interaction contributing to disease.

The analysis of Y-structures identified 18 Y-leg edges. Based on information from the existing knowledge-bases, we discuss the biological significance of the Y-leg edge from Bacteroides fragilis to Roseburia intestinalis. It has been shown that Bacteroides fragilis is responsible for producing the metabolite, acetate, which accounts for 30–54% of the total products by bacteria⁶⁹. Furthermore, acetate is efficiently utilized by certain groups of anaerobic bacteria particularly by butyrate-producing species including Roseburia intestinalis⁷⁰. While we can never categorically prove that a Y-leg edge is not counfounded by any hidden factor, we may be able to explain why the edge is significant. The Y-leg edge from B. dorie to Parabacteroides distasonis is potentially significant because of the intermediate metabolite, Xylan^54,65.

The methods described in this paper have also been applied to the Crohn’s disease data set. Results can be found in the Supplemental section. We included the causal network inferred from the data collected from the CD cohort (Supplementary Figure S3) and the causal effects and causal influence values computed from the network.

We discuss a few limitations of the work presented here. The first limitation is that the work presented here assumes that there are no hidden confounders, when in reality we cannot rule out their existence. Minimizing the effects of hidden confounders or measuring unbiased effects in the presence of hidden confounders remains a challenging research direction. Second, the causal influence notion allows us to study the influence of one taxon on the disease node. Future work needs to also consider how groups of taxa influence disease. More generally, future work needs to consider how groups of taxa influence or impact other groups of taxa. A third major limitation is that of compositionality, which is caused by the use of relative abundance values instead of raw abundance values in our analyses. Relative abundance is an attempt to normalize sequencing depth in different samples, but introduce compositionality and the ensuing correlations into the analysis. The log-ratio transform and the hierarchical multinomial-logit models provide two approaches to address compositionality^71,72. Unfortunately, the log-ratio method is known to harm the variance strtucture in the data, while the second approach remains to be strongly validated. Finally, future work entails limited laboratory verifications of some of the microbial interactions, especially those involving metabolites.

In summary, this paper takes us one step closer to understanding complex systems such as microbiomes in a causal way. It helps us to shed light on interactions between microbial taxa and the role of metabolites. It provides the framework to include other omics data and understand complex relationships and processes in microbiomes in a quantitative way with the use of interventional calculus. They also make it possible to elucidate biological processes by drawing inferences on the role of intermediaries such as metabolites, genes, and environmental factors. The resulting causal networks are statistically significant, robust, and sensitive. We hypothesize that our approach can lead to a better understanding of the efficacy of probiotics and prebiotics.

Methods

The first step in inferring causality is to learn the causal relationships, which entails discovering the structure of the network of relationships. The next step is to use the structure to infer the causal effects, i.e., the magnitude of the strength of causal relationships. Note that the causal network allows us to infer causal effect values even if the nodes are not directly connected by an edge. However, the nodes involved must be connected by a path in order for the causal effect value to be non-zero. The pipeline for causal inference is as follows.

Infer causal networks $\rightarrow$ Apply interventional calculus to compute causal effects $\rightarrow$ Compute causal influence values.

Problem formulation

To investigate causal relationships in microbiomes, we consider causal networks with nodes corresponding to random variables of interest. The simplest causal network for microbiomes would have nodes representing the relative abundance of every detected microbial taxon, and the edges would represent the causal relationships between the taxa suggesting the direction and magnitude of interactions taking place between the taxa. We will also discuss disease networks, a special causal network that has one extra node representing an outcome variable such as the disease status or severity. The edges would either represent the causal relationships between the taxa or between a taxon and the outcome node, highlighting the taxa that are believed to have a direct impact on the outcome along with direction and magnitude of that interaction. More complex microbiome data sets may have nodes representing measurements of different omics entities such as the expression of genes, concentration of metabolites, amount of proteins, methylation data, and more. Additional nodes could also represent host or environmental variables arising from host transcriptome data, host mutational data, host phenotypic data, host clinical data, host medication information, or other environmental conditions that may be measured for the microbiome. An additional level of complexity can be introduced by considering temporal data from longitudinal microbiome studies, which will introduce time-dependant variables of interest. Once a causal network is constructed, interventional calculus can be applied to the resulting network. Used basic probabilistic inference techniques as described by Barber⁷³, it is possible to determine the magnitude of the causal impact of one variable of interest on one or more variables of interest.

The goal of the work reported here is to construct causal networks from microbiome data sets, to compute causal effects between all pairs of entities, and to interpret the biological significance of these computations. The causal effects are determined by the regression coefficients under normality assumption. Thus, the magnitude as well as the sign of the causal effect values can be interpreted biologically. The causal network and the resulting computations help us to: (a) identify the key players (most influential taxa) in a microbiome under healthy and disease status, (b) compute the causal effects of individual taxa on the disease outcome. For the first problem, we compute the most influential node, which is defined to be the node with the highest CI value, where CI is as given in Eq. (8). The CI values also help us to compare the impact of the different microbial taxa on the disease node, allowing us to put them in sorted order of influence. In a second problem, we explore causal effects of taxa on the outcome or disease node, or vice versa. In general, while the dysbiosis of microbiomes have been strongly associated with disease, it is not known if the dysbiosis is the cause or effect (or both) of the disease. Thus, our techniques allow us to identify taxa most significantly linked to disease or health.

Data

We worked on both real and simulated data sets. The synthetic data was generated following the logic sampling algorithm³⁷. It takes as input three positive integers, n, m and d. It outputs a “true” causal network G and a synthetic data set stored as a matrix of size $m \times n$, representing m samples each with n features or variables of interest that describe the sample. After successfully generating the synthetic data using the above algorithm, we have a ground truth causal network model (including “true” network and the “true” regression functions at each vertex) and data generated using such a network model.

As summarized in Table 2, we analyzed the IBD gut microbiome data set by comparing cohorts A and B. The IBD data set were from the Integrative Human Microbiome Project (iHMP)³⁹, and includes data from subjects with Crohn’s Disease (CD), ulcerative colitis (UC), and a cohort of non-IBD (i.e., healthy) subjects that were used as controls.

Experiments

For each data set (synthetic and real), we generated a causal structure by applying the PC-stable algorithm²⁸, after which we computed (a) the causal effect values between every pair of microbial taxa, and (b) the causal influence of each microbial taxon, i.e., the sum total of the (absolute values of) causal effect on all other taxa. For the IBD data set, we also computed the changes in causal influence of taxa between diseased and healthy (non-IBD) samples for iHMP data. To quantify the causal relationships we applied intervention technique that used linear regression model by ordinary least squares method (under normality assumption. We used the coefficients as a measure of the causal effects²⁷.

We used the processed data for only the bacterial abundance information downloaded directly from iHMP website³⁹. The relative abundance matrix was used to generate the causal graphs and then used to estimate the causal effects. The relative abundance is computed by normalizing each raw count with the total number of reads in a sample. In the IBD data set, which included a healthy cohort and diseased cohorts, we also analyzed the data sets by combining the cohorts, but augmenting the causal network with an extra outcome node named disease representing the (binary) disease variable. If the severity of the disease were provided, then this node could represent a continuous random variable. This process is called context embedding, which is important for causal inference because in different contexts, the same event can be interpreted differently. For the healthy state, the value of disease variable was set to 0, and for the disease state its value was set to 1. We computed the causal effect of all taxa on the disease variable. Note that, in general, while the association may be well established, we do not know if the microbiome composition is the cause or the effect of the disease.

Data availability

Data used for this study are publicly available by “NIH Integrative Human Microbiome Project”. We used inflamatory bowel disease (IBD) data that includes both Crohn’s Disease (CD) and ulcerative colitis (UC) from “The Inflammatory Bowel Disease Multi’omics Database”. Data repository and download instructions are available at: https://ibdmdb.org/tunnel/public/summary.html. We downloaded taxonomic_profiles.tsv.gz file from metagenomes data type and HMP2_metabolomics.csv.gz file from metabolites data type for further processing and analysis.

References

Marchesi, J. R. & Ravel, J. The vocabulary of microbiome research: A proposal. Microbiome 3, 20 (2015).
Article Google Scholar
Shah, R. D. et al. The hardness of conditional independence testing and the generalised covariance measure. Ann. Stat. 48, 1514–1538 (2020).
Article MathSciNet MATH Google Scholar
Charniak, E. Bayesian networks without tears. AI Mag. 12, 50–50 (1991).
Google Scholar
Nielsen, T. D. & Jensen, F. V. Bayesian Networks and Decision Graphs (Springer, 2009).
MATH Google Scholar
Hayduk, L. et al. Pearl’s d-separation: One more step into causal thinking. Struct. Equ. Model. 10, 289–311 (2003).
Article MathSciNet Google Scholar
Minka, T. P. A family of algorithms for approximate Bayesian inference. Ph.D. thesis, Massachusetts Institute of Technology (2001).
Tzikas, D. G., Likas, A. C. & Galatsanos, N. P. The variational approximation for Bayesian inference. IEEE Signal Process. Mag. 25, 131–146 (2008).
Article ADS Google Scholar
Tucci, R. R. Introduction to Judea Pearl’s do-calculus. arXiv:1305.5506 (arXiv preprint) (2013).
Pearl, J. The do-calculus revisited. arXiv:1210.4852 (arXiv preprint) (2012).
John, G. K. & Mullin, G. E. The gut microbiome and obesity. Curr. Oncol. Rep. 18, 45 (2016).
Article PubMed CAS Google Scholar
Li, Q., Han, Y., Dy, A. B. C. & Hagerman, R. J. The gut microbiota and autism spectrum disorders. Front. Cell. Neurosci. 11, 120 (2017).
Article PubMed PubMed Central CAS Google Scholar
Honda, K. & Littman, D. R. The microbiome in infectious disease and inflammation. Annu. Rev. Immunol. 30, 759–795 (2012).
Article CAS PubMed PubMed Central Google Scholar
Aarts, E. et al. Gut microbiome in adhd and its relation to neural reward anticipation. PLoS One 12, 20 (2017).
Article CAS Google Scholar
Bourrat, P. Have causal claims about the gut microbiome been over-hyped?. BioEssays 40, 1800178 (2018).
Article Google Scholar
Fischbach, M. A. Microbiome: Focus on causation and mechanism. Cell 174, 785–790 (2018).
Article CAS PubMed PubMed Central Google Scholar
Sanna, S. et al. Causal relationships among the gut microbiome, short-chain fatty acids and metabolic diseases. Nat. Genet. 1, 20 (2019).
Google Scholar
Ramakrishnan, V. R. & Frank, D. N. Microbiome in patients with upper airway disease: Moving from taxonomic findings to mechanisms and causality. J. Allergy Clin. Immunol. 142, 73–75 (2018).
Article PubMed PubMed Central Google Scholar
Sazal, M. R., Ruiz-Perez, D., Cickovski, T. & Narasimhan, G. Inferring relationships in microbiomes from signed Bayesian networks. In 2018 IEEE 8th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), 1–1 (IEEE, 2018).
Scutari, M. Learning Bayesian networks with the bnlearn R package. arXiv:0908.3817 (arXiv preprint) (2009).
Sazal, M., Mathee, K., Ruiz-Perez, D., Cickovski, T. & Narasimhan, G. Inferring directional relationships in microbial communities using signed Bayesian networks. BMC Genom. 21, 1–11 (2020).
Article Google Scholar
Kitsios, G. D. et al. Respiratory microbiome profiling for etiologic diagnosis of pneumonia in mechanically ventilated patients. Front. Microbiol. 9, 1413 (2018).
Article PubMed PubMed Central Google Scholar
Ge, X., Raghu, V. K., Chrysanthis, P. K. & Benos, P. V. CausalMGM: An interactive web-based causal discovery tool. Nucleic Acids Res. 20, 20 (2020).
Google Scholar
Mainali, K., Bewick, S., Vecchio-Pagan, B., Karig, D. & Fagan, W. F. Detecting interaction networks in the human microbiome with conditional Granger causality. PLoS Comput. Biol. 15, e1007037 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Lam, Y. Y., Zhang, C. & Zhao, L. Causality in dietary interventions-building a case for gut microbiota. Genome Med. 10, 62 (2018).
Article PubMed PubMed Central Google Scholar
De Filippis, F., Vitaglione, P., Cuomo, R., BerniCanani, R. & Ercolini, D. Dietary interventions to modulate the gut microbiome—how far away are we from precision medicine. Inflamm. Bowel Dis. 24, 2142–2154 (2018).
Article PubMed Google Scholar
Leeming, E. R., Johnson, A. J., Spector, T. D. & Le Roy, C. I. Effect of diet on the gut microbiota: Rethinking intervention duration. Nutrients 11, 2862 (2019).
Article PubMed Central Google Scholar
Pearl, J., Glymour, M. & Jewell, N. P. Causal Inference in Statistics: A Primer (Wiley, 2016).
MATH Google Scholar
Colombo, D. & Maathuis, M. H. Order-independent constraint-based causal structure learning. J. Mach. Learn. Res. 15, 3741–3782 (2014).
MathSciNet MATH Google Scholar
Kalisch, M. et al. Causal inference using graphical models with the R package pcalg. J. Stat. Softw. 47, 1–26 (2012).
Article Google Scholar
Mani, S., Spirtes, P. L. & Cooper, G. F. A theoretical study of Y structures for causal discovery. arXiv:1206.6853 (arXiv preprint) (2012).
Scutari, M. Bayesian network constraint-based structure learning algorithms: Parallel and optimised implementations in the bnlearn R package. arXiv:1406.7648 (arXiv preprint) (2014).
Pearl, J. et al. Causal inference in statistics: An overview. Stat. Surv. 3, 96–146 (2009).
Article MathSciNet MATH Google Scholar
Henckel, L., Perković, E. & Maathuis, M. H. Graphical criteria for efficient total effect estimation via adjustment in causal linear models. arXiv:1907.02435 (arXiv preprint) (2019).
Pearl, J. A linear “microscope’’ for interventions and counterfactuals. J. Causal Inference 5, 20 (2017).
Article Google Scholar
Huszár, F. Ml beyond curve fitting: An intro to causal inference and do-calculus. https://www.inference.vc/untitled/ (2018). Accessed 12 Jun 2020.
Lauritzen, S. L. Causal inference from graphical models. Complex Stoch. Syst. 20, 63–107 (2001).
MathSciNet MATH Google Scholar
Korb, K. B. & Nicholson, A. E. Bayesian Artificial Intelligence (CRC Press, 2010).
Book MATH Google Scholar
Kassambara, A. & Kassambara, M. A. R package ggpubr (2020).
NIH Integrative Human Microbiome Project (iHMP). https://www.hmpdacc.org/ihmp/ (2014). Accessed 12 Jun 2020.
Shannon, P. et al. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar
Singh, V. et al. Interplay between enterobactin, myeloperoxidase and lipocalin 2 regulates E. coli survival in the inflamed gut. Nat. Commun. 6, 7113 (2015).
Article ADS CAS PubMed Google Scholar
Man, S. M., Kaakoush, N. O. & Mitchell, H. M. The role of bacteria and pattern-recognition receptors in Crohn’s disease. Nat. Rev. Gastroenterol. Hepatol. 8, 152 (2011).
Article PubMed Google Scholar
Ulsemer, P., Toutounian, K., Schmidt, J., Karsten, U. & Goletz, S. Preliminary safety evaluation of a new Bacteroides xylanisolvens isolate. Appl. Environ. Microbiol. 78, 528–535 (2012).
Article CAS PubMed PubMed Central Google Scholar
Moore, W. & Moore, L. H. Intestinal floras of populations that have a high risk of colon cancer. Appl. Environ. Microbiol. 61, 3202–3207 (1995).
Article CAS PubMed PubMed Central Google Scholar
Woloszynek, S. et al. Engineering human microbiota: Influencing cellular and community dynamics for therapeutic applications. In International Review of Cell and Molecular Biology Vol. 324 67–124 (Elsevier, 2016).
Google Scholar
Matsuoka, K. & Kanai, T. The gut microbiota and inflammatory bowel disease. In Seminars in Immunopathology Vol. 37 47–55 (Springer, 2015).
Google Scholar
Lucke, K., Miehlke, S., Jacobs, E. & Schuppler, M. Prevalence of Bacteroides and Prevotella spp. in ulcerative colitis. J. Med. Microbiol. 55, 617–624 (2006).
Article CAS PubMed Google Scholar
Wang, K. et al. Parabacteroides distasonis alleviates obesity and metabolic dysfunctions via production of succinate and secondary bile acids. Cell Rep. 26, 222–235 (2019).
Article CAS PubMed Google Scholar
Morio, F. et al. Antimicrobial susceptibilities and clinical sources of dialister species. Antimicrob. Agents Chemother. 51, 4498–4501 (2007).
Article CAS PubMed PubMed Central Google Scholar
Adamberg, K. et al. Levan enhances associated growth of Bacteroides, Escherichia, Streptococcus and Faecalibacterium in fecal microbiota. PLoS One 10, e0144042 (2015).
Article PubMed PubMed Central CAS Google Scholar
Larsen, N. et al. Potential of pectins to beneficially modulate the gut microbiota depends on their structural properties. Front. Microbiol. 10, 223 (2019).
Article PubMed PubMed Central Google Scholar
Chassard, C., Delmas, E., Lawson, P. A. & Bernalier-Donadille, A. Bacteroides xylanisolvens sp. nov., a xylan-degrading bacterium isolated from human faeces. Int. J. Syst. Evol. Microbiol. 58, 1008–1013 (2008).
Article CAS PubMed Google Scholar
McNulty, N. P. et al. The impact of a consortium of fermented milk strains on the gut microbiome of gnotobiotic mice and monozygotic twins. Sci. Transl. Med. 3, 106ra106 (2011).
Article PubMed PubMed Central CAS Google Scholar
Sakamoto, M. & Benno, Y. Reclassification of Bacteroides distasonis, Bacteroides goldsteinii and Bacteroides merdae as Parabacteroides distasonis gen. nov., comb. nov., Parabacteroides goldsteinii comb. nov. and Parabacteroides merdae comb. nov.. Int. J. Syst. Evol. Microbiol. 56, 1599–1605 (2006).
Article CAS PubMed Google Scholar
Sonnenburg, E. D. et al. Specificity of polysaccharide use in intestinal bacteroides species determines diet-induced microbiota alterations. Cell 141, 1241–1252 (2010).
Article CAS PubMed PubMed Central Google Scholar
Mukherjee, A., Lordan, C., Ross, R. P. & Cotter, P. D. Gut microbes from the phylogenetically diverse genus Eubacterium and their various contributions to gut health. Gut Microbes 12, 1802866 (2020).
Article PubMed PubMed Central CAS Google Scholar
Khan, M. T. et al. The gut anaerobe Faecalibacterium prausnitzii uses an extracellular electron shuttle to grow at oxic-anoxic interphases. ISME J. 6, 1578–1585 (2012).
Article CAS PubMed PubMed Central Google Scholar
Heinken, A. et al. Functional metabolic map of Faecalibacterium prausnitzii, a beneficial human gut microbe. J. Bacteriol. 196, 3289–3302 (2014).
Article PubMed PubMed Central CAS Google Scholar
Moens, F., Rivière, A., Selak, M. & De Vuyst, L. Inulin-type fructan degradation capacity of interesting butyrate-producing colon bacteria and cross-feeding interactions of Faecalibacterium prausnitzii DSM 17677 T with bifidobacteria. Arch. Public Health 72, 1 (2014).
Article Google Scholar
Verhoog, S. et al. Dietary factors and modulation of bacteria strains of Akkermansia muciniphila and Faecalibacterium prausnitzii: A systematic review. Nutrients 11, 1565 (2019).
Article CAS PubMed Central Google Scholar
Chung, W. S. F. et al. Modulation of the human gut microbiota by dietary fibres occurs at the species level. BMC Biol. 14, 1–13 (2016).
Article CAS Google Scholar
Zitomersky, N. L. et al. Characterization of adherent bacteroidales from intestinal biopsies of children and young adults with inflammatory bowel disease. PLoS One 8, e63686 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Wexler, H. M. Bacteroides: The good, the bad, and the nitty-gritty. Clin. Microbiol. Rev. 20, 593–621 (2007).
Article CAS PubMed PubMed Central Google Scholar
Zhang, T., Li, Q., Cheng, L., Buch, H. & Zhang, F. Akkermansia muciniphila is a promising probiotic. Microb. Biotechnol. 12, 1109–1125 (2019).
Article PubMed PubMed Central Google Scholar
Flint, H. J., Scott, K. P., Duncan, S. H., Louis, P. & Forano, E. Microbial degradation of complex carbohydrates in the gut. Gut Microbes 3, 289–306 (2012).
Article PubMed PubMed Central Google Scholar
Salyers, A., West, S., Vercellotti, J. & Wilkins, T. Fermentation of mucins and plant polysaccharides by anaerobic bacteria from the human colon. Appl. Environ. Microbiol. 34, 529–533 (1977).
Article CAS PubMed PubMed Central Google Scholar
Flint, H. J., Duncan, S. H., Scott, K. P. & Louis, P. Interactions and competition within the microbial community of the human colon: Links between diet and health. Environ. Microbiol. 9, 1101–1111 (2007).
Article CAS PubMed Google Scholar
Hojo, K. et al. Reduction of vitamin K concentration by salivary Bifidobacterium strains and their possible nutritional competition with Porphyromonas gingivalis. J. Appl. Microbiol. 103, 1969–1974 (2007).
Article CAS PubMed Google Scholar
Rios-Covian, D., Salazar, N., Gueimonde, M. & de los Reyes-Gavilan, C. G. Shaping the metabolism of intestinal Bacteroides population through diet to improve human health. Front. Microbiol. 8, 376 (2017).
Article PubMed PubMed Central Google Scholar
Chassard, C. & Bernalier-Donadille, A. H2 and acetate transfers during xylan fermentation between a butyrate-producing xylanolytic species and hydrogenotrophic microorganisms from the human gut. FEMS Microbiol. Lett. 254, 116–122 (2006).
Article CAS PubMed Google Scholar
Morton, J. T. et al. Establishing microbial composition measurement standards with reference frames. Nat. Commun. 10, 1–11 (2019).
Article CAS Google Scholar
Silverman, J. D., Roche, K., Holmes, Z. C., David, L. A. & Mukherjee, S. Bayesian multinomial logistic normal models through marginally latent matrix-T processes. arXiv:1903.11695 (arXiv preprint) (2019).
Barber, D. Bayesian Reasoning and Machine Learning (Cambridge University Press, 2012).
Book MATH Google Scholar

Download references

Acknowledgements

The authors would like to thank all members of Bioinformatics Research Group (BioRG).

Author information

Authors and Affiliations

Bioinformatics Research Group (BioRG), Florida International University, Miami, 33199, USA
Musfiqur Sazal, Vitalii Stebliankin & Giri Narasimhan
Herbert Wertheim College of Medicine, Florida International University, Miami, 33199, USA
Kalai Mathee
Biomolecular Sciences Institute, Florida International University, Miami, 33199, USA
Kalai Mathee & Giri Narasimhan
Department of Biostatistics, Florida International University, Miami, 33199, USA
Changwon Yoo

Authors

Musfiqur Sazal
View author publications
You can also search for this author in PubMed Google Scholar
Vitalii Stebliankin
View author publications
You can also search for this author in PubMed Google Scholar
Kalai Mathee
View author publications
You can also search for this author in PubMed Google Scholar
Changwon Yoo
View author publications
You can also search for this author in PubMed Google Scholar
Giri Narasimhan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The research project was conceived by M.S. M.S. performed all the experiments, and analyzed the results. G.N. supervised the whole research project. All authors reviewed the manuscript. All authors read and approved the manuscript.

Corresponding author

Correspondence to Giri Narasimhan.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sazal, M., Stebliankin, V., Mathee, K. et al. Causal effects in microbiomes using interventional calculus. Sci Rep 11, 5724 (2021). https://doi.org/10.1038/s41598-021-84905-3

Download citation

Received: 09 October 2020
Accepted: 23 February 2021
Published: 11 March 2021
DOI: https://doi.org/10.1038/s41598-021-84905-3

This article is cited by

Copper intrauterine device increases vaginal concentrations of inflammatory anaerobes and depletes lactobacilli compared to hormonal options in a randomized trial
- Bryan P. Brown
- Colin Feng
- Heather B. Jaspan
Nature Communications (2023)
Prior exposure to microcystin alters host gut resistome and is associated with dysregulated immune homeostasis in translatable mouse models
- Punnag Saha
- Dipro Bose
- Saurabh Chatterjee
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

A host–microbiota interactome reveals extensive transkingdom connectivity

Genome-wide association studies

Microbiota in health and diseases

Introduction

Algorithms

Causal structure

Construction of causal networks

Intervention

Interventional calculus

Causal effect and causal influence

Y-structures

Results

Synthetic data

Real data set

UC data set

Influence subnetworks in the causal network from UC data

Disease networks

Y-structure validation

Sensitivity analysis

Discussion

Methods

Problem formulation

Data

Experiments

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Additional information

Publisher's note

Supplementary Information

Supplementary Information 1.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Copper intrauterine device increases vaginal concentrations of inflammatory anaerobes and depletes lactobacilli compared to hormonal options in a randomized trial

Prior exposure to microcystin alters host gut resistome and is associated with dysregulated immune homeostasis in translatable mouse models

Comments

Search

Quick links