Elsevier

Journal of Theoretical Biology

Volume 242, Issue 2, 21 September 2006, Pages 409-420
Journal of Theoretical Biology

Evolution on distributive lattices

https://doi.org/10.1016/j.jtbi.2006.03.013Get rights and content

Abstract

We consider the directed evolution of a population after an intervention that has significantly altered the underlying fitness landscape. We model the space of genotypes as a distributive lattice; the fitness landscape is a real-valued function on that lattice. The risk of escape from intervention, i.e., the probability that the population develops an escape mutant before extinction, is encoded in the risk polynomial. Tools from algebraic combinatorics are applied to compute the risk polynomial in terms of the fitness landscape. In an application to the development of drug resistance in HIV, we study the risk of viral escape from treatment with the protease inhibitors ritonavir and indinavir.

Introduction

The evolutionary fate of a population is determined by the replication dynamics of the ensemble and by the reproductive success of its individuals. We are interested in scenarios where most individuals have a low fitness, eventually leading to extinction, and only a few types of individuals (“escape mutants”) can survive permanently. These situations often arise due to a significant change of the underlying fitness landscape. For example, a virus that has been transmitted to a new host is confronted with a new immune response. Likewise, medical interventions such as radiation therapy, vaccination, or chemotherapy result in altered fitness landscapes for the targeted agents, which may be bacteria, viruses, or cancer cells.

Given a population and such a hostile fitness landscape, the central question is whether the population will survive. In the case of medical interventions we wish to know the probability of successful treatment. Answering this question involves computing the risk of evolutionary escape, i.e., the probability that the population develops an escape mutant before extinction. We present a mathematical framework for computing such probabilities.

Our primary application is the evolution of drug resistance during treatment of HIV infected patients (Clavel and Hance, 2004). We consider therapy with two different protease inhibitors (PIs). These compounds interfere with HIV particle maturation by inhibiting the viral protease enzyme. The effectiveness of PI therapy is limited by the development of drug resistance. Rapid and highly error prone replication of a large virus population generates mutants that resist the selective pressure of drug therapy. PI resistance is caused by mutations in the protease gene that reduce the binding affinity of the drug to the enzyme. These mutations have been shown to accumulate in a stepwise manner (Berkhout, 1999). For most PIs, no single mutation confers a significant level of resistance, but multiple mutations are required for escape from drug pressure. Quantitative predictions of the probability of successful PI treatment would help in finding effective antiretroviral combination therapies. Selecting a drug combination amounts to controlling the viral fitness landscape.

We regard the directed evolution of a population towards an escape state as a fluctuation on a fitness landscape. The space of genotypes is modeled as follows. We start with a finite partially ordered set (poset) E whose elements are called events. The events are non-reversible mutations with some constraints on their order of occurrence. Such constraints are primarily due to epistatic effects between different loci in a genome (Bonhoeffer et al., 2004). The event constraints define the poset structure: e1<e2 in E means that event e1 must occur before event e2 can occur. Each genotype g is represented by a subset of E, namely, the set of all events that occurred to create g. Thus a genotype g is an order ideal in the poset E. The space of genotypes G is the set of all order ideals in E, which is a distributive lattice (Stanley, 1997, Section 3.4). The order relation on G is set inclusion and corresponds to the accumulation of mutations. This mathematical formulation is reasonable in the above situations, where a population is exposed to strong selective pressure.

The risk of escape is governed by the structure of G, the fitness function on G, and the population dynamics (such as the mutation rates and population size). Our focus is on the dependency of the risk of escape on the assigned fitness values for each genotype gG. This leads us to the risk polynomial, which is shown to be equivalent to a well-known object in algebraic combinatorics. Indeed, one of the objectives of this work is to provide a bridge between algebraic combinatorics and evolutionary biology.

This paper is organized as follows. In Section 2 we formalize our model of a static fitness landscape on the genotype lattice G derived from an event poset E, and we discuss evolution on the lattice G. In Section 3 we review the multistate branching process studied by Iwasa et al., 2003, Iwasa et al., 2004.

In Section 4 we study the Bayesian networks which arise from identifying the events in E with binary random variables. These statistical models can be used to infer the genotype space from given data. For conjunctive Bayesian networks we recover the distributive lattice of order ideals in E. Of particular interest is the case where E is a directed forest: here the Bayesian network is a mutagenetic tree model (Beerenwinkel and Drton, 2005; Beerenwinkel et al., 2005a, Beerenwinkel et al., 2005b). The application of our methods to the development of PI resistance in HIV is presented in Section 5.

The appendix summarizes various representations of the risk polynomial in terms of structures from algebraic combinatorics. Efficient methods for computing the risk polynomial and their implementation are presented.

Section snippets

Fitness landscapes on distributive lattices

A partially ordered set (or poset) is a set E together with a binary relation, denoted “”, which is reflexive, antisymmetric, and transitive. Here we fix a finite poset E whose elements are called events. If the number of events is n then we often identify the set underlying E with the set [n]={1,2,,n}. In this way, the subsets of E are encoded by the 2n binary strings of length n. The empty subset of E is encoded by the all-zero string 0^=000 which represents the wild type, and the full set

The risk of escape

For a poset of events E and the corresponding distributive lattice G=J(E), the risk polynomial of G is defined as the polynomial (4), which we denote by R(G;f). The risk polynomial was introduced in Iwasa et al., 2003, Iwasa et al., 2004. In this section we review the evolutionary dynamics model proposed in these papers, and we discuss the probabilistic meaning of the risk polynomial.

Example 6

Let G be the genotype lattice in Fig. 1. Then the risk polynomial R(G;f) is the following polynomial of degree

Distributive lattices from Bayesian networks

In this section, we present a family of statistical models that naturally gives rise to distributive lattices. This statistical interpretation provides a method for deriving the genotype lattice G directly from data. The basic idea is to estimate the poset structure on E from observed genotypes, by applying model selection techniques to a range of Bayesian networks, and to define G as the set of all genotypes with non-zero probability in the model.

We first make precise the derivation of a

Applications to HIV drug resistance

We investigate the development of resistance during treatment of HIV infected patients with two different PIs. Consider the seven genetic events E={K20R, M36I, M46I, I54V, A71V, V82A, I84V},where K20R stands for the amino acid change from lysine (K) to arginine (R) at position 20 of the protease chain, etc. The occurrence of these mutations confers broad cross-resistance to the entire class of PIs. Appearance of the virus with all seven mutations renders most of the PIs ineffective for

Discussion

We have presented a computational framework for assessing the risk of escape of an evolving population of pathogens. The risk of escape is the probability that the population reaches an escape state before extinction. In virus transmissions, for example, this probability is the chance of survival in the new host. In the situation of antiretroviral therapy, the risk of escape is the probability of therapy failure due to the development of drug resistance.

The general setup we consider for

Acknowledgements

Niko Beerenwinkel is supported by Deutsche Forschungsgemeinschaft under Grant No. BE 3217/1-1. Nicholas Eriksson and Bernd Sturmfels are supported by the U.S. National Science Foundation, under the grants EF-0331494 and DMS-0456960 respectively, and by the DARPA program Fundamental Laws in Biology (HR0011-05-1-0057).

References (29)

  • B. Berkhout

    HIV-1 evolution under pressure of protease inhibitors: climbing the stairs of viral fitness

    J. Biomed. Sci.

    (1999)
  • S. Bonhoeffer et al.

    Evidence for positive epistasis in HIV-1

    Science

    (2004)
  • G. Brightwell et al.

    Counting linear extensions

    Order

    (1991)
  • F. Clavel et al.

    HIV drug resistance

    N. Engl. J. Med.

    (2004)
  • Cited by (33)

    • Pharmacokinetic-pharmacodynamic relationship of NRTIs and its connection to viral escape: An example based on zidovudine

      2009, European Journal of Pharmaceutical Sciences
      Citation Excerpt :

      The action of non-nucleoside reverse transcriptase inhibitors (NNRTIs) can also easily be incorporated into the effect model. Most of the current models of HIV dynamics, e.g. (Perelson, 1999) and evolution, e.g. (Beerenwinkel et al., 2006) are based on the assumption of constant drug concentrations. Our results highlight the importance of drug pharmacokinetics, in the case of zidovudine on the delay of viral DNA chain completion.

    • Universal Causality

      2023, Entropy
    • Was Gelfand Right? The Many Loves of Lattice Theory

      2022, Notices of the American Mathematical Society
    View all citing articles on Scopus
    View full text