ABSTRACT
Natural selection successfully explains how organisms accumulate adaptive change despite that traits acquired over a lifetime are eliminated at the end of each generation. However, in some domains that exhibit cumulative, adaptive change—e.g., cultural evolution, and earliest life—acquired traits are retained; these domains do not face the problem that Darwin’s theory was designed to solve. Lack of transmission of acquired traits occurs when germ cells are protected from environmental change, due to a self-assembly code used in two distinct ways: (i) actively interpreted during development to generate a soma, and (ii) passively copied without interpretation during reproduction to generate germ cells. Early life and cultural evolution appear not to involve a self-assembly code used in these two ways. We suggest that cumulative, adaptive change in these domains is due to a lower-fidelity evolutionary process, and model it using Reflexively Autocatalytic and Foodset-generated networks. We refer to this more primitive evolutionary process as Self–Other Reorganisation (SOR) because it involves internal self-organising and self-maintaining processes within entities, as well as interaction between entities. SOR encompasses learning but in general operates across groups. We discuss the relationship between SOR and Lamarckism, and illustrate a special case of SOR without variation.
1. INTRODUCTION
Evolution refers to cumulative, adaptive change over time; thus, an evolutionary process produces increasingly adapted entities. Under standard models of natural selection (dating back to Darwin), evolution is driven by differential replication of randomly generated heritable variations in a population over generations, resulting in some traits becoming more prevalent than others.1 Natural selection has provided the foundation of modern biology, and a unifying principle for describing species change well after the origin of life. However, as noted by Gould [38], “Evolution, in fact, is not the study of origins at all… Evolution studies the pathways and mechanisms of organic change following the origin of life.” Since natural selection requires the existence of multiple competing entities that are selected amongst, it cannot explain how the first living entity arose (nor the second) [23]. The theory of natural selection is an explanation for how life diversified after it originated; the means by which it originated is currently an unsolved problem [80].
Darwin’s theory arose in response to the paradox of how organisms accumulate adaptive change despite that traits acquired over a lifetime are eliminated at the end of each generation. He devised a population-level explanation: although acquired traits are discarded, inherited traits are retained, so evolution is due to preferential selection for those inherited traits that confer fitness benefits on their bearers. But although Darwinian principles have been applied to early life [9, 15, 16] and to cultural evolution [4, 8, 14, 72, 91, 102], neither origin of life research, nor cultural evolution research, is plagued by the problem that acquired traits are extinguished at the end of a generation. By culture, we mean extrasomatic adaptations such as behavior and artifacts that are socially rather than sexually transmitted, as well as the complex whole of the knowledge, beliefs, art, laws, customs, and so forth, acquired by the members of a society [94]. Because of lateral or horizontal transfer in both culture and very early life, these domains do not face the paradox that Darwin’s theory was designed to solve [23, 25, 97].
One could argue that even in these domains where acquired traits are not eliminated at the end of each generation, the observed cumulative, adaptive change may nevertheless be due to natural selection. However, species that discard traits acquired over a lifetime possess a set of coded self-assembly instructions which are used in two distinct and separate ways: (i) actively interpreted during development to generate a soma, and (ii) passively copied without interpretation during reproduction to produce germ cells [98, 67]. This sequestering of germ cells from developmental changes is responsible for the sine qua non of a Darwinian process: lack of transmission of acquired traits [46]. This tells us that it is a rather complex structure— i.e., a set of coded self-assembly instructions used in these two distinct ways—that enables cumulative, adaptive change without transmission of acquired traits. The above-mentioned forms of evolution (that of earliest life, and cultural evolution) may not possess such a structure. Note also that although culture involves replication, as when one artist copies a painting made by another, this is not self-replication; the painting does not replicate itself.
Accordingly, it would seem appropriate to pay particular attention in these domains to the role of epigenetic and non-Darwinian (e.g., Lamarckian) processes, and indeed, it is increasingly accepted that non-Darwinian processes play a role in evolution [23, 57, 58, 97, 104]. This paper goes further by providing an existence proof that evolution is possible in the absence of variation and selection, and mathematically describing a type of entity that allows for this kind of evolutionary process. This second component is important, because although entities of many kinds change due to acquired traits—e.g., a rock tumbling down a stream will acquire rounded edges—such change may not be adaptive. In contrast to the rock—and also, in contrast to neutral evolution [60, 61]—the evolutionary change considered in this paper is adaptive, i.e., it contributes to the entity’s efficiency, survival, or replication. Humans might find smoother rocks more appealing, and thus they may be more valuable to humans, but this increased smoothness is not valuable to the rock itself; indeed, the process that rounds the edges of a rock culminates in its disintegration. Similarly, optimization algorithms produce outcomes that are deemed adaptive not because they benefit the algorithms themselves; they are deemed adaptive only insofar as they benefit us.
In origin of life research, there is a sizeable literature on models of the evolvability of primitive metabolic networks prior to DNA (e.g., [65, 84, 85, 95, 105]). Here, in order to describe the origin of both biological life and culture, within a common framework for the origins of evolutionary processes, we take a more general mathematical approach. The approach addresses the problem of network control [68] using a generalized notion of catalysis, or facilitated interaction; existing elements (referred to as the foodset) interact, and stimulate (or catalyze) the generation of new (foodset-derived) elements, thereby introducing a means of endogenous control.
The paper begins with an introduction to the general framework for modeling the origins of evolutionary processes, which will be used to develop the argument. We then provide a mathematical model for evolution in the absence of variation and selection with this framework. Next, we explain why we refer to this form of evolution as Self–Other Reorganisation (SOR), and describe what type of structure is able to exhibit this form of evolution. To accommodate an interdisciplinary readership, a list of standard definitions of terms used in this paper is provided in the Supplementary Materials. Acronyms used in this paper are listed alphabetically in Table 1.
2. AUTOCATALYTIC NETWORKS
The evolution process described here involves entities that (1) are self-maintaining (i.e., they have a means of preserving their structure), and (2) interact with each other by way of their environment. The theory of autocatalytic networks grew out of studies of the statistical properties of random graphs consisting of nodes randomly connected by edges [13]. As the ratio of edges to nodes increases, the size of the largest cluster increases, and the probability of a phase transition resulting in a single giant connected cluster also increases. The recognition that connected graphs exhibit phase transitions led to their application to efforts to model of the origin of life (OOL) [56, 57]. Applying graph theory to the OOL, the nodes represent catalytic molecules, and the edges represent reactions. It is exceedingly improbable that any catalytic molecule present in Earth’s early atmosphere catalysed its own formation. However, reactions generate new molecules that catalyse new reactions, and as the variety of molecules increases, the variety of reactions increases faster. As the ratio of reactions to molecules increases, the probability increases that the system undergoes a phase transition. When, for each molecule, there is a catalytic pathway to its formation, they are collectively autocatalytic, and the process by which this is achieved has been referred to as autocatalytic closure [57]. The molecules thereby become a self-sustaining, self-replicating, living protocell [51]. Thus, the theory of autocatalytic networks provides a promising avenue for understanding how biological life began [105].
2.1. Reflexively Autocatalytic and Foodset-generated (RAF) networks
Autocatalytic networks have been developed mathematically in the theory of Reflexively Autocatalytic and Foodset-generated (RAF) networks [52, 79, 90]. The term reflexively is used in its mathematical sense, meaning that every element is related to the whole. The term foodset refers to the elements that are initially present, as opposed to those produced through interactions amongst them. (Elements of a ‘foodset’ are not the same thing as ‘food’; the foodset is simply the raw building blocks available.)
We now summarise the key concepts of RAF theory. A catalytic reaction system (CRS) is a tuple consisting of a set X of element types, a set
of reactions, a catalysis set C indicating which element types catalyse which reactions, and a subset F of X called the foodset. A Reflexively Autocatalytic and F-generated set (i.e., a RAF) is a non-empty subset
of reactions that satisfies the following properties:
Reflexively autocatalytic (RA): each reaction
is catalysed by at least one element type that is either produced by
or present in the foodset F; and
F-generated: all reactants in
can be generated from the foodset F by using a series of reactions from
itself.
A set of reactions that forms a RAF is simultaneously self-sustaining (by the F-generated condition) and (collectively) autocatalytic (by the RA condition) because each of its reactions is catalysed by an element associated with the RAF. Note that in RAF theory, a catalyst does not necessarily cause more of something to be produced (or at a faster rate); the catalyst is simply the impetus that allows a reaction to proceed.
A CRS need not have a RAF; alternatively, it may contain many RAFs. When a CRS does contain RAFs there is a unique maximal one, the maxRAF. It is the fact that a CRS may contain multiple RAFs that allows RAFs to evolve, as demonstrated both in theory and in simulation studies, through selective proliferation and drift acting on the RAFs that are subsets of the maxRAF [52, 95].
2.2. Biological Applications of RAFs
RAFs have proven useful for modelling the OOL and the onset of biological evolution [52, 90, 95, 105]. In the OOL context, a RAF emerges in systems of polymers (molecular strings composed of repeated units called monomers) when the complexity of these polymers (as measured by their maximum length) reaches a certain threshold [57, 74]. The phase transition from no RAF to a RAF incorporating most or all of the elements depends on (1) the probability of any one polymer catalyzing the reaction by which a given other polymer is formed, and (2) the maximum length (number of monomers) of polymers in the system. This transition has been formalised and analysed mathematically, and using simulations, and RAF theory has been applied to real biochemical systems [48, 49, 50, 52, 74].The theory has proven useful for identifying how phase transitions might occur and at what parameter values.
2.3. Applications of RAFs to Culture and Cognition
Computational models of cultural evolution (e.g., [17, 43, 63]) have described adaptive change in the absence of autocatalytic structure. However, in such models, the cultural outputs are said to become more adapted (or complex) not because they change in ways that enhance the cultural outputs’ own preservation or evolution, but because they change in way benefit the artificial agents producing them. The programmers may define up front what constitutes a superior cultural output, but the models do not incorporate how different cultural outputs interact with agents in ways that affect survival or evolvability. As such, these models are not forced to confront the question of what kind of structure can sustain cultural evolution. In the autocatalytic model presented in this paper, (1) we do consider how the outputs interact with the internal structure of the agents, modeled as RAF networks, and (2) it is not the outputs that are considered to be evolving, but the self-organizing, self-preserving, and essentially autocatalytic minds that generate and make use of these outputs.
Autocatalytic networks have been used to model cognitive transitions associated with the origin of culture and the capacity for cumulative cultural evolution [18, 19, 20, 32, 34, 33], and the onset of behavioral modernity (i.e., the capacity to think and act like modern humans) [33]; for related approaches see [2, 6, 12, 76]. Also, the evolution and reproduction of RAFs has been studied in an OOC context [34]. RAFs have also been used to develop a process model of conceptual change (i.e., the process by which a child becomes an active participant in cultural evolution) [29].
Whereas the OOL involves networks of chemical reactions, the OOC setting involves conceptual networks of knowledge and memories, and the products and reactants are not catalytic molecules but culturally transmittable mental representations2 (MRs) of knowledge, experiences, ideas, schemas, and scripts. Whereas a chemical reaction network may be bounded by a semipermeable lipid membrane, a conceptual network supervenes upon a brain within a body, which thus constitutes a boundary on the cognitive RAF. The OOL and OOC settings both exhibit an underlying RAF structure. Just as reactions between molecules generate new molecules, interactions between MRs generate new MRs, which in turn enable new interactions. In a cognitive/cultural context, the term foodset refers to MRs that are innate, or obtained through social learning or through individual learning from the natural world or human-made artifacts. In contrast, foodset-generated refers to MRs that are generated from scratch by the individual, as a result of mental operations such as induction or concept combination. Initially they are generated solely through interactions amongst foodset representations, but as the proportion of foodset-generated items increases they come to play a larger role in these interactions. There are generally multiple RAFs in the conceptual network of an individual, which may be subsumed by the maxRAF that connects them; though, they may also be disconnected (as when inconsistent views are held by the same individual).
The RAF approach to culture and its cognitive underpinnings offers several advantages over other approaches:
Semantic grounding. the problematic circularity of defining MRs in terms of other MRs [42] is avoided by distinguishing between foodset items and foodset-derived elements. This provides a natural means of grounding abstract concepts in direct experiences. Complex understandings, modeled as foodset-derived MRs, emerge through mental operations, modeled as ‘reactions,’ that can be traced back to foodset MRs in a given individual.
Reactivity of ideas. In cognitive RAFs, mental representations (MRs) become aligned with needs and desires (e.g., the need to solve a problem or resolve cognitive dissonance), and the emotions they elicit. In so doing, they incite (‘catalyze’) interactions between other mental representations (modeled as ‘reactions’), which can be carried out recursively. As a result, whereas conventional conceptual network models require external input to continue processing, RAFs ‘catalyze’ conceptual change endogenously (in the absence of new external input), resulting in new conduits by which needs and goals can be met. Thus, a RAF network is self-organizing, and conceptual restructuring can percolate throughout the network and affect its global structure.
Track cultural change within and across conceptual networks. In a RAF model, acquired traits are modeled as foodset-generated elements: traits that come into existence within a given entity. For example, if Alice comes up with an idea herself, for Alice that idea is an acquired trait, modeled as a foodset-generated item. If Alice expresses the idea to Bob, we have transmission of an acquired trait. The idea becomes part of Bob’s foodset, because he did not generate it from scratch. However, if he used Alice’s idea to develop his own new idea, this new idea would be an acquired trait for Bob, modeled as a foodset-generated element. Thus, the distinction between foodset and foodset-generated elements makes it possible to trace innovations back to the individuals that generated them, and and track how new ideas and cultural outputs emerge from previous ones.
3. A RAF MODEL OF EVOLUTION WITHOUT VARIATION AND SELECTION
We now demonstrate a primitive non-Darwinian form of evolution using RAF networks. Consider a group of indistinguishable entities, as illustrated in Fig. 1. (They may be CRSs such as those associated with very early life [3, 10, 23, 37, 53, 89, 97], culturally evolving human conceptual networks (e.g., [17, 35]), or some other structure we have never encountered.) The entities are described as identical RAFs.
Identical entities exhibit cumulative adaptive change over time, as indicated by transition from light to dark, as a result of cumulative environmental changes, indicated by p, p′…. Catalysis is indicated by dashed green arrows. Reactions are indicated by blue squares. Reactants and products within the RAF of entity i are indicated by Greek letters. Stimulus s catalyses a reaction in the RAF of entity i, resulting in product p. The structure of i’s RAF is illustrated in the blow up on the upper left (with the set of element types X = {α,β, γ, δ, ∊, s}, food set F = {α, δ, s}, and with ∊ giving rise to p). The product p in turn catalyses a reaction sequence in j at time t2, leading to p′, and so on. In the scenario outlined here, stimulation of i by s occurs only once, but that is sufficient to generate p, which is sufficient to set in motion a sequence of adaptive changes.
One entity in , which we call entity i, encounters an environmental stimulus, which we refer to as s. In the case of a biochemical reaction network, s could be a molecule that crosses the lipid membrane encasing it. In the case of culture, s could be a visual or auditory stimulus. What makes s a novel stimulus is that at t0 it was not part of the reaction network of any entity in
. Stimulus s is nonetheless part of the foodset, because it is available as an initial reactant or catalyst, as opposed to being directly or indirectly constructed from available reactants.3
Stimulus s triggers one or more reaction(s) within the RAF network of entity i. In a OOL setting the reactions are chemical reactions, while in a OOC setting they are mental operations that modify MRs. (Recursive sequences of such ‘reactions’ have been referred to as cognitive catalytic processes (CCPs) [32, 34]). The formation of a transient CCP is illustrated in Fig. 2(ii) of [35].) This reaction (or reaction chain) culminates in the generation of a product p into the environment. Thus, p was not part of the foodset; it is the end result of the series of reactions (or interactions) triggered by s. Note that s can only trigger formation of p if the necessary reaction sequence takes place, and this reaction sequence requires the RAF.
Comparison of (a) evolution through natural selection and (b) evolution by Self-Other Reorganisation (SOR). An X indicates that the entity was selected against (i.e., did not contribute to the next generation). Individual entities are represented by circles, with variation among them represented by different patterns. (In the example shown in (b), there is variation but there need not be, as illustrated in Fig. 1). In both (a) and (b), the darker the color the better adapted the individual (regardless of pattern). In (a), the darker color of one of the individuals in the third generation is due to mutation. Both (a) and (b) result in cumulative, adaptive change over time. However, (a) is relatively slow because it works due to differential replication of randomly generated heritable variation in a population over generations, such that some traits become more prevalent than others, with acquired changes being discarded at the end of each generation. In contrast, change in (b) is entirely due to acquired change over time.
Product p confers some adaptive benefit on i; exposure to p may improve the efficiency of i’s reaction network (or conceptual network) by reducing the number of steps required for a particular reaction sequence (or problem-solving process). It may have a protective function (e.g., by making the environment less inviting for a predator) or (in a cultural context) it may serve as a source of pleasure that enhances wellbeing (e.g., a work of art). Having generated p, i returns to its initial state, except that it now experiences the adaptive benefit of p.
Once p is released into the environment, it can be encountered by other entities in . (In an OOL context, p may be transmitted because it passes through a lipid membrane or protocell wall. In an OOC context, p may be transmitted via social learning, or by way of an artifact.) A recipient of p could have had the capacity to generate p on its own, but by chance had never done so. (In a OOL context, for example, a particular catalyst may have never come in contact with its reactant. In a cultural context, an individual may possess two concepts but never thought of combining them.) Alternatively, it is possible that the recipient did not previously possess the capacity to generate the product.
Since all entities are identical to i and exist in the same environment as i, they too undergo the adaptive benefit of p, and this benefit percolates throughout .4 The generation of p makes it possible to generate a new product, p′, which confers even greater adaptive benefit than p. An entity j assimilates p, which triggers a sequence of one or more reactions in j, resulting in p. Entity j benefits from the enhanced adaptive quality of p′ over p. The ability to produce p′ and experience its adaptive benefit percolates throughout the group
. Thus, each version of p paves the way for its successor. The entities in
continue to generate increasingly beneficial versions of the original p (e.g., p, p′, p″,…) catalyzed by the products of other entities. They collectively come to possess the ability to generate and benefit from this family of products through the restructuring of their RAFs. Thus, we have cumulative adaptive change over time. Notice that in this model there is no selection and no competitive exclusion, and nor is there birth or death; it is the same set of entities at the beginning as at the end.
3.1. Modelling this process
We consider a simple model based on two processes: percolation and generation. Let ρ denotes the rate of percolation of products, or knowledge (i.e., MRs) of these products through the group , with neighbourhood structure represented by a directed graph
. (The nodes of this graph are the entities in
, and the arcs indicate directed percolation links). For the generation process, we let λ denote the rate (per entity) at which new products are generated. Provided that the ratio ρ/λ is not too small, the entities evolve via cumulative, adaptive change, with the only variation being between those in
that have not yet switched to a new product and those that have (this might be likely in a cultural setting when the entity generating a new product is equidistant from all the others, or in a social media context, where everyone sees something at the same time). Moreover, for large values of ρ/λ, each percolation step will be complete before the next new product is generated, so there will be no variation amongst the entities.
Conversely, as ρ/λ → 0, the entities diverge from each other with respect to the products they generate. Their complexity (which may be estimated through analysis of the maximal number of modifications to any product at any given time for all entities in ) is expected to be lower than in the previous scenario of shared cumulative adaptive change.
To help formalise the model, we adopt the following terminology. Given the products p, p′, p″,.…, we say that the entity that first gives rise to one of these products generates it. In the case where this product is produced from an earlier product in the series (rather than being the original product p in response to a stimulus), we say the entity transforms the existing product to the new one. We model the increase in the adaptive value of products within using two stochastically independent non-deterministic processes.
First, the generation of new products by entity i involves either generating a new product, or transforming the most recent version of any available product; for simplicity, we assume that these are equally probable. For example, if entity i has the products (p, p′, q, r, r′, r″) currently available, then it can either transform p′, or q, or r″, or generate a new product, and each of these four outcomes has the same probability, namely 1/4. This process across entities is assumed to be described by independent exponential random variables with a fixed rate per entity of λ. (Making λ independent of the entity is consistent with the assumption that all entities are initially identical). Second, each newly generated product begins to percolate through the group by moving along the arcs of the directed graph according to a continuous-time random walk at rate ρ. For simplicity, we will treat the two processes—generation and percolation—as stochastically independent. Let
(the number of entities in
). We will assume that the directed graph
that describes the community interactions within
has the property that from each entity in
, it is possible to reach any other entity in
by following some directed path in
(i.e.,
is ‘strongly connected’); however, no further assumptions are made regarding this graph.
We now introduce some notation to keep track of the different versions of products that arise in the process described in Section 3. Suppose that product p first arises from entity i1, and product p is then further modified by entity i2, and so on, with the last modification (before time t) being made by entity ik. We denote the sequence of products thus generated within up to time t as: p(i1), p(i1, i2), p(i1, i2, i3),.… More generally, we denote such a sequence by writing (p(i1),…, p(i1, i2,…, ik) : k ≥ 1) (thereby allowing the possibility that a product is generated but not transformed, in the case where k =1). We refer to the number k of terms in this sequence as the complexity of the final product; thus, when an entity transforms a product, it increases its complexity by 1 (in particular, the first product p(i1) has complexity 1).
Note that under the assumptions of the model, the entities i1,…,ik are not necessarily distinct (i.e., an entity may enhance a product more than once, either consecutively, or later in the sequence). There may also be several such sequences generated within ; for example, in addition to the previous sequence, one might also have p(j1), p(j1, j2),…, p(j1, j2,…, jl), along with possibly other sequences generated over the time interval [0,t].
We let denote the probability of event * and
denote the expectation of random variable **. Let Tρ(i) be the expected time for a product generated by entity i to percolate (within
) to every entity in
, and let Tρ = max{Tρ(i) : i ∈ G}. For a wide range of standard percolation processes, the following properties then hold: (i) for ρ > 0, we have
; (ii) for all η > 0,
, and (iii)
. This last property implies that when ρ is large, products are highly likely to percolate throughout the entire group
in a short time.
If we start this process at time 0 with no products present, let τ0, τ2,…,τk be the random variables that describe the time intervals between the generation of products across the collection of entities in . By the assumptions of the model, the τi variables are independent and exponentially distributed, with each variable having an expected value of 1/(Nλ). Thus,
is the time until k products have been generated (this has a gamma distribution with expected value k/(Nλ)). Let μ = Nλ. Then, for any η> 0,
and
(by the Markov inequality). Let εk denote the following event: for each of the first k products generated, each product percolates to each entity in
before the next new product (in this collection of size k) is generated in
. We then have:
Setting in (1) and applying Property (iii) above gives:
Thus, as ρ becomes large, the entities evolve collectively, and any variation is transient and short-lived. We will refer to this limiting case as the community-based model. One can model this process by the following novel type of Pólya Urn model:
Consider an urn that initially has a single white ball. At each step (the timing of which follows a Poisson process at rate r), a ball is selected from the urn uniformly at random. If the selected ball is white, it is returned to the urn along with a ball of a new colour (not present already in the urn). If the selected ball is coloured, it is removed and replaced by a ball of the same colour but a darker shade.
To connect this urn process to the community-based model described above, note that selecting a white ball corresponds to the generation of a new product (which results in a ball of a new colour being added to the urn), while selecting a coloured ball and exchanging it for a darker one of that colour corresponds to the transformation of an existing product. Thus, r = Nλ.
We now compare the community-based model (corresponding to ρ large) to the opposite extreme, where ρ becomes small. In that latter case, the probability that there is percolation between any two entities in over the interval [0, t] tends to 0, and so products are only generated within entities but not shared between them. We will refer to this limiting case as the individual-based model. Note that in this individual-based model, entity ij may possibly generate a new product p(ij), or generate p(ij) and then transform it (producing p(ij, ij) and so on (or it might not generate any new products at all). Note that, in general, p(ij) may be different from p(ik) (for k ≠ j) (i.e., different entities may either produce or transform different products).
For the individual-based model, we have N independent samples of the above Urn model but with r = λ. By contrast, with the community-based model, we have a single sample of the above Urn model, but with r = Nλ. Note that both models have the same expected number of generation events, but they have quite different dynamics, as we now describe.
Firstly, in the community-based model, there is only short-lived or transient variation among the entities, whereas in the individual-based model, the individuals diverge from each other in terms of the collections of products available to them. However, a subtler difference is that in the community model, the complexity of items is significantly higher than in the individual model, in a sense that we now make precise.
To analyse this in more detail, let Xt denote the number of steps in this Urn process (i.e., where a ball is sampled and the urn modified) over the interval [0,t). Then Xt has a Poisson distribution with mean rt. Next, let Yt denote the number of times a white ball is selected from the urn over the interval [0,t), let Zt denote the number of times a coloured ball is selected from the urn over the time interval [0,t), and let Ct denote the number of coloured balls in the urn at time t. Notice that the following two identities hold:
Let . We claim that as r → ∞:
and so (applying the Bonferroni inequality):
The first limit in (4) holds because Xt has a Poisson distribution with mean rt, and so Xt/rt converges in probability to 1. For the second limit in (4), let Tkr denote the time until Yt first hits kr, in which case, a well known identity applies:
Now, Tkr is a sum of kr independent exponential random variables, with means 1/r, 2/r, …, kr/r and variances . Thus, Tkr has expected value
and variance
Applying Eqn. (6) now shows that as r grows, thereby justifying the second part of Eqn. (4).
We now apply Eqns. (2) and (3). These reveal that the condition Xt/Yt ≥ kr is equivalent to the condition that Zt/Ct ≤ kr – 1. By the well-known ‘pigeonhole principle’ in combinatorics, this last inequality implies that at least one of the (Ct) coloured balls must be at least kr – 2 shades darker than when that colour first appeared in the urn (because otherwise, each of the Ct coloured balls must be selected at most kr – 2 times, in which case Zt ≤ (kr – 2)Ct).
It now follows from Eqn. (5) (noting that the term in that equation (namely ) is of order
)), that for large r (for fixed t), there is a high probability that at least one coloured ball is present in the urn that is of order
shades darker than it was when it first appeared in the urn. Since the community-based model has r = Nλ, we arrive at the following conclusion regarding the influence of the size of
on complexity:
Over a given period of time, some products in the community-based model have an expected complexity of order at least
.
By contrast, for the individual-based model, we have r = λ for each entity, and so we have N independent and identically distributed samples of a process where the maximum complexity of products across each group exhibit a lower (logarithmic) dependence on N. (Moreover, these complex products are likely to exist only in one or a few entities, rather than being shared across the group). To see this, note that the complexity of any product associated with an entity (up to time t) is bounded above by the number of generation steps for that entity, which has a Poisson distribution with mean λt, and the maximum of N independent and identically distributed Poisson random variables is known to be dominated asymptotically (with N) by a log(N) term [59].
We note that the above model is different from the model described by [1]), which was based on autocatalytic ‘cores’ (which are closely related to RAFs [52]) and also involved an ambient food set. However, the goal in that paper was to establish bounds on the rate of growth in the number of autocatalytic cores as a function of the number of molecular species in a chemical reaction system.
4. Self-Other Reorganisation (SOR)
We have shown how entities that possess an abstract structure mathematically described by RAFs can exhibit evolution—i.e., cumulative, adaptive change—and how it is possible for this process to operate in the absence of variation and/or selection. We refer to this kind of evolutionary process as Self–Other Reorganisation (SOR) because it consists of (i) self-organization of RAF networks, and (ii) interactions amongst these networks that alter their potential for future configurations [26, 28, 31, 99]. SOR was developed on the basis of theory and findings that have come out of the post-modern synthesis in biology [23, 88]. We now compare SOR to other established concepts in evolutionary theory: learning, natural selection, Lamarckian evolution, and the neutral theory of molecular evolution.
4.1. SOR versus Learning
Learning plays a role in SOR, since RAFs are central to SOR, and in cognitive RAFs the foodset consists of MRs acquired through social learning and individual learning. These processes constitute the external component of SOR, as opposed to the internal component, which involves catalyzed reactions amongst MRs resulting in new MRs that were not available as initial reactants, i.e., foodset-derived MRs. In the extreme, SOR could work with just one entity. In this case, the foodset consists of MRs acquired through individual learning; there are no MRs acquired through social learning. However, so long as individually learnt information can trigger cognitive ‘reactions’, it would be possible for some degree of cumulative, adaptive change to take place. There is a long history in psychology of viewing the assimilation of new information and the accommodation of existing knowledge in response to this new information as working hand in hand. Assimilation—the learning component—adds foodset itmes, but accommodation—an internal process—adds foodset-derived items. Thus, to the extent that they do work hand-in-hand, learning can be considered a kind of SOR. However, SOR is a broader concept than learning, because in the general case it operates on groups of entities, and can span biological generations. It is unlikely that the converse is true, i.e., that evolution by SOR could work through social learning alone, with no individual learning, because there would be nothing to socially transmit. As demonstrated in a computational model of cultural evolution, turning off social learning resulted in cumulative adaptive change (exclusively within individual agents), albeit much more slowly. However, if social learning was the only means of learning available, there was nothing to kickstart the process, so there was no cumulative adaptive change [17].
4.2. SOR versus Natural Selection
SOR is distinctly different from a Darwinian or selectionist process. The distinction between Darwinian evolution and SOR is summarised in Table 2 and illustrated in Fig. 2. SOR involves not competition and survival of some at the expense of others, but transformation of all.5 In other words, it isn’t that the fittest survive (in their entirety) while the less fit do not, but rather, all agents develop more adaptive structure (such as happens in learning and creative cognition).6 Like natural selection, SOR has mechanisms for preserving continuity and introducing novelty, but reproduction is lower-fidelity than it is for natural selection, because it is the culmination of haphazard catalyzed interactions, as opposed to the accuracy afforded by copying from a self-assembly code, as in a Darwinian process.
A self-assembly code is a set of coded instructions that is (i) actively interpreted through a developmental process to generate a soma, and (ii) passively copied without interpretation through a reproduction process to generate self-copies, i.e., new sets of self-assembly instructions that are in turn used in these two distinct ways [46, 67]. This conception of a self-assembly code grew out of Von Neumann’s work on universal constructors and relf-replicating automata [98]. In a Darwinian process, traits acquired through development of the soma and its interaction with the environment (active interpretation of the code) do not affect traits vertically transmitted through reproduction (passive uninterpreted use of the code). The self-assembly code passed down through the germ-line is sequestered, i.e., protected from the influence of acquired traits [46]. Thus, the division of labour between these two ways of using the self-assembly code is responsible for the signature characteristic of a selectionist process: lack of transmission of acquired traits. As a result, while acquired traits (those obtained during an organism’s lifetime, such as a tattoo, or knowledge of a recipe) may affect the soma, and potentially affect interpretation of the self-assembly instructions, they do not affect the content of the self-assembly instructions. Therefore, though inherited traits (e.g., blood type) get transmitted vertically from parent to offspring by way of genes, acquired traits do not. Thus, it is the division of labour between these two ways of using the self-assembly code that is responsible for the sine qua non of a Darwinian process: lack of transmission of acquired traits.
Now let us turn to the earliest structures that could be said to be alive, prior to the evolution of something as complex as a DNA- or RNA-based self-assembly code. Without a self-assembly code, there were no vertically inherited traits; all change was horizontally transmitted (i.e., acquired). Therefore, the evolution of early life was not due to differential replication of heritable variation in response to selection; it was non-Darwinian [23, 97, 104].
The situation is analogous for culture. Human culture does not (currently) involve a selfassembly code that gets used in these two distinct ways, and as such, it does not exhibit this signature trait of a selectionist process. No traits are inherited by way of a self-assembly code; all change is acquired, and acquired change is horizontally transmitted.8 Although some elements of culture, such as kinship terminologies, have a generative logic, they do not use a self-assembly code: a set of coded instructions that is (i) actively interpreted through development to generate a soma, and (ii) passively copied without interpretation through reproduction to generate self-copies, i.e., new sets of self-assembly instructions that are in turn used in these two distinct ways [99].
One might also be tempted to suggest that natural language is a cultural self-assembly code. However, (1) although natural language involves encoding, it is not a set of encoded instructions for the self-replication of natural languages, and (2) language evolution does not exhibit the signature characteristic of evolution by way of a self-assembly code: lack of transmission of acquired traits. Though some have argued that humans are biologically predisposed to achieve language, language itself is characterised by horizontal—not vertical—transmission. Therefore, language evolution is not due to the mechanism Darwin proposed: differential replication of heritable variation in response to selection [21, 26]. Results from computational modelling suggest that to cross the ‘Darwinian threshold’ from non-selectionist to selectionist evolution requires the emergence of a self-assembly code [97]. There is no evidence that language or any other component of culture has crossed this threshold, though it is not impossible that culture is moving toward a ‘cultural Darwinian threshold’; in other words, it may exist in the state biological life was in before the last universal common ancestor [103].
We posit that both early life and culture evolve not through a Darwinian process but through SOR. A Darwinian explanation works in biology to the extent that retention of acquired change is negligible compared with retention of selected change; otherwise, the first (which can operate instantaneously) can overwhelm the second (which takes generations). Moreover, the period we associate with biological generations, from birth through to reproductive maturity and parenthood, is in general significantly longer than the stretch of time between when an individual acquires a cultural trait (e.g., an idea) and then expresses (their own version of, or their own take on) that cultural trait. That is, although enculturation is a lifelong process, a particular item of cultural information can be transmitted almost instantaneously, and hundreds may be transmitted back and forth during a conversation. This can make the relative speed of Darwinian evolution slower still.
4.3. SOR versus Lamarckian Evolution
As mentioned above, the term ‘Lamarckian’ is widely understood to refer to the transmission of characteristics to offspring that were acquired through use or disuse during the entity’s lifetime. According to interpretations of Lamarckism that specify that Lamarckian evolution requires genetic transmission to biological offspring (e.g., [55]), SOR would not be classified as a form of Lamarckian evolution, as it is not restricted to biologically evolving entities. According to less restrictive interpretations of Lamarckism that encompass cultural as well as biological change (e.g., [72]), SOR could be considered a form of Lamarckian evolution. However, unlike Lamarckism, SOR is precise about the kind of structure required (specifically, SOR requires RAF networks), and why this mechanism makes cumulative, adaptive change possible (as described above).
4.4. SOR versus the neutral theory of molecular evolution
The neutral theory of molecular evolution posits that much genetic variation in populations is the result of mutation and genetic drift, not selection [60, 61]. Random genetic drift in geographically isolated populations can lead to new forms appearing and others disappearing, without selection. Neutral change and drift are also observed in cultural evolution [41, 82]. However, drift is as likely to be disadvantageous as it is to be advantageous—it is (statistically) neutral. In contrast, SOR is not (statistically) neutral. Like natural selection, SOR is a mechanism of cumulative, adaptive change.
5. Implications
The feasibility of evolution in the absence of variation and selection —i.e., evolution by SOR— and the claim that early life and cultural evolution are both promising candidates for this second form of evolution, imply that we must be cautious about applying concepts and methodologies developed in a Darwinian evolutionary context in these two domains. Since biological acquired traits are usually (though not always) discarded, and since a self-assembly code must stay intact to preserve its self-replication capacity, the joining of bifurcations in biological lineages is infrequent; thus, a phylogenetic tree correctly captures its branching structure. Speciation makes inter-lineage transfer of information relatively rare in biological organisms. By comparison, since cultural acquired traits are not discarded, and there is no cultural self-assembly code, the joining of bifurcations in cultural ‘lineages’ is commonplace, and thus cultural ‘lineages’ tend to be network-like rather than tree-like [22, 69, 93]. These distinctions become clear in phylogenetic studies. For multicellular species, phylogenetic networks calculated for different protein or RNA sequences tend to be tree-like. However, phylogenetic trees constructed for different protein or RNA sequences across protists, bacteria or viruses reveal much more blending and greater network-like organisation [86]. Gene expression profiles exhibit cross-similarity and greater convergent evolution as well. Since cultural relatedness frequently arises through horizontal (inter-lineage) transmission, there is extensive blending of knowledge from different sources.
Another (related) problem that arises when methods developed for selectionist evolutionary processes are applied to culture is due to convergent evolution, in which similar forms arise independently because they are alternative solutions within similar design constraints. Examples include (i) the body shape and structure similarity between the Tasmanian tiger and the fox, (ii) wasp-imitating hover flies and (iii), the origin of basic and more complex brain structure across the tree of life [73, 106]. Because biological organisms must solve many problems (reproduction, locomotion, digestion, etc.), the probability that a species will be mis-categorised because of convergent evolution (i.e., on the basis of how it solves any one problem) is low. Cultural artifacts, on the other hand, are generally constructed with a single use in mind (though artifacts developed for use in one context may be used to solve other problems; for example, a screwdriver may be used to open a can of paint.) Thus, for cultural outputs, the probability of mis-categorisation arising through the assumption that similarity reflects homology is significantly higher. Therefore, the cost of proceeding as if a Darwinian framework were applicable to culture when it is not is high. Some have claimed that in practice this does not invalidate a phylogenetic approach to culture [39]. However, such conclusions come from analyses of datasets that involve little horizontal transmission (indeed, the creative blends that are the source of cultural novelty are often treated as ‘outliers’ and are intentionally discarded from analysis). This problem does not arise with SOR because it does not assume that superficially similar artifacts are homologous. Indeed, viewing culture from the perspective of SOR suggests that what is evolving is the structures of the conceptual networks that generate artifacts, not the artifacts themselves [66].
Such considerations have led some to develop network-based models of cultural evolution [5, 12, 17, 31, 30, 62, 69, 96]. This body of research suggests that horizontal transmission can significantly alter the pattern of relatedness. For example, a network-based analysis of Baltic psaltery data that incorporated not just superficial physical attributes but also abstract conceptual attributes (such as markings indicative of sacred symbolic imagery), it was possible to resolve ambiguities arising from a phylogenetic analysis and generate a lineage more consistent with other historical data [96]. Horizontal cultural transmission may involve change in superficial features despite the preservation of deep structure, as occurs in metaphor [64], analogy [36, 47], and cross-domain transfer, in which a source from one domain (e.g., music) inspires or influences a creative work in another (e.g., painting) [78, 83]. This kind of complexity and hierarchical structure cannot be captured without taking a network approach to cultural evolution, which provides further support for the theory that culture evolves through SOR.
Interestingly, similar issues arise with the simplest life forms. Because of phenomena such as mutualism, lineage reticulation (due to horizontal gene transfer and allopolyploidy—the combining the genomes of different parental species), certain traits evolve with astonishing speed, thereby diminishing the continuity and distinctiveness of species [77, 104]. Indeed, the stability of genetic information is so compromised that sequencing three Escherichia coli genomes revealed that fewer than 40% of the genes were common to all three [100]. As a result, the boundaries between many prokaryote species are fuzzy, and exhibit reticulate patterns of evolution, thus calling into question the appropriateness of the notion of the “tree of life”[45, 54, 71]. The limitations of Darwinism as an explanation of the forms and dynamics of living things is increasingly recognised, and the role of epigenetic processes has become increasingly appreciated. Nevertheless, because reticulate (horizontal) proceses are much less present in biological evolution than cultural evolution, natural selection provides a reasonable approximation.
6. Concluding comments
Although natural selection is a theory of evolution (a spectacularly successful one in biology), the term ‘evolution’ is sometimes assumed to be synonymous with variation and selection. Even in research on cultural evolution—which is sometimes (though not universally) regarded as Lamarckian—it is generally assumed that variation and selection are essential for culture to evolve. Using RAF networks, this paper demonstrates by way of example, or ‘existence proof’ that evolution is possible in the absence of variation, selection, and competition. We refer to this kind of evolution as Self-Other Reorganisation (SOR) because it occurs not through competitive exclusion such that only the fittest reproduce, but through the assimilation, restructuring, and exchange of components. SOR is a primitive form of evolution that can generate change quickly since it does not require the discarding of acquired traits, and it can take place in the absence of birth or death. Because it does not use coded self-assembly instructions, it is more haphazard than natural or artificial selection, but sufficient for cumulative, adaptive change.
RAFs have proven useful in two areas of research that might appear to be quite distinct, but that both involve networks and evolutionary processes. In the OOL, the underlying reaction system is a biochemical reaction network, and in the OOC it is a conceptual network. Since cultural evolution lacks a self-assembly code, and lacks the signature characteristic of Darwinian evolution—discarding of acquired traits at the end of a generation—it seems reasonable to investigate the extent to which the OOL and the OOC share a common theoretical framework. This is consistent with the proposal that the entities that evolve through culture are not discrete artifacts or units of behaviour such as songs, jokes or memes, but the minds that generate them [21], which have a structure that is self-organising and self-maintaining [40, 70], and therefore lends itself to the application of RAF formalisms [32, 34, 33]. We suggest that SOR may be operative during the early stage of any evolutionary process, that it was responsible for both the origin and early evolution of both organic life and human culture, and that RAFs provide a means of modeling evolution through SOR. The approach offers an established formal framework for integrating biological, cultural, and cognitive proceses, and embedding this synthesis in the study of self-organizing structures and their role in evolutionary processes. The framework suggests avenues for future research, such as elaboration of the RAF model to include how new elements might be synthesized from the foodset elements and previous foodset-generated elements, and to describe how new elements might be secreted from one RAF and incorporated into another.
SOR is a broader concept than learning, as it can produce adaptive change in groups of entities, and it can span biological generations. Competition may occur in SOR, but unlike selectionist evolution it is not essential. Because SOR does not require competition or variation, it may increase homogeneity among members of a culturally evolving group of individuals by increasing the amount of shared knowledge and experience among group members. It may thereby play an important role in fostering group identity, cohesion, and cooperation [99] (see also [2]).
7. Acknowledgements
This research was funded by Grant 62R06523 from the Natural Sciences and Engineering Research Council of Canada. We thank the editor and reviewers for helpful comments and suggestions on the manuscript.
Footnotes
Minor typos corrected and one reference corrected.
↵1 The term ‘selection’ is used in an informal sense in a way that is synonymous with ‘choosing’ or ‘picking out’. One could say that selection—-in the everyday sense of the term-—occurs in a competitive marketplace through the winnowing out of inferior products. However, the term ‘selection’ is used here in its scientific sense.
↵2 Although we use the term ‘mental representation,’ our model is consistent with the view (common amongst ecological psychologists and in cognition and quantum cognition communities) that what we call mental representations do not ‘represent’ but instead act as contextually elicited bridges between mind and world.
↵3 It is a matter of time before one of the entities encountered s; the more abundant s is, the sooner this chance encounter. The encounter with s may result from movement of s into the environment, or discovery of a previously uncharted part of the environment rich in s.
↵4 The word ‘percolates’ is used here to refer to the ‘communal exchange’ (between-entities) component of the process, as opposed to the ‘self-organization’ (within-entities) component of the process. If one entity encounters p before another, that first entity may temporarily function more efficiently. We note also that the scenario could unfold such that each entity is affected solely by its own products, and thus, after the impact of the initial generation of p by i, there could be little interaction, direct or indirect, amongst them. In other words, it is possible that j is affected by the p′ it generated itself (as when someone is affected by looking at their own art). However, it is also possible that j is affected by the p′ generated by another entity in
(as in when someone is affected by looking at art made by someone else). Finally, we note that possession of a reactant (which in the cognitive scenario is a MR of a particular product) does not guarantee catalysis of that reactant. For example, someone may have a memory of an artwork, but while the initial experience of it yielded little cognitive restructuring, the second experience of it triggers a productive stream of thought.
↵5 Although SOR is an adaptive evolutionary process, communal exchange is not necessarily beneficial to the recipient. Transmission of useful plasmids through horizontal exchange among bacteria or protists may be beneficial, but transmission of viruses may be damaging. Similarly, transmission of useful technologies may be beneficial to the recipient, but transmission of misinformation may be harmful.
↵6 It has been suggested that creative cognition occurs through a Darwinian process [7], but this theory has been sharply criticized [11, 24, 27, 92], and its current primary proponent now advocates a non-Darwinian version of the theory [87].
7 In SOR there is no replication by way of a self-assembly code. However, there is replication of a more piecemeal, imprecise kind that occurs through the transmission/duplication of components. The replication of conceptual RAF structure (e.g., when a parent teaches a child how to cook) may entail the generation of novelty (e.g., the child may intentionally double the quantity of chocolate chips), but replication has still taken place. Indeed, novelty gets introduced each generation even when organisms replicate; offspring are not identical to parents. We do not use the term ‘replicator’ here since it is often assumed that replicator evolution necessarily involves inheritance of germ-line material.
↵8 Note that vertical and horizontal transmission must be defined with respect to the relevant evolutionary process. Transmission of cultural information from parent to offspring is sometimes erroneously referred to as vertical transmission (e.g., [8]). Although the individuals in question are in a parent–child relationship with respect to their status as biologically evolving organisms, this may not be the case with respect to their status as participants in cultural evolution. Indeed, although childbirth entails one mother and one father, there is no limit to the number of ‘parental influences’ on the ‘birth’ of an idea. A related error is to say that in cultural evolution, there is a third form of transmission–oblique transmission–in which traits are transmitted from non-kin members of the parental generation (e.g., [8]). As far as cultural evolution is concerned, it is not strictly relevant whether the information comes from biological kin or non-kin. In a similar vein, although dual inheritance theorists speak of culture as a second form of inheritance [44, 81, 101, 75], the distinguishing feature of an inherited trait is that it is transmitted vertically (e.g., from parent to offspring) by way of a self-assembly code (e.g., DNA), and therefore not obliterated at the end of a generation. This is not the case with respect to cultural traits [25] (nor is it even the case for all biological traits).
References
- [1].↵
- [2].↵
- [3].↵
- [4].↵
- [5].↵
- [6].↵
- [7].↵
- [8].↵
- [9].↵
- [10].↵
- [11].↵
- [12].↵
- [13].↵
- [14].↵
- [15].↵
- [16].↵
- [17].↵
- [18].↵
- [19].↵
- [20].↵
- [21].↵
- [22].↵
- [23].↵
- [24].↵
- [25].↵
- [26].↵
- [27].↵
- [28].↵
- [29].↵
- [30].↵
- [31].↵
- [32].↵
- [33].↵
- [34].↵
- [35].↵
- [36].↵
- [37].↵
- [38].↵
- [39].↵
- [40].↵
- [41].↵
- [42].↵
- [43].↵
- [44].↵
- [45].↵
- [46].↵
- [47].↵
- [48].↵
- [49].↵
- [50].↵
- [51].↵
- [52].↵
- [53].↵
- [54].↵
- [55].↵
- [56].↵
- [57].↵
- [58].↵
- [59].↵
- [60].↵
- [61].↵
- [62].↵
- [63].↵
- [64].↵
- [65].↵
- [66].↵
- [67].↵
- [68].↵
- [69].↵
- [70].↵
- [71].↵
- [72].↵
- [73].↵
- [74].↵
- [75].↵
- [76].↵
- [77].↵
- [78].↵
- [79].↵
- [80].↵
- [81].↵
- [82].↵
- [83].↵
- [84].↵
- [85].↵
- [86].↵
- [87].↵
- [88].↵
- [89].↵
- [90].↵
- [91].↵
- [92].↵
- [93].↵
- [94].↵
- [95].↵
- [96].↵
- [97].↵
- [98].↵
- [99].↵
- [100].↵
- [101].↵
- [102].↵
- [103].↵
- [104].↵
- [105].↵
- [106].↵