Network Memetics: Formalizing the Theory through Network Science
Introduction
This document formalizes the concepts of Extended Meme Theory through the language of network science. The goal is to transition from a descriptive model to a predictive one, using the ready-made mathematical apparatus of graph theory.
Conceptual foundation: For the underlying ideas, examples, and case studies, see Extended Meme Theory – the main document of the project (30 Parts).
Table of Contents
- Part I. Why Network Science
- Part II. Basic Concepts of Network Science
- Part III. Heterogeneous Networks with Hubs and Zipf’s Law
- Part IV. Small-World Networks
- Part V. The Meme as a Node, the Association as an Edge
- Part VI. Meme “Strength” Is Centrality
- Part VII. Memeplex as a Cluster (Community)
- Part VIII. Dynamics: How Memes Activate and Compete
- Part IX. The Immune System – a Network View
- Part X. Inter-Level Transitions: Multilayer Networks
- Part XI. Communicative Asymmetry of the Graph
- Part XII. What Can Now Be Predicted
- Part XIII. Consciousness Level – a Metric of the Level of Consciousness
- Part XIV. Triple Binding – the Unity of Conscious Experience
- Part XV. The Diffusion Engine and Embedding Space
- Part XVI. Inter-Agent Exchange, Scarcity, and Critical Periods
- Part XVII. Subsumption of Competing Theories of Consciousness
- Part XVIII. Retrodiction: COGITATE (2025)
- Appendix: Key Formulas
- Key Works
Part I. Why Network Science
The problem: the theory is descriptive
The current theory of memes explains phenomena post-hoc:
- “The memeplex defends itself” – but how do we measure defense?
- “One meme is stronger than another” – in what units?
- “Clusters of memes” – where are the boundaries?
We need a transition from metaphors to measurable quantities.
The solution: network science
Network science is an interdisciplinary field that studies complex systems as networks of nodes and links. It has already been successfully applied to:
| Domain | Nodes | Links |
|---|---|---|
| Social networks | People | Acquaintances |
| Neuroscience | Neurons | Synapses |
| Epidemiology | People | Contacts |
| Semantic networks | Concepts | Associations |
The key idea
The memeplex is literally a network, not a metaphor.
- Memes – nodes (fractal: each meme consists of sub-memes)
- Associations – edges (connections between memes, $w \in [-1, +1]$)
- Strength of association – edge weight
- “Strength” of a meme – node centrality
- Clusters – communities (memeplexes)
- Hub – role: an element with anomalously high centrality
This gives us:
- Rigorous definitions
- Measurable metrics
- Testable predictions
- Ready-made algorithms
In particular, the formalism allows the construction of a quantitative Consciousness Level metric CL (Part XIII), based on $\sigma_{SW}$, the activity of the Self-Model Cluster (SMC), and the balance of the memeplex, as well as the formalization of triple binding (Part XIV) – a mechanism explaining the unity of conscious experience through structural, temporal, and competitive binding.
Part II. Basic Concepts of Network Science
Graph: nodes and edges
A graph $G = (V, E)$ consists of:
- $V$ – a set of nodes (vertices)
- $E$ – a set of edges (links between nodes)
In the context of memetics:
- Node = meme
- Edge = association between memes
- Edge weight = strength of association
Key metrics
| Metric | What it measures | Formula |
|---|---|---|
| Degree | Number of connections | $k_i = \sum_j A_{ij}$ |
| Betweenness centrality | Bridge role | $C_B(v) = \sum_{s \neq v \neq t} \frac{\sigma_{st}(v)}{\sigma_{st}}$ |
| Eigenvector centrality | Connection to important nodes | $x_i = \frac{1}{\lambda}\sum_j A_{ij}x_j$ |
| Clustering | Neighbor connectivity | $C_i = \frac{2e_i}{k_i(k_i-1)}$ |
| Modularity | Partitioning into groups | $Q = \frac{1}{2m}\sum_{ij}\left[A_{ij} - \frac{k_ik_j}{2m}\right]\delta(c_i, c_j)$ |
Numerical example: a network of 5 memes
Consider a simple network:
A --- B
| |
C --- D --- E
Adjacency matrix:
| A | B | C | D | E | |
|---|---|---|---|---|---|
| A | 0 | 1 | 1 | 0 | 0 |
| B | 1 | 0 | 0 | 1 | 0 |
| C | 1 | 0 | 0 | 1 | 0 |
| D | 0 | 1 | 1 | 0 | 1 |
| E | 0 | 0 | 0 | 1 | 0 |
Computing degrees:
| Node | Degree $k$ | Interpretation |
|---|---|---|
| A | 2 | Connected to B and C |
| B | 2 | Connected to A and D |
| C | 2 | Connected to A and D |
| D | 3 | Connected to B, C, and E – hub |
| E | 1 | Connected only to D – periphery |
Node D is central: it connects parts of the network.
Part III. Heterogeneous Networks with Hubs and Zipf’s Law
Why some memes become hubs
Many real-world networks – from the internet to social connections – exhibit heavy-tailed degree distributions:
Important clarification (Broido & Clauset, 2019): A strict power law is found in only ~4% of studied networks. Alternative distributions include log-normal, power law with exponential cutoff, and Zipf-polylog (Lee et al., 2024). However, what matters for the theory of memeplexes is not the mathematical strictness of the power law, but the presence of hubs – nodes with a disproportionately large number of connections. Hubs are empirically observed in all networks with heavy tails.
As a first approximation, such networks obey a power-law distribution:
$$P(k) \sim k^{-\gamma}$$where $k$ is the node degree (number of connections), and $\gamma$ is the exponent (typically $2 < \gamma < 3$).
What this means: Most nodes have few connections, but a small number of nodes (hubs) have orders of magnitude more connections. This is not coincidence – it is a mathematical inevitability.
Mechanism: preferential attachment
Why do hubs arise? Because of the preferential attachment mechanism:
$$\Pi(k_i) = \frac{k_i}{\sum_j k_j}$$The formula means: The probability of acquiring a new connection is proportional to the number of existing connections. “The rich get richer.”
C connected to B (randomly)"] T1 --> T2["t=2: A—B—C, B—D
D chose B (B has more connections)"] T2 --> T3["t=3: A—B—C, B—D, B—E
E chose B (B has even more connections)"] T3 --> T4["t=4: B becomes a hub"] end style T4 fill:#e74c3c
Zipf’s Law: rank vs. frequency
Zipf’s law is a special case of the power-law distribution:
$$f(r) \sim \frac{1}{r}$$where $r$ is the rank of the element. The most popular element occurs twice as frequently as the second, three times as frequently as the third, and so on.
In a memeplex, this means:
| Meme rank | Relative strength | Examples in a personal memeplex |
|---|---|---|
| 1 | 1.00 | “Self” – the center of identity |
| 2 | 0.50 | Key value (family / career / faith) |
| 3 | 0.33 | Second key value |
| 10 | 0.10 | Important but non-central belief |
| 100 | 0.01 | Peripheral opinion |
How to verify that a network is scale-free
It is not sufficient to plot a log-log graph and see a “straight line.”
Why a log-log plot is needed: On a linear scale, the “tail” of the distribution (rare hubs with high degree) is practically invisible. Only a log-log scale reveals that the data follow a power law.
Correct procedure (Clauset, Shalizi & Newman, 2009):
- Estimate the parameter $\gamma$ using MLE (maximum likelihood estimation), not OLS regression
- Determine $x_{min}$ – the lower bound of the power-law regime (where the “tail” begins)
- Goodness-of-fit test (Kolmogorov-Smirnov): compare empirical and theoretical distributions
- Compare with alternatives: log-normal, exponential, stretched exponential
- Likelihood ratio test between competing models
| Test result | Interpretation |
|---|---|
| $p > 0.1$, LR favors power law | Network is likely scale-free |
| $p > 0.1$, LR favors log-normal | Heavy tails without strict power law |
| $p < 0.1$ | Power law is rejected |
Methodological position of BMC (heavy-tailed, not strict scale-free). The debate about “scale-free networks” continues: Broido & Clauset (2019) showed that only ~4% of networks pass strict tests for pure power-law; Voitalov et al. (2019) using a less strict (regularly varying) definition obtained ~65%; Serafino et al. (2021) via finite-size scaling – ~50%.
BMC does not require a strict power law. A heavy-tailed distribution is sufficient – of any form (power-law, log-normal, stretched exponential), as long as: (1) hubs exist ($k_i \gg \langle k \rangle$), (2) preferential attachment operates, (3) hub displacement is possible. The specific form of the tail is in the red zone of the Structural Equivalence Test (SET) (does not transfer between levels); the existence of hubs and their dynamics is in the green zone. A useful distinction is heavy-tailed vs. thin-tailed, not power-law vs. log-normal (Holme, 2019).
For semantic (cognitive) networks – the most relevant for BMC L1 – the power law has been confirmed more rigorously: Steyvers & Tenenbaum (2005) showed $\gamma \approx 3.0$–$3.2$. Formalization of this position: SM, Part I.
Graph interpretation:
- Steps 1–2: Find $x_{min}$ – the point where the power-law regime begins. Data to the left of it (gray zone) do not follow the power law
- Step 3: The KS test compares the empirical and theoretical distributions. A small KS distance and $p > 0.1$ means we do not reject the power law
- Steps 4–5: Compare with alternatives. A positive LLR means the power law describes the data better
Practical tool: The Python package
powerlaw(Alstott et al., 2014) implements the entire procedure.
Comparison of network types
| Characteristic | Random network | Heterogeneous network with hubs | Memeplex |
|---|---|---|---|
| Degree distribution | Poisson (bell curve) | Heavy-tailed (power law, log-normal, etc.) | Heavy-tailed |
| Typical node | Close to average | Nearly unconnected | Peripheral meme |
| Hubs | Absent | Inevitable | Meme hubs |
| Robustness to random failures | Medium | High | High |
| Vulnerability to hub attack | Low | Critical | Critical |
| Epidemic threshold | Exists ($\lambda_c > 0$) | Low (depends on distribution form) | Very low |
Numerical example: network evolution
Consider network growth with preferential attachment:
Start (t=0): 2 nodes, 1 connection
A — B
Degrees: A=1, B=1
After adding 8 nodes (t=8):
| Node | Time of appearance | Degree $k$ | Share of connections |
|---|---|---|---|
| A | 0 | 4 | 20% |
| B | 0 | 7 | 35% – hub |
| C | 1 | 3 | 15% |
| D | 2 | 2 | 10% |
| E | 3 | 2 | 10% |
| F–J | 4–8 | 1 each | 2% each |
Conclusion: Node B, which appeared earlier and gained a small initial advantage, became a hub. This is not merit – it is mathematics.
Implications for memetics
| Observation | Explanation through network heterogeneity |
|---|---|
| Meme hubs exist | Hubs are an inevitable result of preferential attachment |
| Changing a central meme is difficult | A hub has orders of magnitude more connections than the periphery |
| A new meme can become a hub | Only if the old hub has “fallen” (crisis, trauma) |
| A memeplex is resilient to random attacks | Peripheral memes can be removed without consequences |
| Targeted attack on a hub is devastating | Removing a hub fragments the network |
See also: Conceptual explanation of hubs in the memeplex – EMT, Part XI.
Percolation threshold: resilience to attacks
Percolation is the transition of a network from a connected state to a fragmented one upon the removal of nodes or edges. The key parameter is the percolation threshold $f_c$: the fraction of nodes that must be removed to destroy the giant component.
For targeted attack (removing nodes in order of decreasing degree):
$$f_c = 1 - \frac{1}{\kappa - 1}$$where $\kappa = \frac{\langle k^2 \rangle}{\langle k \rangle}$ is the network heterogeneity coefficient.
For networks with heavy tails: the more heterogeneous the network (larger $\kappa$), the higher the resilience to random failures and the lower the resilience to targeted attacks.
Note: The exact values of $f_c$ depend on the distribution form. For strict power law with $\gamma < 3$: $\kappa \to \infty$, $f_c \to 1$. For log-normal and other heavy tails, the effect is qualitatively the same but quantitatively weaker.
This means: to destroy a heterogeneous network with hubs, one must remove nearly all hubs.
| Attack type | Random network | Network with hubs |
|---|---|---|
| Random (random failure) | $f_c \approx 0.5$ | $f_c$ is high (resilient) |
| Targeted (targeted attack) | $f_c \approx 0.5$ | $f_c$ is low (vulnerable) |
Practical implication for memeplexes: Destabilizing a memeplex requires a precise attack on hubs. Random criticism of peripheral beliefs will not destroy a memeplex – but an attack on central identity memes can trigger a cascading fragmentation.
Percolation in signed networks. Standard percolation assumes that removing a hub always reduces connectivity. In signed networks ($w \in [-1, +1]$), this is not necessarily the case: removing a hub with a large number of negative edges can increase effective connectivity, since inhibition of neighboring nodes is lifted. Thus, a targeted attack on an “antibody” hub (one actively suppressing a group of memes) can produce a paradoxical result – instead of fragmentation, previously suppressed nodes integrate into the main cluster. This means that for signed memeplexes, $f_c$ depends not only on topology but also on the sign distribution of edges: a memeplex with a high proportion of negative connections (polarized, “paranoid”) can be simultaneously topologically connected and functionally fragmented.
Source: Nature Reviews Physics (2024) – review of percolation theory in complex networks.
Part IV. Small-World Networks
The “small world” phenomenon
Small-world – a class of networks combining two properties:
- High clustering – neighbors of a node are often connected to each other
- Short average paths – any two nodes are connected through a small number of steps
This is the formalization of the “six degrees of separation” phenomenon (Milgram, 1967).
The Watts-Strogatz model (1998)
Starting from a regular lattice, each edge is randomly rewired with probability $p$:
At small $p$ ($\approx 0.01$–$0.1$), the network remains highly clustered, but the average path length drops sharply.
The small-worldness metric
Small-worldness coefficient (Humphries & Gurney, 2008):
$$\sigma = \frac{C / C_{random}}{L / L_{random}}$$where:
- $C$ – the clustering coefficient of the network
- $L$ – the average path length
- $C_{random}$, $L_{random}$ – values for a random network of the same size
| $\sigma$ | Interpretation |
|---|---|
| $\sigma > 1$ | Small-world network |
| $\sigma \gg 1$ | Pronounced small-world effect |
| $\sigma \approx 1$ | Random network |
Numerical example
A network of 100 nodes with average degree $k=6$:
| Parameter | Regular ($p=0$) | Small-world ($p=0.1$) | Random ($p=1$) |
|---|---|---|---|
| Clustering $C$ | 0.50 | 0.35 | 0.06 |
| Average path $L$ | 8.3 | 3.2 | 2.9 |
| $\sigma$ | – | 3.4 | 1.0 |
Small-world in memeplexes
Hypothesis: Personal memeplexes are small-world networks.
| Small-world property | Manifestation in a memeplex |
|---|---|
| High clustering | Subpersonalities (profession, family, hobbies) – dense clusters |
| Short paths | Any thought can “lead” to any other in 3–4 associations |
| “Weak ties” between clusters | Bridges between contexts (work <-> religion) |
Why this matters:
- Rapid spreading activation (information does not get stuck in a cluster)
- Modularity is preserved (clusters do not merge)
- Explains the simultaneous integration and differentiation of consciousness
- The coefficient $\sigma_{SW}$ enters as a multiplier in the Consciousness Level (CL) metric: $CL = \sigma_{SW} \cdot A_{SMC} \cdot f(Balance)$ (see Part XIII)
Source: Han et al. (2025), Brain-X – a review of small-world organization of the brain and its connection to cognitive functions.
Neuroanatomical confirmation at all scales. Small-world + hubs is not a metaphor but a measured property of the brain from the cellular to the connectome level (Bassett & Bullmore, 2006, The Neuroscientist). The Watts-Strogatz model shows: rewiring just ~1–5% of connections in a lattice radically shortens the average path while preserving clustering – explaining how a memeplex combines modularity (subpersonalities = dense clusters) with rapid global integration. Hubs (nodes with heavy-tailed $P(k) \sim k^{-\gamma}$) serve as bridges between modules; their damage is catastrophic ($f_c \approx 0.05$–$0.18$), while random losses of ~50% of nodes are survived without loss of connectivity – directly matching our predictions of hub attack vs. random failure. The universality of this pattern (gene networks, social networks, the internet) confirms cross-level isomorphism: the same topology is reproduced at all levels of BMC.
Part V. The Meme as a Node, the Association as an Edge
Concept mapping
| Meme theory | Network science | Neural analog |
|---|---|---|
| Meme (fractal: consists of sub-memes) | Node (vertex) | Cell assembly / engram ($~10^{3}$–$10^{5}$ neurons) |
| Association ($w \in [-1, +1]$) | Edge | Ensemble overlap (shared neurons) |
| Strength of association | Edge weight | Number of shared neurons x Hebbian strengthening |
| Memeplex (cluster of memes) | Subgraph / community | Large-scale functional network |
| Hub (role: high centrality) | Hub node | Meme with anomalously high degree / eigenvector centrality |
| BMC (G + M + I + S) | Complete graph + metadata | Brain + body as a whole |
Meme activation: a continuous variable with functional thresholds
Meme activation is a continuous quantity:
$$a_i(t) \in [0, 1]$$where $a_i(t)$ is the activation level of node $i$ at time $t$. Discrete “states” of a meme do not exist – a meme’s behavior is determined by the continuous $a_i$ and the orthogonal property SIT.
Important note: Node activation is always non-negative ($a_i \geq 0$), but edge weights can be negative: $w_{ij} \in [-1, +1]$. A negative weight means an inhibitory (suppressive) connection: activation of node $j$ reduces activation of node $i$.
Two functional thresholds define qualitatively distinguishable behavior:
| Threshold | Value | Function |
|---|---|---|
| $\theta_{act}$ | ~0.5 | Behavioral influence: a meme with $a_i > \theta_{act}$ participates in decision-making, spreading activation, and competition for WM |
| $\theta_{low}$ | ~0.1 | Effective zero: a meme with $a_i < \theta_{low}$ does not affect spreading activation and awaits an external trigger for reactivation |
Between the thresholds ($\theta_{low} < a_i < \theta_{act}$), a meme does not drive behavior but can perform a narrative function – creating a sense of personal continuity, serving as material for nostalgia (for details, see EMT, Part VIII).
These thresholds are not boundaries of discrete states but rather points at which the character of a meme’s influence on behavior changes.
Open meme (SIT > 0) – an orthogonal property independent of activation level. A meme-question (structural gap) has $SIT_i > 0$: SIT periodically raises $a_i$ above $\theta_{act}$ (the thought “surfaces”), then WM switches away and $a_i$ drops, but SIT prevents edges from decaying ($\lambda_{open} = 0.5\lambda$). A meme can be open at any current $a_i$ – SIT and activation are independent. Formally: edge decay is suppressed as long as $SIT_i > 0$ (see Part VIII).
Neural substrate of nodes and edges
The formalization of a meme as a graph node is not merely an abstraction. Behind each element of the network model lies a concrete neurobiological object.
Node = cell assembly (Hebb, 1949): an ensemble of $~10^{3}$–$10^{5}$ neurons that activate together. The ranges of meme activation correspond to physical regimes of the ensemble:
| Activation range | Ensemble regime | Neural substrate |
|---|---|---|
| $a_i > \theta_{act}$ | Cell assembly above excitation threshold, active synaptic transmission | Sustained firing, AMPA + NMDA receptors engaged |
| $a_i < \theta_{low}$ | Silent synapses (Isaac et al., 1995): structure preserved, function absent | NMDA receptors only, spines persist (Yang et al., 2009) |
| $\theta_{low} < a_i < \theta_{act}$ | Subthreshold activation, sparse population coding | Stochastic activation of individual neurons in the ensemble |
| SIT > 0 (any $a_i$) | SIT-driven reactivation: DMN periodically raises the ensemble above threshold | Hippocampus + mPFC generate sharp-wave ripples (SWR), reactivating the gap-ensemble |
Edge = ensemble overlap: $w_{ij} \propto |ensemble_i \cap ensemble_j| \cdot f(co\text{-}activation)$ (Cai et al., 2016, Nature). Positive weights = excitatory glutamatergic connections. Negative weights ($w < 0$) = GABAergic inhibition via inhibitory interneurons.
Winner-takes-all: lateral inhibition in the competition formula (see below, Part VIII) is realized through inhibitory circuits – basket cells and chandelier cells provide rapid suppression of competing ensembles.
For more detail: Full description of cell assemblies, neural reuse, overlapping engrams – see BM, Part III.
Sigmoid justification from the Hodgkin-Huxley model: The sigmoid $\sigma(x)$ in the spreading activation formula is not an arbitrary choice but an approximation of the nonlinear I-O curve of a real neuron. The HH model (Hodgkin & Huxley, 1952, J. Physiol.) describes the spike through 4 coupled ODEs, where voltage-gated channels create threshold dynamics: below threshold – silence, above – full action potential (all-or-nothing). The coupled feedback (V <-> gates) in HH is isomorphic to the coupling of activation <-> edge weights in the memeplex: meme activity changes weights (Hebb), weights determine activity (spreading). The refractory period after a spike explains the impossibility of continuous meme activation and the inevitable decline of $a_i$ after a peak. Our model operates at the level of cell assemblies (thousands of HH neurons), and HH justifies why the substrate S has precisely those properties (threshold behavior, nonlinearity, discrete states) that we take as given.
Ambivalence is not neutrality
With the introduction of signed weights, an important distinction arises:
- Neutrality: $\bar{w}_m \approx 0$ with weak connections – indifference
- Ambivalence: simultaneously strong + and – connections to different parts of the memeplex
High ambivalence at $\bar{w}_m \approx 0$ is an unstable state, tending toward resolution through structural balance (see Part IX).
Visualization: the memeplex as a network
The following diagram shows a memeplex of 6 memes forming two clusters:
- Cluster 1 (blue): faith, ritual, congregation – religious practice
- Cluster 2 (red): morality, taboo, myth – ethics and narrative
Numbers on edges are connection weights (association strength).
See also: Memeplexes, the life cycle of a meme – EMT, Parts V and VIII.
Part VI. Meme “Strength” Is Centrality
The problem of “strength” in the original theory
We used to say that some memes are “stronger” than others, without defining this rigorously. Now we can:
The “strength” of a meme = its centrality in the memeplex network.
Three types of centrality
1. Degree Centrality: ease of activation
$$C_D(i) = \frac{k_i}{n-1}$$where $k_i$ is the node degree and $n$ is the number of nodes in the network.
Interpretation: A meme with high degree centrality is easily activated – it has many connections, and any of its neighbors can “wake it up.”
Example: The meme “Russia” in the mind of a Russian person – connected to hundreds of other memes (language, history, family, work).
2. Betweenness Centrality: the bridge role
$$C_B(v) = \sum_{s \neq v \neq t} \frac{\sigma_{st}(v)}{\sigma_{st}}$$where $\sigma_{st}$ is the number of shortest paths between $s$ and $t$, and $\sigma_{st}(v)$ is the number of such paths through $v$.
Interpretation: A bridge-meme connects different clusters. If it is removed, parts of the memeplex become isolated.
Example: The meme “family” can be a bridge between the “work” cluster and the “religion” cluster.
3. Eigenvector Centrality: connection to the important
$$x_i = \frac{1}{\lambda}\sum_j A_{ij}x_j$$Interpretation: It is important not merely to have many connections, but to be connected to important memes. A node’s centrality is proportional to the sum of its neighbors’ centralities.
Example: The meme “status” may have few direct connections, but all of them are with key identity memes.
Numerical example: a network of 6 memes
Consider the network:
A — B — C — D — E — F
|
G
Adjacency matrix and computations:
| Meme | Degree | Betweenness | Eigenvector |
|---|---|---|---|
| A | 1 | 0.00 | 0.18 |
| B | 2 | 0.40 | 0.35 |
| C | 3 | 0.60 | 0.49 |
| D | 2 | 0.40 | 0.42 |
| E | 2 | 0.20 | 0.35 |
| F | 1 | 0.00 | 0.18 |
| G | 1 | 0.00 | 0.28 |
Conclusions:
- C is the key meme (high betweenness and eigenvector)
- An attack on C fragments the network
- G is isolated, despite being connected to the central C
Step-by-step betweenness centrality calculation
Let us examine in detail how betweenness is computed for node C in the network above.
Step 1: Find all shortest paths between pairs of nodes
| Pair | Shortest path | Length | Passes through C? |
|---|---|---|---|
| A–B | A–B | 1 | No |
| A–C | A–B–C | 2 | Endpoint |
| A–D | A–B–C–D | 3 | Yes |
| A–E | A–B–C–D–E | 4 | Yes |
| A–F | A–B–C–D–E–F | 5 | Yes |
| A–G | A–B–C–G | 3 | Endpoint |
| B–C | B–C | 1 | Endpoint |
| B–D | B–C–D | 2 | Yes |
| B–E | B–C–D–E | 3 | Yes |
| B–F | B–C–D–E–F | 4 | Yes |
| B–G | B–C–G | 2 | Endpoint |
| C–D | C–D | 1 | Endpoint |
| C–E | C–D–E | 2 | Endpoint |
| C–F | C–D–E–F | 3 | Endpoint |
| C–G | C–G | 1 | Endpoint |
| D–E | D–E | 1 | No |
| D–F | D–E–F | 2 | No |
| D–G | D–C–G | 2 | Yes |
| E–F | E–F | 1 | No |
| E–G | E–D–C–G | 3 | Yes |
| F–G | F–E–D–C–G | 4 | Yes |
Step 2: Count paths passing THROUGH C (not as an endpoint)
Paths through C: A–D, A–E, A–F, B–D, B–E, B–F, D–G, E–G, F–G
Total: 9 paths
Step 3: Compute betweenness
Total number of pairs (excluding C): $\binom{6}{2} = 15$
$$C_B(C) = \frac{9}{15} = 0.60$$Interpretation: 60% of all shortest paths in the network pass through node C. This makes C a critical “bridge” – its removal splits the network into isolated parts.
Correspondence table
| Theory term | Network metric | Formula |
|---|---|---|
| “Strong” meme | High eigenvector centrality | $x_i = \frac{1}{\lambda}\sum_j A_{ij}x_j$ |
| Easily activated meme | High degree centrality | $C_D(i) = k_i/(n-1)$ |
| Bridge meme | High betweenness centrality | $C_B(v) = \sum \sigma_{st}(v)/\sigma_{st}$ |
| Peripheral meme | Low centrality of all types | – |
| Isolated meme | Degree 0 or 1 | $k_i \leq 1$ |
See also: Meme energetics, replication as a selection criterion – EMT, Part VI.
Assortativity: who are hubs connected to?
Assortativity is a measure of the correlation between degrees of neighboring nodes. It answers the question: are hubs preferentially connected to other hubs or to the periphery?
Assortativity coefficient (Newman, 2002):
$$r = \frac{\sum_{jk} jk(e_{jk} - q_j q_k)}{\sigma_q^2}$$where:
- $e_{jk}$ – the fraction of edges connecting nodes of degrees $j$ and $k$
- $q_k$ – the degree distribution at the ends of a randomly chosen edge
- $\sigma_q^2$ – the variance of that distribution
Range: $r \in [-1, 1]$
| $r$ | Pattern | Structure | Example networks |
|---|---|---|---|
| $r > 0$ | Assortative | Hubs connected to hubs | Social networks, co-authorship |
| $r < 0$ | Disassortative | Hubs connected to periphery | Internet, biological networks |
| $r \approx 0$ | Neutral | No correlation | Random networks |
Hypothesis for personal memeplexes: Memeplexes are more likely assortative – the identity core (central memes) is densely interconnected, forming a stable “elite club.”
| Property | Assortative memeplex | Disassortative memeplex |
|---|---|---|
| Identity core | Dense, interconnected | Dispersed, via periphery |
| Vulnerability | Attack on one core threatens all | Each hub is isolated |
| Cognitive style | Integrated thinking | Compartmentalized |
Methodological note: For networks with heavy tails, the standard coefficient $r$ can be biased. Modern approaches use rank-based assortativity (J. Complex Networks, 2025), which is robust to heterogeneity in degree distributions.
Part VII. Memeplex as a Cluster (Community)
Modularity: partitioning into groups
Modularity $Q$ measures how well a network is partitioned into dense clusters with weak connections between them:
$$Q = \frac{1}{2m}\sum_{ij}\left[A_{ij} - \frac{k_ik_j}{2m}\right]\delta(c_i, c_j)$$where:
- $m$ – total number of edges
- $A_{ij}$ – element of the adjacency matrix
- $k_i, k_j$ – node degrees
- $\delta(c_i, c_j) = 1$ if nodes are in the same cluster
Range: $Q \in [-0.5, 1]$. High values ($Q > 0.3$) indicate a clear cluster structure.
Clarification for signed networks: Modularity $Q$ is computed on the subgraph of positive edges ($w > 0$). Negative edges encode rejection, not community membership, and including them distorts the cluster structure. This is consistent with the approach of Traag & Bruggeman (2009) to modularity in signed networks.
Memeplexes within a memeplex
A personal memeplex is not a monolith but a hierarchy of clusters:
Dashed lines represent weak ties between clusters (bridges).
Modularity visualization
Numerical example: computing modularity
A network of 8 nodes, 2 clusters:
Cluster 1: {A, B, C, D} – 5 internal edges Cluster 2: {E, F, G, H} – 5 internal edges Inter-cluster edge: D–E
Computation:
- Total edges: $m = 11$
- Within clusters: 10 edges
- Expected randomly: ~5.5 edges
This is high modularity – a clear separation.
Prediction: modularity and memeplex behavior
| Modularity | Characteristic | Prediction |
|---|---|---|
| High ($Q > 0.4$) | Clear clusters | Resilience, but rigidity |
| Medium ($Q \approx 0.2$–$0.4$) | Balance | Adaptiveness |
| Low ($Q < 0.2$) | Blurred boundaries | Flexibility, but vulnerability |
Hypothesis: Modularity increases with age, explaining age-related rigidity.
Network motifs: recurring connection patterns
Network motifs are small subgraphs that occur in a network significantly more often than in random networks. They represent the “building blocks” of complex networks.
Main motif types:
| Motif | Structure | Function | Hypothesis for memeplexes |
|---|---|---|---|
| Triangle | 3 interconnected nodes | Resilience, redundancy | Belief core: “God – morality – congregation” |
| Feed-forward loop | A -> B -> C, A -> C | Noise filtering, delay | Logical chain: “fact -> interpretation -> conclusion” |
| Bi-fan | 2 sources -> 2 targets | Combinatorial processing | Context integration: “work + family -> priorities” |
| Chain | A -> B -> C -> D | Sequential processing | Narrative, causal chain |
Hypothesis: Specific motifs correspond to “thinking templates”:
- Excess of triangles -> rigid, dogmatic thinking (everything mutually supports everything)
- Excess of feed-forward loops -> analytical, causal thinking
- Excess of bi-fans -> integrative, contextual thinking
Clinical application: Studies show that motif patterns in cognitive networks differ in neurodegenerative diseases (Alzheimer’s) and in healthy individuals (Battiston et al., multilayer motif analysis).
Source: Milo et al. (2002), Science – the first description of network motifs; for modern methods see the literature on multilayer networks.
Modularity dynamics: splitting and merging of memeplexes
Modularity $Q$ is not a static parameter but a dynamic variable describing processes of memeplex splitting and merging (see EMT, Part XIX).
Splitting: $Q$ rises -> bifurcation
When a memeplex spreads into an environment with a different G-layer (different geography, culture, political context), some memes cease to match observed reality -> dissonance grows -> inter-cluster connections weaken -> $Q$ rises.
$$\frac{dQ}{dt} = \alpha \cdot D(P, Reality_{local}) - \beta \cdot \sum_{cross} w_{ij}$$Splitting = the moment when $Q > Q_{crit}$ and two subclusters become independent memeplexes.
Formalization:
SPLITTING(Memplex P, Environment G_new):
for each meme m in P:
D_m = dissonance(m, G_new) // mismatch with local reality
if D_m > theta:
weaken cross-cluster edges of m: w_ij *= (1 - eta * D_m)
Q_new = compute_modularity(P)
if Q_new > Q_crit:
{P1, P2} = partition(P, max_modularity) // optimal partition
// P1, P2 retain a shared core (shared hubs) but have different peripheries
return P1, P2
Merging: $Q$ falls -> boundary dissolves
When two memeplexes $A$ and $B$ encounter a reality that neither covers adequately, but $A \cup B$ does -> cross-connections strengthen -> $Q(A, B)$ falls -> the boundary dissolves.
$$\frac{dQ_{AB}}{dt} = -\gamma \cdot [D(A, Reality) + D(B, Reality) - D(A \cup B, Reality)]$$If $D(A \cup B) \ll D(A) + D(B)$ (together they cover more), merging accelerates.
Formalization:
MERGING(Memplex A, Memplex B, Environment):
for each pair (a, b) where a in A, b in B:
if compatible(a, b): // positive connection possible
w_ab += eta * relevance(a) * relevance(b) // cross-connections
Q_AB = compute_modularity(A union B)
if Q_AB < Q_merge:
C = merge(A, B)
// Hubs of A and B are encapsulated as subclusters of C
return C
Encapsulation: inheritance without loss
During merging, hubs of the original memeplexes are not destroyed but encapsulated – they become subclusters of the new memeplex. This means that the $Q$ of the new memeplex > 0 (it does not become a monolith), but $Q < Q_{crit}$ (subclusters are connected, not isolated).
$$Q_{merged} \in (0, Q_{crit})$$If the dominant memeplex weakens ($Q$ of the new memeplex rises), suppressed hubs reactivate: the encapsulated subcluster becomes an autonomous memeplex again -> “resurrection.”
Q = 0.3"] -->|D with reality| SP2["Q rises
Q = 0.5"] SP2 -->|Q > Q_crit| SP3["P1 + P2
(shared core)"] end subgraph MERGE["Merging: Q falls"] MG1["A (Q_A = 0.3)"] --> MG3["A union B
Q_AB = 0.25"] MG2["B (Q_B = 0.3)"] --> MG3 MG3 -->|Q < Q_merge| MG4["C = merge(A,B)
hubs of A,B encapsulated"] end style SP3 fill:#e74c3c style MG4 fill:#27ae60
Predictions:
| Prediction | Test |
|---|---|
| Before the split of a political party / religious denomination, the $Q$ of the semantic network rises | Longitudinal: semantic networks before/after the split |
| When scientific disciplines merge, the $Q$ of the resulting network is lower than the average $Q$ of the originals | Comparing semantic networks before/after interdisciplinary synthesis |
| “Resurrected” memeplexes have the same cluster structure as before suppression | Comparing semantic networks of revived vs. original cultural traditions |
Counter-intuitive predictions:
1. Splitting increases CL. A rigid memeplex (high $Q$) has poor small-world properties: clusters are isolated, $L$ is large, $\sigma_{SW}$ is low (relationship between modularity and functional integration: Bertolero, Yeo & D’Esposito, 2015, PNAS; Sporns & Betzel, 2016, Annual Review of Psychology). During splitting, each fragment is a dense subgraph with good internal connectivity and short average path -> $\sigma_{SW}$ of each fragment is higher than that of the original rigid memeplex -> CL rises. Prediction: people who have experienced a worldview collapse (loss of religious faith, exit from an ideological organization) report increased subjective clarity of consciousness during the transitional period, not confusion. Testable: clarity-of-thinking scales (Mindful Attention Awareness Scale) + PCI in people undergoing deconversion. Counter-intuitive: fragmentation is associated with confusion, but BMC predicts clarity.
2. Merging temporarily decreases CL. When two memeplexes merge, the combined structure initially has a high average path $L$ (no short routes between former subclusters) and suboptimal clustering -> $\sigma_{SW}$ drops -> CL decreases. Prediction: cultural integration (immigrants assimilating a second culture; scientists mastering an adjacent discipline) is accompanied by a temporary cognitive “fog” – a decrease in cognitive flexibility and subjective clarity, followed by recovery as cross-cluster connections form. Testable with longitudinal cognitive tests (Stroop, task switching) in bicultural individuals during acculturation. Counter-intuitive: broadening one’s horizons is associated with enrichment, not temporary decline.
3. Critical memeplex size $N_{crit}$ for splitting. Splitting requires that two viable sub-communities can form within the memeplex (each with a sufficient number of hubs and connections for autonomous existence). At $N < N_{crit}$, this is impossible: the memeplex is too small to divide into viable fragments. Prediction: simple belief systems (few memes, few clusters) are more resistant to fragmentation than complex ones. A religion with 5 dogmas will split with more difficulty than one with 50. A small sect is more stable than a large denomination. Testable: correlation between $N$ (number of doctrinal elements) and the frequency of historical schisms for religious movements. Counter-intuitive: intuition suggests that complexity = stability (more connections = stronger). BMC predicts the opposite: complexity = more degrees of freedom for $Q > Q_{crit}$.
Part VIII. Dynamics: How Memes Activate and Compete
Spreading Activation: propagation along connections
When an external stimulus (trigger) activates a meme, activation spreads along connections:
$$a_i(t+1) = \sigma\left((1 - \lambda) \cdot a_i(t) + \sum_j w_{ij} \cdot a_j(t)\right)$$where:
- $a_i(t)$ – activation of node $i$ at time $t$
- $w_{ij} \in [-1, +1]$ – the weight of the connection between $i$ and $j$ (can be negative)
- $\lambda$ – decay coefficient (applied to the node’s own state)
- $\sigma$ – sigmoid: $\sigma(x) = \frac{1}{1 + e^{-\beta(x - x_0)}}$, clamping the result to $[0, 1]$
When $w_{ij} < 0$: activation of node $j$ suppresses node $i$. This models not only competition (lateral inhibition) but also active rejection – aversion to an idea, negative priming (Tipper, 1985).
Note: Decay $(1-\lambda)$ is applied to the node’s own state, not to the result of activation. The sigmoid guarantees $a_i \in [0, 1]$ (activation cannot be negative but can be suppressed to zero). This corresponds to the classic spreading activation model (Collins & Loftus, 1975).
Neural substrate of the formula:
| Formula element | Neurobiological analog | Source |
|---|---|---|
| $\sigma(\cdot)$ – sigmoid | Sigmoidal I-O curve of the neuron (firing rate vs. input current) | Wilson & Cowan 1972 |
| Threshold $\theta$ ($x_0$ in $\sigma$) | Resting membrane potential -> action potential (~-55 mV) | Hodgkin & Huxley 1952 |
| $w_{ij} > 0$ | Glutamatergic (excitatory) synapses | – |
| $w_{ij} < 0$ | GABAergic (inhibitory) interneurons – lateral inhibition | – |
| $(1-\lambda) \cdot a_i(t)$ | Leak current: spontaneous decay of the membrane potential | – |
Connection to Hopfield networks: Our spreading activation is a continuous version of discrete Hopfield inference: $x_i \leftarrow \text{sign}(\sum_j w_{ij} x_j)$ (Hopfield, 1982, PNAS). The difference: we use a sigmoid instead of the sign function, yielding continuous states $a_i \in [0,1]$. The memeplex can be described as an energy landscape: stable meme configurations (subpersonalities) = attractors (local energy minima), recall = “rolling down” to the nearest attractor, SIT = high energy of an unclosed gap. The capacity of a Hopfield network is ~0.14N (Amit et al., 1985), imposing a theoretical ceiling on the number of stable memeplex configurations. False minima (spurious states) – a direct analog of false closure: the memeplex can “roll down” to an explanation that never existed in the training data – superstitions, conspiracy theories, pathological beliefs.
Spreading visualization
Competition: winner-takes-all
When multiple memes are activated simultaneously, lateral inhibition occurs:
$$a_i(t+1) = \max\left(0, \; a_i(t) - \alpha \sum_{j \neq i} a_j(t)\right)$$where $\alpha$ is the inhibition coefficient. The function $\max(0, \cdot)$ guarantees non-negative activation.
Alternative formulation (softmax normalization):
$$a_i(t+1) = \frac{e^{a_i(t) / T}}{\sum_j e^{a_j(t) / T}}$$where $T$ is a “temperature” parameter (as $T \to 0$, we obtain hard winner-takes-all).
Result: the most activated meme suppresses the rest.
Delta-modulated salience: Absolute activation $a_i(t)$ is insufficient for WM selection. A meme on a stable plateau ($a_i(t) \approx a_i(t-1)$) is in long-term memory, not working memory. For correct WM selection, softmax is modulated by a phasic factor and habituation:
$$salience_i(t) = a_i(t) \cdot (1 + \beta \cdot |\Delta a_i(t)|) \cdot \frac{1}{1 + \eta \cdot \tau_i(t)}$$where $\Delta a_i(t) = a_i(t) - a_i(t-1)$ is the change in activation per step (phasic signal), $\tau_i(t)$ is the number of consecutive steps the meme has been in WM without significant $|\Delta a_i|$ (tenure), $\beta$ is the phasic response gain, and $\eta$ is the habituation rate. The phasic factor ensures attention capture by a node whose activation has changed (neurobiological analog: phasic dopamine, prediction error). Habituation ensures eviction of nodes on a plateau (neurobiological analog: synaptic adaptation). Whenever there is significant $|\Delta a_i| > \epsilon$, the counter $\tau_i$ resets (dishabituation). Salience determines selection into Active WM (~3–4 pointers); memes evicted from Active WM retain a synaptic trace $\psi_i$ and remain in Latent WM (see Activity-silent WM). Softmax is applied to $salience_i$ instead of $a_i$:
$$P_i(t) = \frac{e^{salience_i(t) / T}}{\sum_j e^{salience_j(t) / T}}$$Interaction with negative neighbors: In a signed network, competition is more complex. A meme with positive neighbors receives support; a meme with negative neighbors receives active resistance. The ultimate “strength” of a meme in competition depends on the balance of its positive and negative connections, not merely on the absolute number of connections.
Stochasticity and temperature (Boltzmann machine): The parameter $T$ in softmax is not merely a technical detail but a fundamental property of the memeplex (Ackley, Hinton & Sejnowski, 1985, Cognitive Science). The Boltzmann distribution $P(s) = e^{-E(s)/T} / Z$ connects the energy of a configuration with the probability of its activation. As $T \to 0$ – hard winner-takes-all (rigidity, getting stuck in false closure); at high $T$ – stochastic exploration (escape from local minima). Ontogeny: the Sponge stage = high $T$, the Museum stage = low $T$; $T$ decreases with age in parallel with the growth of modularity $Q$. Contrastive learning ($\Delta w \propto \langle reality \rangle - \langle fantasy \rangle$) is isomorphic to our wake/sleep cycle: wakefulness = positive phase (data correct the model), sleep = negative phase (free-running model, BLEND). Hidden neurons in the Boltzmann machine correspond to abstract memes (values, principles) – they have no direct sensory input but arise automatically from optimization.
Numerical example: activation from a trigger
Initial conditions:
- The trigger activates meme A with strength 1.0
- Spreading coefficient: $\beta = 0.7$
- Decay coefficient: $\lambda = 0.3$
Connection network:
| Connection | Weight |
|---|---|
| A -> B | 0.8 |
| A -> C | 0.7 |
| B -> C | 0.5 |
| C -> D | 0.6 |
Step-by-step computation:
| Step | A | B | C | D |
|---|---|---|---|---|
| t=0 | 1.00 | 0.00 | 0.00 | 0.00 |
| t=1 | 0.70 | 0.56 | 0.49 | 0.00 |
| t=2 | 0.49 | 0.39 | 0.52 | 0.21 |
| t=3 | 0.34 | 0.27 | 0.36 | 0.22 |
Result: After 3 steps, the cluster A-B-C is activated, and activation has reached D.
Temporal dynamics
Meme synthesis: graph growth
A memeplex grows not only through externally incoming memes but also through the synthesis of new memes from existing ones. Three formal operations on the graph:
1. Mutation (already described): During transmission/storage, a meme is distorted (schema storage, Fidelity < 1). This is a change to an existing node.
2. Recombination (BLEND):
$$m_{new} = blend(m_i, m_j): \quad V' = V \cup \{m_{new}\}, \quad E_{new} = \{(m_{new}, n) : n \in N(m_i) \cup N(m_j), \; P(n) > \theta_{blend}\}$$The new node $m_{new}$ inherits some of the connections of both parents. The inheritance probability $P(n)$ depends on the connection strength in the parent node. Example: “democratic socialism” inherits some connections from “democracy” and some from “socialism.”
3. Abduction/insight:
A new node is created in a zone of high betweenness potential – a structural “hole” between clusters:
$$m_{new}: \quad \arg\max_{position} \sum_{c_1 \neq c_2} \frac{1}{d(c_1, c_2)^2}$$where $d(c_1, c_2)$ is the distance between clusters. Abduction fills a “bridge” between previously unconnected areas of knowledge.
Sources: Fauconnier & Turner (2002) – Conceptual Blending; Gabora (1997) – MAV model.
Conceptual basis: Three mechanisms of synthesis – see EMT, Part I.
Neurobiological basis: BLEND during sleep – see BM, Part IV.
Heterogeneous topology and meme displacement
In a network with hubs, meme competition is not a “war” but a redistribution of connections. When a powerful new meme appears, it does not destroy old ones but “pulls” their connections toward itself.
Formula for connection loss (hypothesis of this theory):
$$\Delta k_i = -\beta \cdot \frac{k_j - k_i}{\sum_m k_m}$$where:
- $\Delta k_i$ – change in degree of node $i$
- $k_j$ – degree of the new hub (competitor)
- $\beta$ – redistribution coefficient
Methodological note: This formula is a hypothesis of this theory, not a standard result of network science. In the literature on adaptive networks (Gross & Blasius, 2008), an analogous mechanism is described through rewiring – redirecting connections depending on node states. The exact functional form requires empirical verification.
Signed edges and hub displacement. With $w \in [-1, +1]$, the degree $k_i$ of a node counts all edges, including negative ones. This creates a non-trivial effect: a hub with a large number of negative connections (many “enemies”) formally has a high degree, but its displacement by a competitor can increase network connectivity – removing inhibitory edges lifts suppression of surrounding nodes. On the other hand, an “antibody” meme (high Fidelity, $w < 0$) may be resistant to displacement precisely because of its negative connections: the memeplex needs it as an immune tool, and its displacement cost is higher than what the formula for neutral networks predicts.
Interpretation: The stronger the new hub ($k_j$) and the weaker the old meme ($k_i$), the more connections the old meme loses.
| Ordinary competition | Displacement through hubs |
|---|---|
| Memes “fight” | Connections flow |
| Loser is destroyed | Loser loses connections but exists |
| Result: replacement | Result: marginalization |
| Abrupt transition | Gradual fading |
| Old meme = enemy | Old meme = periphery |
Practical implication: Displacing a meme does not require refuting it. It is sufficient to offer a new meme with more connections – and the old one will gradually be marginalized.
Edge decay (forgetting)
Edge weights between memes are not static – they decay without activation. This is the network formalization of Ebbinghaus’s forgetting curve.
Decay formula with reactivation:
$$w_{ij}(t) = w_{ij}(t_0) \cdot e^{-\lambda(t - t_0)} + \sum_{k} \delta(t - t_k) \cdot \Delta w$$where:
- $w_{ij}(t_0)$ – edge weight at the initial moment
- $\lambda$ – decay coefficient (typically 0.1–0.5 per day)
- $\delta(t - t_k)$ – delta function: activation at time $t_k$
- $\Delta w$ – weight increment upon activation
Empirical basis:
- Ebbinghaus (1885): forgetting curve – exponential memory decay
- Thorndike (1914): decay theory, memory trace
- Temporal networks (Holme & Saramaki, 2012): formalization of edge decay in dynamic networks
Implications for memetics:
| Phenomenon | Network explanation |
|---|---|
| Forgetting (passive) | $w_{ij} \to 0$ in the absence of activation |
| Forgetting (active) | I-suppression reduces $F_i$ and weakens $w_{ij}$ – see Active forgetting and reconsolidation |
| Spaced repetition | Regular $\delta$-impulses maintain $w$ |
| Spaced repetition consolidation | Repeated reactivations reduce $\lambda_{eff}$ (LTP -> structural LTP) |
| “Remembered as if it were yesterday” | High initial $w_0$ + recent activation |
| Skill decay | Peripheral connections decay faster than central ones |
Consolidation through spaced repetition:
Each co-activation of two connected memes (both above $\theta_{high}$) reduces the effective decay rate of their connection. This models the transition from early LTP to late LTP and further to structural LTP (formation of stable dendritic spines):
$$\lambda_{eff} = \frac{\lambda}{{1 + \kappa \cdot n_{react}}}$$where:
- $\lambda$ – base decay coefficient (accounting for centrality and sign)
- $\kappa$ – consolidation factor (spaced repetition factor, typically 2.0)
- $n_{react}$ – number of co-activations accounting for the refractory period
Constraints:
- Refractory period ($\Delta t_{min} \geq 10$ steps): continuous co-presence in working memory does not count as separate repetitions. Models the neurobiological refractory period between LTP-induction sessions
- Positive edges only: consolidation ($n_{react}$) applies only to $w > 0$. Negative edges (antagonism) have different plasticity (iLTD/iLTP) and are not “learned” through repetition
- Open meme exclusion: edges incident on open memes (SIT gaps) do not accumulate $n_{react}$ – their resilience is ensured by decay resistance ($\lambda_{open} = \lambda \cdot 0.5$), not consolidation
Neurobiological basis:
| Stage | $n_{react}$ | Mechanism | $\lambda_{eff}/\lambda$ |
|---|---|---|---|
| No repetitions | 0 | Base decay (LTD, spine retraction) | 1.00 |
| Early LTP | 1–2 | AMPA phosphorylation, CaMKII | 0.33–0.20 |
| Late LTP | 3–5 | Protein synthesis, BDNF | 0.14–0.09 |
| Structural LTP | >5 | New spine growth, stable engram | <0.09 |
Modulation by consolidation level $\kappa_i$: The base $\lambda$ additionally depends on the meme’s consolidation level – sensory memes ($\kappa = 0$) decay an order of magnitude faster, while LTM memes ($\kappa = 2$) decay an order of magnitude slower (see the section on consolidation level $\kappa_i$).
Differential decay:
Not all connections decay equally. The coefficient $\lambda$ depends on:
$$\lambda_{ij} = \lambda_0 \cdot \frac{1}{1 + \alpha \cdot C(i) \cdot C(j)}$$where $C(i)$, $C(j)$ are node centralities. Connections between central memes decay more slowly.
Asymmetric decay (negativity bias):
Additionally, $\lambda$ depends on the sign of the weight:
$$\lambda_{ij} = \begin{cases} \lambda_{pos} & \text{if } w_{ij} > 0 \\ \lambda_{neg} & \text{if } w_{ij} < 0 \end{cases}, \quad \lambda_{neg} < \lambda_{pos}$$Negativity bias (Baumeister et al., 2001): negative information is processed more deeply and remembered better. Negative connections decay more slowly than positive ones. This explains the persistence of prejudices and phobias.
Molecular basis of edge decay:
Exponential weight decay is not merely a mathematical convenience but reflects concrete neurobiological processes:
| Timescale | Mechanism | Decay formula | Source |
|---|---|---|---|
| ~100 ms | Short-term depression (STD): vesicle pool depletion | $\tau \sim 0.1$ s | Zucker & Regehr 2002 |
| Hours | Long-term depression (LTD): receptor dephosphorylation | $\tau \sim$ hours | Malenka & Bear 2004 |
| Days | Protein degradation: ubiquitin-proteasome pathway | $\tau \sim$ days | – |
| Weeks | Spine retraction: loss of dendritic spines | $\tau \sim$ weeks | Yang et al. 2009 |
Synaptic downscaling during sleep (Tononi & Cirelli, 2003, SHY) is the main mechanism of PRUNE: proportional weakening of all synapses -> strong pathways survive, weak ones are erased. This is the neurobiological realization of exponential decay.
Asymmetric decay ($\lambda_{neg} < \lambda_{pos}$): the negativity bias has a molecular basis – stress hormones (cortisol, norepinephrine) enhance the consolidation of negative associations via the amygdala. Adrenaline potentiates LTP in the basolateral amygdala, ensuring more durable encoding of negative connections.
Sleeper effect (Hovland et al., 1949): when $w_{ij}(t_0) < 0$ and there is no reinforcement of the negation, the absolute value $|w_{ij}|$ decays toward zero -> renewed contact in a new context can shift the weight into positive territory. This explains why rejected ideas may be accepted later.
Numerical example:
| Time (days) | $w$ without activation | $w$ with activation on day 3 |
|---|---|---|
| 0 | 1.00 | 1.00 |
| 1 | 0.74 | 0.74 |
| 3 | 0.41 | 0.85 (reactivation) |
| 7 | 0.15 | 0.46 |
| 14 | 0.02 | 0.18 |
Parameters: $\lambda = 0.3$, $\Delta w = 0.6$.
Sign inversion
The sleeper effect is a special case of a more general mechanism of sign inversion, in which a connection abruptly changes polarity while preserving its absolute magnitude (see EMT, Part XXI).
Mechanism:
- Meme $m$ has a negative connection to meme $X$: $w_{mX} < 0$ (rejected, antibody)
- New memes $n_1, n_2, \ldots$ form positive connections to $m$ (new context, new information)
- Cumulative pressure builds: $\sum_{k} w_{n_k, m} \cdot a_{n_k}$ grows
- When $\sum w_{positive} > |w_{negative}|$ -> bifurcation: the sign flips
- Fidelity is preserved (the meme is well-studied!) -> $|w|$ remains high
- Result: the former “enemy” becomes a “hub” with the same strength
Formalization:
$$w_{mX}(t) = w_{mX}^{direct} + \sum_{k \in N(m)} w_{mk} \cdot w_{kX} \cdot a_k(t)$$Inversion occurs when:
$$\sum_{k \in N(m)} w_{mk} \cdot w_{kX} \cdot a_k > |w_{mX}^{direct}|$$Key property – bifurcation, not gradient. Inversion occurs abruptly (phase transition), not gradually (Strogatz, 2015, Nonlinear Dynamics and Chaos – saddle-node bifurcation as a model of abrupt switching). This is confirmed by psychology: a sudden change of attitude toward a person, idea, or party – it “clicks” in a single moment rather than drifting gradually.
Connection to structural balance. Sign inversion is a mechanism for restoring structural balance (Heider, 1946, J. Psychology; Cartwright & Harary, 1956, Psychological Review – generalization to signed graphs). If a triple $(m, n, X)$ has an odd number of negative edges (an unstable configuration), the inversion of $w_{mX}$ restores parity -> balance.
Predictions:
| Prediction | Test |
|---|---|
| Sign inversion correlates with the appearance of new “bridge” memes | Longitudinal: semantic networks before/after attitude change |
| Inversion speed depends on the number of new positive connections (more context -> faster) | Experimental variation of the number of “reframing” stimuli |
| After inversion, $ | w |
Counter-intuitive predictions:
1. Fidelity preservation is quantitative: $|w_{after}| = |w_{before}|$. A former enemy becomes an ally of equal intensity, not a neutral acquaintance. “The convert’s zeal” is not a cognitive bias but a direct consequence of the mechanism: inversion changes the sign but not the absolute value of the edge (fidelity = a measure of how well-studied, not of attitude). Prediction: on attitude scales (feeling thermometer, semantic differential), the intensity of attitude toward the inverted object does not decrease when the sign changes. Converted anti-communists become equally fervent communists; former atheists become equally convinced believers. Testable with longitudinal attitude measurements before/after conversion.
2. Inversion via intermediaries is more effective than direct contact. The formula $\sum_{k \in N(m)} w_{mk} \cdot w_{kX} \cdot a_k > |w_{mX}^{direct}|$ means: inversion is triggered by indirect positive evidence (through mutual acquaintances, intermediary memes), not direct positive experience with the “enemy.” Counter-intuitive: common sense says “get to know your enemy personally, and the prejudice will fade” (contact hypothesis; Allport, 1954, The Nature of Prejudice). BMC predicts that direct contact without prior reframing through intermediaries is ineffective: it increases $|w_{mX}^{direct}|$ (familiarity) but does not change the sign. This is consistent with meta-analyses of the contact hypothesis: the effect of contact is significantly enhanced in the presence of institutional support and mutual acquaintances (Pettigrew & Tropp, 2006, J. Personality and Social Psychology) – in BMC terms, these are precisely “positive intermediaries.” Testable: compare the effectiveness of (a) direct contact with an outgroup vs. (b) stories from friends/authorities about positive experience with the outgroup. BMC predicts (b) > (a).
3. Hubs invert last, but the cascade from them is strongest. A hub $h$ (high centrality, many neighbors) has a higher inversion threshold: $|w_{hX}^{direct}|$ is large (the hub is well-“studied”), and more positive intermediaries are needed to overcome it. But if the hub inverts, it itself becomes a powerful intermediary for all of its $k_h$ neighbors -> avalanche-like cascading inversion. Prediction: (a) radical worldview shifts start from peripheral beliefs and cascade toward hubs, not the reverse; (b) a direct “attack on the central belief” (frontal argumentation) is the least effective persuasion strategy; (c) the distribution of inversion times in the network has a heavy tail: a long silent period (peripheral inversions) -> a sudden cascade (the hub has inverted). Testable by analyzing temporal patterns of attitude change in social networks.
Verification on the prototype: Scenario S3 (direct attack on the hub Science_trust: $-0.3$ stimulus + FEAR/DISGUST) confirms (a): the memeplex activates an immune response (FEAR up, DISGUST up), the hub reduces activation but does not invert – the frontal attack is ineffective. In scenario S8 (sleeper effect), inversion occurs through intermediaries (CARE, SEEKING -> positive context) – an indirect path, as the model predicts. MC results: S3 pass rate 50/50 (hub held), S8 pass rate 50/50 (inversion through intermediaries occurred).
Structural Incompleteness as a Source of Persistent Activation (SIT)
Edge decay explains forgetting: unused connections weaken according to $w(t) = w_0 \cdot e^{-\lambda t}$. But unsolved problems – “what is consciousness?”, “did I lock the door?”, “how to prove the Riemann hypothesis?” – do not obey this law. They return after days, months, years without an external trigger. The current formulas do not explain this phenomenon.
Structural Incompleteness Tension (SIT) – the tension of structural incompleteness – supplements edge decay with a second mechanism: structural gaps in the memeplex generate their own SEEKING activation, independent of external stimuli.
Definition of a structural gap
Definition. A structural gap $g$ in cluster $C$ is a position in the topology of $C$ possessing high betweenness potential (many pairs of nodes whose shortest paths would pass through $g$ if $g$ were a real node), but not filled by an actual meme.
Formally: let $B^*(g, C)$ be the betweenness centrality that node $g$ would receive if it were realized. Then:
$$relevance(g) = \frac{B^*(g, C)}{\max_{v \in C} B(v, C)}$$A gap with high $relevance$ is a central “hole” in the cluster, without which many connections remain broken.
The SIT formula
$$SIT(C) = \sum_{g \in gaps(C)} relevance(g) \cdot centrality(C) \cdot (1 - closure(g))$$where:
- $relevance(g) \in [0, 1]$ – the relative betweenness potential of position $g$
- $centrality(C) \in [0, 1]$ – eigenvector centrality of cluster $C$ in the full memeplex network (the more “important” the cluster, the higher the tension from its incompleteness)
- $closure(g) \in [0, 1]$ – the degree of gap closure (0 = fully open, 1 = closed)
The Learning Progress (LP) filter
Not all unfilled gaps generate activation. A problem on which there is no progress gradually “lets go”:
$$LP(C, t) = \frac{d}{dt} closure(C, t)$$When $LP > 0$ – a sense of progress, SIT amplifies SEEKING. When $LP \to 0$ – stagnation, the SIT contribution fades. When $LP \gg 0$ suddenly (new information) – a return to the unsolved problem.
Effective SIT contribution to SEEKING:
$$SIT_{eff}(C, t) = SIT(C) \cdot f(LP(C, t))$$where $f(LP)$ is a sigmoidal filter: $f(LP) = \sigma(k \cdot LP)$, providing rapid engagement when progress appears and smooth fading during stagnation.
False closure: why humanity invented gods
SIT generates persistent cognitive tension, and the brain strives to resolve it at any cost – even by filling the gap with a node of zero empirical validity. This is false closure:
$$closure_{false}(g) > 0, \quad validity(node_g) \approx 0$$With false closure, SIT formally decreases and tension drops – but the cluster becomes vulnerable to falsification. The mechanism explains:
| Phenomenon | Structural gap | False closure |
|---|---|---|
| Religion | “Why does the world exist?” | “God created it” |
| Superstitions | “Why did something bad happen?” | “A black cat crossed my path” |
| Astrology | “Why am I the way I am?” | “I’m a Scorpio” |
| Conspiracy theories | “Why is the world unjust?” | “A secret world government” |
| Magical thinking | “How to influence the future?” | “Rituals/prayer help” |
Evolutionary logic: False closure is evolutionarily more advantageous than persistent SIT, because chronic SIT consumes cognitive resources. For survival, a wrong answer (low cognitive load) is better than an eternal question (constant drain on SEEKING). Science is the systematic replacement of false closure with valid closure, which requires cultural institutions that suppress the natural impulse to fill gaps quickly.
Contrast: edge decay vs. SIT
| Property | Edge decay | SIT |
|---|---|---|
| Direction | Weakening of connections | Generation of activation |
| Trigger | Absence of use | Structural incompleteness |
| Temporal profile | Exponential decay | Persistent (until closure or LP-collapse) |
| Requires external stimulus | No (passive) | No (endogenous) |
| Cluster dependence | Only on edge weight | On $centrality(C)$ and $relevance(g)$ |
| Neural substrate | Spine retraction, SHY | DMN, dopaminergic system |
Numerical example
Consider two clusters: one with an open gap, the other fully closed.
Cluster A (unsolved problem “how does gravity work”):
- $gaps(A) = \{g_1\}$, $relevance(g_1) = 0.8$ (central question)
- $centrality(A) = 0.7$ (important cluster)
- $closure(g_1) = 0.3$ (partial answers exist)
- $LP(A, t) = 0.1$ (slow progress)
Cluster B (mastered skill “riding a bicycle”):
- $gaps(B) = \emptyset$
- $SIT(B) = 0$ – no gaps, no tension, normal edge decay operates
theory of
relativity] --- A2[Quantum
mechanics] A1 --- A3[Gravitational
waves] A2 --- A4[Quantum
entanglement] A3 -.- GAP["? Quantum
gravity
(gap)"] A4 -.- GAP GAP:::gap end subgraph "Cluster B: SIT = 0" B1[Balance] --- B2[Pedaling] B2 --- B3[Steering] B1 --- B3 B3 --- B4[Braking] end classDef gap fill:#ff6b6b,stroke:#c0392b,stroke-width:2px,stroke-dasharray: 5 5
Cluster A generates SEEKING activation proportional to $SIT_{eff}(A, t)$, even without external stimuli. A physicist may not read about quantum gravity for years, but when a new paper appears ($LP$ spikes) – they immediately return to the problem.
See also: The biosubstrate of SIT (DMN, the extended SEEKING formula) – BM, Parts III–IV; conceptual exposition – EMT, Part VII; engineering implementation – AGI_F, Part III.
Motor learning example: tennis
Edge decay + selection manifest clearly in motor learning. Consider how a tennis player refines a stroke.
Motor variability as exploration (Dhawale et al., 2017, Annual Review of Neuroscience, DOI: 10.1146/annurev-neuro-072116-031548): the beginner generates many movement variants – this is analogous to creating edges between the meme “stroke” and various motor patterns. Variability is not noise but exploration of the solution space.
Reward-based retention (Galea et al., 2015, Current Biology, DOI: 10.1016/j.cub.2015.06.030): a dopaminergic signal upon a successful hit strengthens LTP for a specific pattern -> that variant receives a higher initial weight $w_0$ and decays more slowly.
Generalization gradient:
$$P(survive) \propto e^{-d^2/2\sigma^2}$$where $d$ is the “distance” of the variant from the successful pattern in motor space, and $\sigma$ is the generalization width.
Connection to Fidelity: Successful motor patterns -> dopaminergic reinforcement -> enhanced consolidation during sleep (Yang et al., 2014) -> high Fidelity. Unsuccessful ones -> no reinforcement -> edge decay -> variant lost.
| Learning phase | Network process | Neural mechanism |
|---|---|---|
| Beginning | Many edges with low $w$ (all variants) | High motor variability, exploration |
| Selection | $\Delta w > 0$ for successful variants, decay for the rest | Dopamine -> LTP for the “correct” pattern |
| Consolidation | Strong edges strengthened during sleep, weak ones erased | SHY + spine formation (Yang 2014) |
| Mastery | One stable pattern, $Fidelity \to 1$, $habit \to 1$ | Myelination, automatization (basal ganglia) -> Auto(S) |
Openness-gated reactivation
During Hebbian reactivation of edges (both nodes active above $\theta_{high}$), cross-cluster reactivation is modulated by openness $O$:
$$\Delta w_{react} = \begin{cases} \Delta w & \text{if } c(u) = c(v) \text{ (within cluster)} \\ \Delta w \cdot O & \text{if } c(u) \neq c(v) \text{ (between clusters)} \end{cases}$$Justification: Openness to experience (McCrae & Costa, 1999) reflects readiness to integrate “foreign” memes. With age, $O$ decreases (Roberts et al., 2006), suppressing cross-cluster reactivation. This explains the growth of modularity with age ($Q_{old} > Q_{young}$): within-cluster connections continue to strengthen, while between-cluster connections lose their restoration mechanism.
Empirical basis: Wulff et al. (2019) showed that the semantic networks of older adults have higher modularity and fewer “bridge” connections – consistent with the selective suppression of cross-cluster reactivation at low $O$.
Associative memory vs. isolated memorization
Problem: Why are some facts remembered easily and for a long time, while others are forgotten within days? Why do all mnemonic techniques work through creating associations?
Network answer: A connected meme has higher centrality -> its connections decay more slowly according to the differential decay formula:
$$\lambda_{ij} = \lambda_0 \cdot \frac{1}{1 + \alpha \cdot C(i) \cdot C(j)}$$An isolated meme (no connections) -> low centrality -> high $\lambda$ -> rapid decay. A connected meme (many connections) -> high centrality -> low $\lambda$ -> slow decay.
Empirical basis:
- Craik & Lockhart (1972), “Levels of Processing”: deep (semantic) processing -> better memorization than shallow (phonetic). Network interpretation: semantic processing creates more connections.
- Bower et al. (1969): organized material is remembered 2–3 times better than a random list. Organization = structuring connections.
- Collins & Loftus (1975), spreading activation model: memory is a network where activation spreads along connections.
Neurobiological mechanism of association:
Overlapping engrams (Cai et al., 2016, Nature) – the physical mechanism of associative linking: memories formed in close temporal windows (~6 hours) share neurons, creating a causal link. Hippocampal structures provide two complementary processes:
- Dentate gyrus = pattern separation: splitting similar inputs into unique representations (creating distinct nodes)
- CA3 = pattern completion: restoring the full pattern from partial input (reactivation by trigger)
Method of loci (memory palace) works through hippocampal spatial coding: place cells and grid cells of the entorhinal cortex provide spatial anchoring, creating additional edges to spatial memes. Each “locus” = a spatial ensemble overlapping with the target meme.
Mnemonic techniques as connection creation:
| Mnemonic technique | Network mechanism | Effect on centrality |
|---|---|---|
| Method of loci (memory palace) | +N connections to spatial memes (rooms, objects) | Degree: 1 -> N+1 |
| Imagery associations | +k connections to visual/emotional memes | Degree: 1 -> k+1, + emotional enhancement |
| Spaced repetition | $\delta$-impulses maintain $w$ (see above) | Preserves existing degree |
| Chunking | Combining into a memeplex with internal connections | Creates a dense subgraph |
| Acronyms and rhymes | +connections to phonetic patterns | Degree: 1 -> 2+ |
| Storytelling | +connections to narrative structures | Degree: 1 -> many (characters, events, logic) |
Numerical example: influence of connections on forgetting time
Using the differential decay formula with parameters: $\lambda_0 = 0.3$ per day, $\alpha = 0.5$.
For simplicity: $C(i) \approx k_i$ (degree centrality).
| Number of connections $k$ | Effective $\lambda$ | Time until $w < 0.1$ | Comment |
|---|---|---|---|
| 0 (isolated) | 0.30 | ~8 days | Quickly forgotten |
| 3 | 0.15 | ~15 days | 2x longer |
| 10 (memory palace) | 0.06 | ~38 days | 5x longer |
| 30 (deep understanding) | 0.02 | ~115 days | Nearly 4 months |
Calculation: $\lambda_{eff} = \frac{0.3}{1 + 0.5 \cdot k}$; time $t^* = \frac{-\ln(0.1)}{\lambda_{eff}} = \frac{2.3}{\lambda_{eff}}$.
Conclusion: Mnemonic techniques increase degree -> decrease effective $\lambda$ -> slow decay -> increase Fidelity.
Practical takeaway: If you want to remember something for a long time – do not try to “memorize by rote.” Instead:
- Create connections to what you already know (semantic)
- Add spatial connections (method of loci)
- Add emotional connections (vivid imagery)
- Maintain connections through activation (spaced repetition)
Fidelity is orthogonal to weight
Fidelity (storage completeness) and weight (sign of the relationship) are independent characteristics. A meme can be well-studied and yet rejected.
| $w > 0$ (accepted) | $w < 0$ (rejected) | |
|---|---|---|
| High Fidelity | Active belief | Antibody: a well-studied “enemy” |
| Low Fidelity | Vague sympathy | Vague antipathy |
Consequence for the Fidelity formula: Degree $k_m$ counts all connections of a meme – both positive and negative. A rejected meme with many negative connections can have high Fidelity because the memeplex stores it in detail for immune defense purposes.
Differentiated storage: completeness of preservation (Fidelity)
Problem: The brain has limited capacity. Storing thousands of memes in full is impossible.
Solution: Memes are stored with varying completeness (Fidelity) depending on rank and usage.
Relationship to consolidation level $\kappa_i$: Fidelity correlates with $\kappa$ but is not identical to it. A meme can be in LTM ($\kappa = 2$) with skeletal Fidelity – knowledge that has long been unused but is structurally stable (example: a forgotten foreign language with a preserved core). Conversely, a meme in STM ($\kappa = 1$) can have high initial Fidelity – detail-encoded but not yet consolidated. See the section on consolidation level $\kappa_i$.
Fidelity function:
$$Fidelity(m, t) = \frac{k_m^{\gamma}}{k_{max}^{\gamma}} \cdot e^{-\lambda_f (t - t_{last})} \cdot (1 - e^{-\beta \cdot age})$$where:
- $k_m$ – degree of meme $m$
- $k_{max}$ – maximum degree in the network
- $\gamma$ – preferential preservation exponent (typically 0.5–1.0)
- $\lambda_f$ – completeness decay rate (typically 0.01–0.1 per month)
- $t_{last}$ – time of last use
- $\beta$ – consolidation rate (typically 0.1–0.5)
- $age$ – time since first acquisition
Three components of the formula:
- Rank factor $\frac{k_m^{\gamma}}{k_{max}^{\gamma}}$ – hubs are preserved more completely
- Usage factor $e^{-\lambda_f (t - t_{last})}$ – what is unused degrades
- Consolidation factor $(1 - e^{-\beta \cdot age})$ – older memes are more stable than new ones
Three storage modes
| Mode | Fidelity | What is stored | Example |
|---|---|---|---|
| Full | > 0.7 | Core + all connections + details | Native language, primary profession |
| Skeletal | 0.3–0.7 | Core + main connections | A forgotten foreign language |
| Trace | < 0.3 | Only the core (or a fragment of the core) | “I think it started with A…” |
(lambda_f * t > threshold_1) Skeletal --> Trace: Prolonged neglect
(lambda_f * t > threshold_2) Trace --> [*]: Complete loss Trace --> Skeletal: Reactivation
(exposure) Skeletal --> Full: Intensive use
(exposure * duration) note right of Full: Fidelity > 0.7 note right of Skeletal: 0.3 < Fidelity < 0.7 note right of Trace: Fidelity < 0.3
Key example: a foreign language
Scenario: 5 years of study -> 10 years without use -> one week in the environment
| Stage | Time | Fidelity | Mode | State |
|---|---|---|---|---|
| After training | t = 0 | 0.90 | Full | Fluent command |
| After 5 years | t = 5 | 0.55 | Skeletal | “I remember, but can’t speak” |
| After 10 years | t = 10 | 0.25 | Trace | “I studied it once” |
| After reactivation | t = 10 + 7 days | 0.60 | Skeletal | “It’s coming back!” |
Why is reactivation fast (days) rather than slow (years)?
- During learning: new synapses are created (slow)
- During reactivation: existing weakened connections are strengthened (fast)
- The core (grammatical structures) is preserved — only “warming up” connections is needed
Reactivation Dynamics
$$\frac{d(Fidelity)}{dt} = \rho \cdot (1 - Fidelity) \cdot exposure(t)$$where:
- $\rho$ — reactivation rate (typically 0.1–0.5 per day⁻¹)
- $exposure(t)$ — exposure intensity (0–1)
Important property: Reactivation rate depends on current Fidelity:
- At Fidelity = 0.2: $\frac{dF}{dt} = 0.3 \cdot 0.8 \cdot 1.0 = 0.24$ per day
- At Fidelity = 0.8: $\frac{dF}{dt} = 0.3 \cdot 0.2 \cdot 1.0 = 0.06$ per day
The lower the current completeness — the faster the initial reactivation (given a preserved core).
Adaptive Value of Differential Storage
| Advantage | Mechanism |
|---|---|
| Resource economy | Rarely used memes are stored compactly |
| Rapid reactivation | The skeleton allows restoring a meme without full retraining |
| Prioritization | Hubs (high rank) are automatically stored more completely |
| Flexibility | Periphery can be updated without restructuring the core |
Neurobiological basis: Physical storage structure and neurobiology of modes — see BM, Part IV.
Meme structure: Core, connections, details — see EMT, Part I.
Application to AGI: Skeletal storage in artificial systems — see AGI_F, Part III.
Consolidation Level $\kappa_i$: Multi-Level Memory as an Emergent Property
The mechanisms of spreading activation, edge decay, Fidelity, and SIT give rise to multi-level memory as an emergent property of graph dynamics — without a separate memory module. Memory levels emerge from combinations of existing meme parameters.
Definition. The consolidation level $\kappa_i(t) \in \{0, 1, 2\}$ of meme $m_i$ is a derived discrete variable:
$$\kappa_i(t) = \begin{cases} 0 \; (\text{sensory}) & \text{if } m_i \text{ is in the S-layer, did not pass the I-filter} \\ 1 \; (\text{STM}) & \text{if } I_{passed}(m_i) \land \neg\,\text{LTM-condition} \\ 2 \; (\text{LTM}) & \text{if } n_{react}(m_i) \geq N_{crit} \lor Fidelity(m_i, t) \geq F_{LTM} \end{cases}$$$\kappa_i$ is not an independent parameter but a classifying function of $(I_{passed}, n_{react}, Fidelity, age)$. Analogy: thermodynamic state of matter (gas / liquid / solid) — derived from temperature and pressure, not an independent variable.
Notation remark. The symbol $\kappa$ in the formula $\lambda_{eff} = \lambda/(1 + \kappa \cdot n_{react})$ denotes the scalar consolidation factor (spaced repetition factor). Here $\kappa_i(t)$ is the consolidation level of meme $i$. Context distinguishes: $\kappa$ without subscript — scalar, $\kappa_i(t)$ — per-meme level.
Independence of $\kappa_i$, $a_i$, and SIT
Three meme variables — $\kappa_i$ (consolidation depth), $a_i$ (current activation), and $SIT_i$ (structural incompleteness) — are independent:
- $\kappa_i$ is determined by meme history: $(I_{passed}, n_{react}, Fidelity, age)$
- $a_i$ is determined by current dynamics: spreading activation, incoming signal
- $SIT_i$ is determined by the presence of an unclosed gap
Examples of combinations:
- $\kappa = 2$, $a_i \approx 0$: long-consolidated knowledge, currently inactive (“studied French 10 years ago”)
- $\kappa = 1$, $a_i > \theta_{act}$: fresh idea, not yet consolidated, but currently driving behavior
- $\kappa = 2$, $SIT > 0$: a question a person has been pondering for years (“what is consciousness?”)
Limitation: A meme with $\kappa = 0$ (in the S-layer) has not yet passed the I-filter — it can only have high current $a_i$ (sensory input) and cannot carry SIT.
$\kappa$-Transition Conditions
| Transition | Condition | Mechanism | Neurobio analog |
|---|---|---|---|
| $0 \to 1$ | $I_{score}(m_i) \geq \theta_I$ | I-filter passed the meme | Hippocampal encoding (E-LTP) |
| $1 \to 2$ | $n_{react} \geq N_{crit}$ OR $F \geq F_{LTM}$ | Spaced repetition / hub-status | Late LTP, CREB, structural LTP |
| $1 \to 2$ (fast) | $G_{align}(m_i) > \theta_G$ AND emotional tag | Emotional significance | Amygdalar enhancement, SWR-tagging |
How sleep replay leads to fulfilling these conditions: Each nightly replay increases $n_{react}$; cumulatively over days–weeks, core memes reach $N_{crit}$. The full process is formalized below — see Consolidation Process. | $2 \to 1$ | $F < F_{STM}$ AND $n_{react}$ not growing; OR reconsolidation destabilization ($\Delta_{PE} > \theta_{destab}$); OR sustained I-suppression ($\tau_{supp} \geq T_{suppress}$) | Deconsolidation | Spine retraction, late-LTP degradation; anisomycin-blockable lability | | $1 \to \varnothing$ | $a_i < \theta_{prune}$ for $T_{prune}$ AND $F < F_{trace}$ | Pruning | Synaptic downscaling (SHY) |
Dependence of Decay Rate on $\kappa_i$
The base $\lambda$ from the edge decay formula is modulated by consolidation level:
$$\lambda_{base}(\kappa_i) = \lambda_0 \cdot r(\kappa_i), \quad r(0) = 10, \;\; r(1) = 1, \;\; r(2) = 0.1$$Sensory memes ($\kappa = 0$) decay 10× faster than the base rate; LTM memes ($\kappa = 2$) — 10× slower. The full decay formula accounting for all modulators:
$$\lambda_{ij}^{full} = \lambda_0 \cdot r(\kappa_i) \cdot \frac{1}{1 + \alpha \cdot C(i) \cdot C(j)} \cdot \frac{1}{1 + c_{sr} \cdot n_{react}} \cdot s(w_{ij})$$where $c_{sr}$ is the spaced repetition factor (denoted $\kappa$ in the edge decay formula above, renamed to avoid collision with $\kappa_i$), $s(w) = 1$ for $w > 0$, $s(w) = \lambda_{neg}/\lambda_{pos} < 1$ for $w < 0$ (negativity bias).
Engram Allocation: Competition for Consolidation
Not all STM memes ($\kappa = 1$) transition to LTM. Memes compete for limited consolidation resources — analogous to CREB/excitability allocation in neuroscience (Josselyn & Frankland, 2015):
$$P_{cons}(m_i) \propto C_E(m_i) \cdot \bigl(1 + \gamma_G \cdot G_{align}(m_i)\bigr) \cdot \bigl(1 + \gamma_E \cdot E_{emot}(m_i)\bigr)$$where $C_E$ is eigenvector centrality (hubs have priority), $G_{align}$ is alignment with G-drives, $E_{emot}$ is emotional intensity. Highly central memes with emotional G-linkage “win” the competitive selection.
Consequence: memory linking — memes created within a close temporal window ($\Delta t < T_{link} \sim 6$ h) share co-activation → acquire joint edges → consolidate together (Cai et al., 2016, Nature).
Schema-Congruence and $\kappa$-Transition Speed
The speed of the $1 \to 2$ transition depends on the meme’s compatibility with the existing memplex:
| I-compatibility | Speed of $\kappa: 1 \to 2$ | Initial Fidelity | Encoding detail |
|---|---|---|---|
| High (congruent with schema) | Fast | Low (gist) | Few episodic details |
| Low (incongruent) | Slow | High (details) | Detailed hippocampal encoding |
| Very novel (high SIT + G) | Fast (emotional tag) | High | Emotionally colored |
U-shaped curve: both congruent and very novel material is remembered well — through different mechanisms. This coincides with the SLIMM model (van Kesteren et al., 2012): congruent → mPFC → fast integration; incongruent → hippocampus → detailed encoding.
SIT / Prediction Error as $\kappa$-Transition Gate
SIT directly determines the type of encoding:
| SIT level | Prediction Error | Effect on κ |
|---|---|---|
| Low | PE ≈ 0 | Update of existing meme, κ unchanged |
| Medium | PE moderate | Creation of new meme with $\kappa = 1$ |
| High + $G_{align}$ | PE high + emotion | Accelerated $\kappa: 1 \to 2$ (emotional tag) |
| Excessive | PE » θ | Inverted-U: not consolidation, but extinction / I-suppression |
WM as a Pointer System over $\kappa$-Levels
Working memory (top-k by salience) is not a separate storage level but a pointer system to memes at any $\kappa$-level. A WM pointer binds a meme to the current context:
$$WM(t) = \text{top-}k\bigl\{\, salience_i(t) \;:\; a_i(t) > \theta_{display} \,\bigr\}$$The capacity bottleneck is not information volume but the number of content-independent pointers (~3–4 primary + chunking). Chunking = grouping related memes (memplex) under one pointer (Oberauer, 2025, Trends in Cognitive Sciences). The next level is automatization: $wm\_cost$ decreases from 1 (chunked) to 0 (automatic), fully freeing the pointer.
WM Ontogenesis: $k_{active}(t_{dev})$
The ceiling of ~4 active pointers is a property of the mature substrate. An immature substrate has a lower ceiling: theta-gamma coupling in PFC matures gradually as myelination progresses, and the number of gamma cycles fitting within one theta cycle increases with age (Bola & Bhatt, 2024, Consciousness and Cognition; see BM: PFC myelination). Convergence at ~4 active slots in crows, macaques, and rats (Hahn et al., 2021) points to an evolutionary optimum, not a Homo sapiens artifact.
Formula:
$$k_{active}(t_{dev}) = k_{min} + (k_{max} - k_{min}) \cdot \sigma(\lambda_{dev} \cdot (t_{dev} - t_{0.5}))$$where $k_{min} \approx 1$ (neonatal substrate), $k_{max} \approx 4$ (mature substrate), $t_{0.5}$ is the half-saturation point, $\lambda_{dev}$ is the steepness. Parameters $t_{0.5}$ and $\lambda_{dev}$ are species-dependent and individually variable (analogous to $T_i$).
Empirical anchors:
| Age | $k_{active}$ (estimate) | Cognitive consequences in BMC |
|---|---|---|
| ~1 year | ~1 | No chunking: only single memes |
| ~3 years | ~2 | Simple pairs, first conditioned associations (secondary binding) |
| ~7 years | ~3 | “Age of reason”: tertiary binding, Piaget’s operations, beginning of automatization |
| ~15+ years | ~4 | Full capacity: metacognition, long chains |
| >65 (decline) | ↓ | PFC atrophy, theta-gamma decoupling → memplex rigidity, de-automatization more expensive |
Sources: Cowan, 2001; Gathercole et al., 2004, Memory; Gogtay et al., 2004, PNAS.
Consequences:
- Auto(S) is impossible at $k = 1$: $|S| \geq 2$ requires $k \geq 2$ for chain learning → automatization is impossible before ~3 years.
- Age barrier of transmission: A meme requiring $n$ simultaneous elements cannot be transmitted when $k_{active} < n$. Hence the age stratification of culture.
- Critical threshold ~7 years: $k \approx 3$ = minimum for Piaget’s concrete operations + tertiary binding. The “age of reason” is not a metaphor but a $k$ threshold.
- $k$ is a G-parameter: set by the substrate (PFC maturation), not by learning. Training improves chunking and strategies, but not $k_{active}$.
- Decline: PFC atrophy (>65) → theta-gamma decoupling → $k_{active}$↓ → compensated by stigmergy (notebooks, rituals) + highly automatized skills ($habit \to 1$) are preserved.
G→WM Competition: $k_{eff}(t)$
$k_{active}(t_{dev})$ sets the ceiling, but available WM capacity may be lower. Two channels reduce available capacity:
- G-capture (affective, subcortical): G-programs at high activation capture PFC computational resources, reducing the number of free pointers (Pessoa, 2009, TiCS). Empirically: CDA K-score drops from ~3.5 to ~2 under negative affect — a loss of ~50% WM capacity (Figueira et al., 2017, SCAN).
- Signal-capture (communicative, cortical): Signal memes (grounding, routing, fidelity maintenance) occupy WM slots, displacing navigational and survival-relevant memes. At $k_{eff} \approx 3\text{–}4$, even one signal meme = 25–33% loss of survival-relevant capacity.
Formula:
$$k_{eff}(t) = k_{active}(t_{dev}) - n_{captured}^G(t) - n_{captured}^{signal}(t)$$ $$n_{captured}^G(t) = \left\lfloor \frac{\sum_u a_u(t) \cdot w^{capture}_u}{\theta_{capture}} \right\rfloor$$$n_{captured}^{signal}(t)$ = number of WM slots occupied by active signal memes (grounding associations, routing state, fidelity maintenance). Determined by the number of active communication processes — not analogous to $w^{capture}$ (no fixed weights; depends on communicative load).
Constraint: $k_{eff} \geq 1$. At $k_{eff} \to 0$ — panic freeze: loss of deliberation, Self(t) narrows to $\{u: a_u > \theta\}$ (nearly pure G without M).
Evolutionary consequence (Language Emergence Threshold): Language = permanent M-capture of WM. It arises only when 4 conditions are simultaneously met: (1) resource surplus (rich environment), (2) $k_{eff}$ sufficient for survival + signal, (3) executive planning (converting signal to navigation), (4) memetic pressure (social density). See EMT Part XXVIII.
Capture weights:
| G-program | $w^{capture}$ | Mechanism |
|---|---|---|
| FEAR | 1.0 | Survival override: amygdala→dlPFC inhibition |
| RAGE | 0.8 | Substance P → PFC disorganization; crude attack planning preserved |
| PANIC/GRIEF | 0.7 | CRF → sustained PFC load (rumination) |
| LUST | 0.3 | Partial attentional capture |
| CARE | 0.2 | Oxytocin → minimal PFC load, compatible with cognitive activity |
| PLAY | 0 | Reverse effect: reduces FEAR/RAGE tonus → WM freed |
| SEEKING | 0 | Recruits WM (directs it), but no capture |
Remark: $T$ is already accounted for — $a_u(t)$ includes T-modulation ($a_{FEAR}$ depends on $T_{FEAR}$). $w^{capture}$ is a property of the G-program, not of the individual.
$\psi$-traces for utility nodes. G-programs leave their own synaptic traces. Dynamics are analogous to memes, but with different parameters:
$$\psi_u(t + \Delta t) = \psi_u(t) \cdot e^{-\lambda_\psi^G \cdot \Delta t}$$where $\lambda_\psi^G < \lambda_\psi^M$: emotional traces decay slower than cognitive ones (amygdala-mediated consolidation). Difference from meme $\psi$: utility nodes have no consolidation level ($\kappa$ is not applicable — G-programs are innate, not consolidated). $\psi_u$ describes temporary sensitization of the circuit, not storage.
PTSD loop. Traumatic FEAR: $a_{FEAR} \gg \theta_{act}$ → $\psi_{FEAR}$ is anomalously high. After deactivation: $\psi_{FEAR}$ decays slowly ($\lambda_\psi^G < \lambda_\psi^M$). Contextual trigger → ping → FEAR reactivation → WM-capture ($n_{captured}$↑) → $k_{eff}$↓ → inability to deliberate → no rational override → $\psi_{FEAR}$ refresh → vicious cycle. Formally: PTSD = $\psi_u$ stuck above $\theta_\psi$ with $\lambda_\psi^G \to 0$.
Link to $P_{auto}$: G-activation → $k_{eff}$↓ → $WM\_load$↑ → $P_{auto}$↑ → relapse to the most entrenched habit (see automatization). This closes the chain: affect → WM-capture → automatic behavior.
Interaction with Self(t). $Self(t) = SMC \cup WM(t) \cup \{u: a_u > \theta\}$. Utility nodes remain in the third set (not in WM), but their capture reduces $|WM(t)|$. In panic, $Self(t)$ narrows to $\{u: a_u > \theta\}$ — nearly pure G without M.
Prediction P-WM1: FEAR induction reduces CDA (K-score) by ~50%; PLAY induction does not reduce or slightly increases it. Test: CDA change-detection under fear-induction vs play-induction vs neutral (Eysenck et al., 2007, Cognition & Emotion; Stout et al., 2017, Scientific Reports).
Neurobiological basis: Theta-gamma coupling, PFC myelination, dual competition — see BM: PFC myelination, BM: G-programs and WM.
Engineering implementation: k_eff pseudocode, capture weights — see AGI_F: k_eff.
Activity-Silent WM: Synaptic Trace $\psi$
The pointer system (top-k by salience) describes active working memory — memes in the focus of attention. However, neuroscientific data show that information is stored in WM in two ways — through sustained neural activation (active WM) and through short-term synaptic traces (activity-silent WM; Stokes, 2015, TiCS; Wolff et al., 2017, Nature Neuroscience). This explains the classic discrepancy: ~3–4 active items (Cowan, 2001) + ~3–4 latent = 7±2 (Miller, 1956).
Computationally verified. Working memory capacity $k \approx 4$ has been reproduced as an evolutionary equilibrium in the BMC engine: populations with heritable WM capacity and metabolic brain cost in a delayed-reward foraging environment converge to $k \approx 4.4$ from both directions — Cowan’s $4 \pm 1$ range. Three-point sweep ($\beta_{wm}$: 0.04, 0.06, 0.08) confirmed the inverted U-curve. See DOI: 10.5281/zenodo.19310012.
Definition. For each meme $m_i$ we introduce a third continuous variable — the synaptic trace:
$$\psi_i(t) \in [0, 1]$$Three axes of a meme:
- $a_i(t) \in [0,1]$ — activation (access to attention)
- $\kappa_i(t) \in \{0, 1, 2\}$ — consolidation depth (sensory / STM / LTM)
- $\psi_i(t) \in [0,1]$ — synaptic trace (short-term synaptic potentiation)
All three variables are independent: a meme with $a_i \approx 0$ can have $\psi_i > 0$ (latent in WM — not in consciousness, but recoverable) or $\psi_i = 0$ (not in WM). Limitation: $\psi_i > 0$ is impossible at $\kappa_i = 0$ (meme has not passed the I-filter — no encoding in WM).
Dynamics of $\psi$:
$$\psi_i(t + \Delta t) = \psi_i(t) \cdot e^{-\lambda_\psi \cdot \Delta t}$$where $\lambda_\psi$ is the trace decay rate ($T_{1/2} \sim$ minutes). When a meme enters Active WM through top-k selection: $\psi_i \leftarrow 1$. Upon repeated pinging: $\psi_i$ is refreshed ($\psi_i \leftarrow \min(1, \psi_i + \delta_{refresh})$), the meme remains latent without full reactivation.
Two-compartment WM model:
| Compartment | Definition | Capacity | Status |
|---|---|---|---|
| Active WM | $\{m_i : i \in \text{top-}k_{active}(\text{salience})\}$ | $k_{active} \approx 3{-}4$ | Conscious |
| Latent WM | $\{m_i : \psi_i > \theta_\psi,\; i \notin \text{Active WM}\}$ | $k_{latent} \approx 3{-}4$ | Non-conscious, recoverable |
| Total WM | Active $\cup$ Latent | $\approx 7 \pm 2$ | Miller/Cowan paradox resolved |
Pinging — reactivation of latent memes. A contextual signal (spreading activation from active memes) can reactivate a latent meme:
$$a_i^{ping}(t) = \psi_i(t) \cdot \sum_{j \in Active\_WM} w_{ij} \cdot a_j(t)$$Reactivation condition:
$$a_i(t) + a_i^{ping}(t) > \theta_{act} \;\;\land\;\; salience_i^{ping}(t) \in \text{top-}k_{active}$$where $salience_i^{ping}$ is computed using the standard salience formula with boosted activation. Reactivation is not automatic: the meme must pass top-k competition, displacing a less salience-relevant meme from Active WM. If $a_i + a_i^{ping} < \theta_{act}$, but $\psi_i > \theta_\psi$ — the meme remains latent, and $\psi_i$ receives a refresh ($\psi_i \leftarrow \min(1, \psi_i + \delta_{refresh})$).
SIT-gap driven pinging (theoretical extension): an open SIT-gap with non-zero activation periodically generates spreading activation reaching latent memes linked to the gap’s theme. This is the BMC explanation of the Zeigarnik effect for WM: unfinished tasks are maintained in Latent WM because gap pulsation keeps $\psi_i > \theta_\psi$.
Pointer lifecycle:
| Phase | Condition | $a_i$ | $\psi_i$ | Status |
|---|---|---|---|---|
| Allocation | Meme wins top-k competition | $> \theta_{act}$ | $\leftarrow 1$ | Active WM |
| Active | Meme is held in top-k | $> \theta_{act}$ | $= 1$ (refresh) | Active WM |
| Eviction | Displaced by more salience-relevant meme | $\downarrow$ (decay) | decay begins | $\to$ Latent WM |
| Latent | $\psi_i > \theta_\psi$, $a_i < \theta_{act}$, $i \notin$ top-k | $< \theta_{act}$ | $> \theta_\psi$ | Latent WM |
| Freed | $\psi_i < \theta_\psi$ | $< \theta_{act}$ | $< \theta_\psi$ | Not in WM |
$\psi$ and chunking: When a chunk (memplex linked by a single WM pointer) is deactivated, all member memes receive $\psi_i = \psi_{chunk}$. When one meme from the chunk is pinged — reactivation spreads to the entire chunk through strong internal edges. Chunking effectively increases Latent WM capacity: a single $\psi$-trace addresses an entire memplex.
Latent WM and coherence. The coherence constraint ($Binding_{comp}$, Part XIII) applies only to Active WM — memes in the focus of attention must be mutually non-contradictory. Latent WM is not constrained by coherence: latent memes are not in consciousness, and contradictions between them are not experienced. Upon pinging and reactivation, a latent meme enters Active WM and is subjected to competitive binding — it may be rejected if $D(m_i, m_j) > \theta_D$ with memes in Active WM. This explains “intrusive thoughts” — a latent meme with high $\psi$ and strong ping is reactivated despite conflict with the current Active WM, causing a sensation of dissonance.
Predictions:
| # | Prediction | Test |
|---|---|---|
| P-$\psi$1 | Upon distraction (distractor task, 10–30 s) a person recovers ~3–4 of 7 WM items — those that receive a ping upon context return | Behavioral: retention test with vs without context reinstatement |
| P-$\psi$2 | SIT-gap increases retention of latent memes linked to gap themes compared to closed topics | Behavioral: recall for open vs closed tasks after delay |
| P-$\psi$3 | Coherent sets (low $D$) are recovered from Latent WM faster than incoherent ones, given equal $\psi$ | RT: pinging latency for coherent vs incoherent sets |
| P-$\psi$4 | During chunk-ping: activating one meme from the chunk reactivates the entire chunk (all-or-none); unrelated memes reactivate individually | Behavioral: cued recall — chunk-member cue vs unrelated cue |
Neurobiological basis of $\kappa$ and $\psi$: Molecular markers of $\kappa$-levels, transition mechanisms, and the synaptic substrate of activity-silent WM ($\psi$) — see BM, Part III. Engram allocation: Josselyn & Frankland (2015). Memory linking: Cai et al. (2016), Nature. Schema-congruence: van Kesteren et al. (2012), SLIMM. Activity-silent WM: Stokes (2015), TiCS; Rose et al. (2016), JoCN; Wolff et al. (2017), Nature Neuroscience. Generative model of memory: Spens & Burgess (2024), Nature Human Behaviour.
Conceptual framework: Multi-level memory and activity-silent WM — see EMT, Part VIII.
Engineering implementation: $\kappa$ and $\psi$ as node attributes, two-compartment WM — see AGI_F, Part III.
Episodic Memory: Discretization of Experience
$\kappa$-levels determine the depth of meme storage. But memory is organized not only by depth — it is discretized into episodes: time-bounded sets of co-active memes indexed by content-independent markers.
Episode as a Formal Structure
Definition. An episode $\varepsilon_k$ is a tuple:
$$\varepsilon_k = (B_k, \; M_k(t), \; \tau_k, \; \partial\varepsilon_k)$$where:
- $B_k \in \{0,1\}^{|V|}$ — barcode (sparse binary vector, $\|B_k\|_0 / |V| \approx 0.05{-}0.10$)
- $M_k(t) = \{m_i : a_i(t) > \theta_{episode}\}$ — set of memes active in episode $k$
- $\tau_k = [t_{start}^k, \; t_{end}^k]$ — temporal interval
- $\partial\varepsilon_k$ — boundary conditions (what triggered the beginning/end)
The barcode $B_k$ is a content-independent index: a random sparse pattern generated by the hippocampus at the start of a new episode. It does not encode content — it addresses it, similar to a hash code (Chettih et al., 2024, Cell).
Difference from $\kappa$: $\kappa$ is a property of an individual meme (how consolidated it is). An episode is a property of a group of memes (what was co-active in one temporal interval). A meme can participate in multiple episodes, and its $\kappa$ does not depend on which episode it first appeared in.
Event Boundary Detection: When One Episode Ends and Another Begins
An episode boundary arises upon abrupt change of memplex state. Formally, the system monitors a context vector $\mathbf{c}(t)$ — the aggregate of current activations:
$$\Delta_{PE}(t) = \|\mathbf{c}(t) - \mathbf{c}(t-1)\|$$A new episode $\varepsilon_{k+1}$ begins when at least one of the following conditions is met:
| Condition | Formula | Interpretation |
|---|---|---|
| PE jump | $\Delta_{PE}(t) > \theta_{boundary}$ | Abrupt context change |
| G-modulation switch | $\|G_{active}(t) - G_{active}(t{-}1)\| > \theta_G$ | Switching the dominant drive |
| Temporal gap | $t - t_{last\_input} > T_{gap}$ | Pause in incoming information |
| SIT-gap closure | $closure_j(t) \geq 1.0$ while $closure_j(t{-}1) < 1.0$ | Closing a question = natural breakpoint |
When a boundary fires:
- The current $\varepsilon_k$ is closed: $t_{end}^k = t$
- A new barcode $B_{k+1}$ is generated (sparse random co-activation)
- $\varepsilon_{k+1}$ opens with $t_{start}^{k+1} = t$
Link to $\kappa$-gating (see above): PE serves a dual function — it determines both the fate of an individual meme ($\kappa$-transition) and an episode boundary. But the thresholds differ: $\theta_{boundary} > \theta_{\kappa}$ — not every creation of a new meme opens a new episode.
Temporal Order within an Episode: Time Cells
Within a single $\varepsilon_k$, memes are not merely co-active — they are temporally ordered. In addition to theta-binding (~125 ms, synchronizing components within a single moment, see Part XIV), there exists a slower scale: time cells encode the meme’s position on the episode’s temporal axis (seconds — minutes).
Formally, each meme $m_i$ in episode $\varepsilon_k$ receives an ordinal label:
$$\phi_k(m_i) = \frac{t_{activate}(m_i) - t_{start}^k}{t_{end}^k - t_{start}^k} \in [0, 1]$$The label $\phi_k$ is computed upon episode closure (when $t_{end}^k$ is known) or during replay; during an active episode, the absolute timestamp $t_{activate}(m_i)$ is used.
This allows retrieval to reconstruct not only what was in the episode but also in what order — a necessary condition for narrative memory.
Two levels of temporal encoding:
| Scale | Mechanism | What it encodes | Substrate |
|---|---|---|---|
| ~125 ms | Theta-binding (Part XIII) | Which components = one moment | Theta rhythm, phase precession |
| Seconds–minutes | Time cells, $\phi_k$ | Order within an episode | CA1, ramping neurons |
Retrieval: Pattern Completion via Barcode
Episode retrieval = partial reactivation of its barcode. It suffices to activate a portion of $B_k$ so that spreading activation restores the entire pattern:
$$\text{Recall}(\varepsilon_k) \;:\; \exists\, S \subset \text{supp}(B_k), \;\; |S| / \|B_k\|_0 > \rho_{min} \;\;\Rightarrow\;\; \text{pattern completion} \to M_k$$where $\rho_{min} \approx 0.3$ — the minimum fraction of reactivated barcode for full reconstruction.
A cue activates memes that through shared edges activate part of the barcode → the barcode through pattern completion restores the remaining memes of the episode. Hence:
- Retrieval specificity: different barcodes are orthogonal (sparse + random), confusion between episodes is minimal
- Contextual cue: the same stimulus in different episodes is linked to different barcodes → one can recall the specific episode needed
- Partial recall: at $\rho < \rho_{min}$ — only fragments of the episode, order is lost (tip-of-the-tongue effect)
Temporal Chaining: Sequence of Episodes
Sequential episodes are linked through context overlap at the boundary:
$$w(\varepsilon_k, \varepsilon_{k+1}) = |M_k(t_{end}^k) \cap M_{k+1}(t_{start}^{k+1})| / |M_k \cup M_{k+1}|$$The more memes remain active across the boundary — the stronger the associative link between episodes. Also: overlapping engrams (Cai et al., 2016) create a structural bridge — episodes within ~6 hours literally share graph nodes.
Barcode Lifecycle
The barcode is not a permanent structure. Its dynamics:
Ontological status of the barcode: $B_k$ is not a meme (graph node) but an index structure (episode metadata). A barcode has no $\kappa$ of its own; its persistence is determined by replay frequency. Without reactivation, the pattern $B_k$ fades — not through $\kappa$-transitions, but through loss of co-activation coherence among nodes in $\text{supp}(B_k)$.
| Phase | Duration | What happens | State of $B_k$ |
|---|---|---|---|
| Creation | Boundary moment | Sparse random co-activation, $B_k$ is formed | Active: pattern is coherent |
| Wakefulness | Hours | $B_k$ is tagged by awake SWR for nightly replay | Maintained: SWR supports |
| Sleep replay | Night | SWR reactivates $B_k$ → pattern completion → full $\varepsilon_k$ → DECOMPOSE + CONNECT | Maintained: replay strengthens |
| Consolidation | Days–weeks | Content $M_k$ integrates into semantic networks; replay of $B_k$ ceases | Fading: coherence is lost |
| Mature memory | Months–years | Barcode is lost; content = part of semantic network (memes with $\kappa = 2$) | Lost: pattern indistinguishable |
This is trace transformation: episodic memory (detailed, with barcode) → semantic memory (gist, without barcode). BMC predicts this transformation from first principles: content strengthened through replay transitions to $\kappa = 2$; the barcode as an index structure without G-linkage and without replay loses coherence and disappears.
Exception: Emotionally significant episodes ($E_{emotional} > \theta_E$) retain their barcode longer — through the amygdala-hippocampal loop. Hence vivid flashbulb memories: barcode is preserved → retrieval is complete, with details and order.
SIT-Gaps and Episodes: Orthogonal Timescales
A SIT-gap (open meme, Part V) lives across episodes:
ε₁ ──── ε₂ ──── ε₃ ──── ε₄ ──── ε₅
gap_A: ━━━━━━━━━━━━━━━━━━━━━━━ (closure in ε₅)
gap_B: ━━━━━━━━━ (closure in ε₂)
gap_C: ━━━━━━━━━━━━━ (open)
Connections:
- A gap causes thematic coherence between episodes: $\varepsilon_1$ and $\varepsilon_3$ are linked not only chronologically but also through shared gap_A
- Partial closure of a gap within $\varepsilon_k$ → PE → $\varepsilon_k$ is tagged for consolidation
- Full closure ($closure \geq 1.0$) → strong event boundary → $\varepsilon_k$ terminates
Zeigarnik in episodic terms: An open gap prevents associated episodes from being forgotten — barcodes $B_k$ with an unclosed gap do not decay because the gap generates periodic reactivation (SIT-driven pulsation → partial replay → barcode is maintained).
Predictions
| Prediction | Test |
|---|---|
| Event boundaries correlate with PE jumps ($r > 0.5$) | fMRI: hippocampal BOLD during context change vs continuous context |
| Barcodes are orthogonal across episodes (cosine similarity < 0.1) | Calcium imaging: CA1 patterns when recalling different episodes of the same day |
| An unclosed SIT-gap preserves barcodes of associated episodes | Behavioral test: recall of details of unfinished vs finished tasks (Zeigarnik) + fMRI |
| Trace transformation: $Fidelity(\varepsilon_k)$ drops faster than core recall | Test at 1 day vs 1 month: details (order, colors) vs gist (what happened) |
Neurobiological basis: Barcodes (Chettih et al., 2024), time cells, event boundary neurons — see BM, Part IV. Hippocampal indexing theory: Teyler & DiScenna (1986).
Conceptual framework: Episodes as discrete units of experience — see EMT, Part VIII.
Engineering implementation: Episode struct, barcode generation, boundary detection — see AGI_F, Part III.
Overnight Network Optimization: Sleep as a Network Process
Network interpretation of sleep:
The unit of overnight consolidation is the episode $\varepsilon_k$ (see Episodic Memory). Sleep SWR reactivates barcode $B_k$ → through pattern completion restores $M_k$ → the following operations are performed:
- Decomposition (DECOMPOSE): Episode $\varepsilon_k$ → set of components (features) — SWS
- Binding (CONNECT): New edges between features and existing categories — SWS→neocortex
- Recombination (BLEND): Recombination of components from different clusters — REM. Formal operation: $m_{new} = blend(m_i, m_j)$ for nodes from different communities (see Meme Synthesis)
- Pruning (PRUNE): Removal of weak edges ($|w| < \theta_{min}$) — synaptic homeostasis
- Strengthening (STRENGTHEN): Increasing weights of emotional and frequently reactivated edges
Effect of sleep on edge decay:
$$\lambda_{next} = \lambda_{base} \cdot (1 - \eta \cdot S_{quality})$$where $\eta \approx 0.3$ — sleep efficiency.
| Sleep | $S_{quality}$ | $\lambda$ | Time to $w < 0.1$ |
|---|---|---|---|
| Deprivation | 0 | 0.30 | ~8 days |
| Poor | 0.5 | 0.26 | ~9 days |
| Normal | 0.8 | 0.23 | ~10 days |
| Deep SWS | 1.0 | 0.21 | ~11 days |
Consolidation process: How operations on episodes cumulatively produce $\kappa$-transitions, SWR-tagging, trace transformation — see Consolidation Process below.
Neurobiological basis: Triple coupling and the decomposition mechanism — see BM: Sleep as Meme Consolidation Mechanism.
Verified sources: Staresina et al. (2015, Nature Neuroscience) — hierarchical nesting in humans; Latchoumane et al. (2017, Neuron) — causal proof via optogenetics; Yang et al. (2014, Science) — spine formation during sleep; Tononi & Cirelli (2014, Neuron) — SHY review: synaptic downscaling as the basis of PRUNE.
Consolidation Process: From Tagging to κ-Transition
The κ-levels section defines the conditions for transitions, the overnight optimization section defines the operations on the graph. This section formalizes the process linking them: how operations on episodes cumulatively produce $\kappa$-transitions.
SWR-Tagging: Selection of Episodes for Consolidation
Consolidation begins not during sleep, but during wakefulness. During pauses in activity (task completion, distraction), awake SWRs occur that reactivate recent episodes and tag them for overnight consolidation (Buzsaki, 2024, Science: 5–20 SWR after significant experience).
Tagging is a two-level process:
Meme → engram allocation (defined above): $P_{cons}(m_i) \propto C_E(m_i) \cdot (1 + \gamma_G \cdot G_{align}) \cdot (1 + \gamma_E \cdot E_{emot})$
Episode → SWR-tag: the episode inherits the priority of its best meme with a recency correction:
where $\lambda_r$ is the recency decay rate ($T_{1/2} \sim 1$ h), $gap\_rel(\varepsilon_k) = \max\{1 - closure_j : gap_j \text{ active in } \varepsilon_k\}$.
The second argument of $\max$ is the Zeigarnik correction: an episode with an unclosed SIT-gap ignores recency decay because the gap maintains periodic reactivation.
- Sleep → probability of successful replay: determined by oscillation quality (SO–spindle–SWR triple coupling, defined in BM):
where $\alpha_{SO}, \alpha_{spindle}, \alpha_{SWR} \in [0,1]$ are normalized oscillation amplitudes (defined in BM).
Full pipeline: $P_{cons}(m_i) \to P_{tag}(\varepsilon_k) \to P_{cons}^{night}$ — three filters, each of which can block consolidation.
Consolidation Cascade
Each sleep cycle (SWS + REM) performs a sequence of operations on tagged episodes. Operations are defined in overnight optimization; here their ordering and cumulative effect are formalized:
FOR each sleep cycle (n = 1, 2, ...):
FOR each ε_k with P_tag > θ_tag (typically ~ 0.2):
1. REPLAY: reactivate B_k → pattern completion → M_k
2. DECOMPOSE: extract features from M_k (SWS)
3. CONNECT: link features to existing schemas (SWS → neocortex)
4. BLEND: recombine with other ε (REM)
5. PRUNE: weaken peripheral connections (SHY)
6. STRENGTHEN: strengthen core connections (replay-induced L-LTP)
7. κ-ASSESS: recompute κ_i for each m_i ∈ M_k
STOP replay ε_k WHEN:
— all core memes have reached κ = 2, OR
— P_tag(ε_k) has fallen below θ_tag (recency faded, gap closed)
At step 1, replay increases $n_{react}$ for co-active memes. At step 7, memes are evaluated against standard $\kappa$-transition conditions. Cumulative effect: after $N$ cycles, core memes accumulate $n_{react} \geq N_{crit}$ → $\kappa: 1 \to 2$. Cessation of replay → barcode $B_k$ loses support → fades.
The capacity of a single night is limited: the number of replay slots is $\sim 10{-}20$ episodes per night. Episodes compete by $P_{tag}$ for these slots — the strongest win.
Trace Transformation: Detail → Gist
Repeated replay transforms the episode — it does not copy but restructures (Moscovitch, 2024; Nature, 2025). Formalization through per-meme dynamics within $\varepsilon_k$:
Core memes ($C_E(m_i) > \theta_{core}$ within $M_k$, where $\theta_{core}$ is the median $C_E$ across $M_k$): each replay → $n_{react}$++ → STRENGTHEN → weight++ → $Fidelity$ grows:
$$Fidelity_{core}(n) = F_0 + \rho_c \cdot (1 - F_0) \cdot (1 - e^{-\beta \cdot n})$$Peripheral memes ($C_E(m_i) < \theta_{core}$): SHY downscaling → weight– → $Fidelity$ decays:
$$Fidelity_{periph}(n) = F_0 \cdot e^{-\lambda_{SHY} \cdot n}$$where $n$ is the number of replay cycles, $\rho_c \sim 0.3$ is the consolidation strengthening rate (not to be confused with $\rho$ of reactivation from Fidelity), $\beta \sim 0.5$ is the consolidation rate, $\lambda_{SHY} \sim 0.2$ is the homeostatic downscaling rate.
Barcode $B_k$: maintained as long as replay continues. When core memes are integrated into the neocortex ($\kappa = 2$), replay of $\varepsilon_k$ ceases → $B_k$ fades → retrieval via $B_k$ becomes impossible → memory transitions from episodic to semantic.
Result: gist = surviving core memes with $\kappa = 2$; details = pruned peripheral memes; barcode = lost. Mature memory = semantic knowledge without the episodic “wrapper.”
Schema-Congruence: Modulation of Consolidation Speed
The number of replay cycles required for $\kappa: 1 \to 2$ depends on the meme’s I-compatibility with the existing memplex (continuation of the SLIMM analysis from the $\kappa$ section):
$$N_{replay}^{req} = \frac{N_{base}}{(1 + \alpha_{schema} \cdot I_{compat}) \cdot (1 + \alpha_{emot} \cdot E_{emot})}$$| Meme type | $I_{compat}$ | $E_{emot}$ | $N_{replay}^{req}$ | Mechanism |
|---|---|---|---|---|
| Schema-congruent | High | Low | 1–3 cycles | mPFC fast integration, many anchor points |
| Incongruent | Low | Low | 5–10+ cycles | Few connections to schema, detailed encoding |
| Radically novel + G | Low | High | 1–2 cycles | Emotional tag → accelerated LTP |
U-shaped curve: both the familiar (many anchor points in the schema) and the shockingly novel (emotional boost) consolidate quickly; slowest of all is the moderately novel without emotion.
Temporal Linking: Co-Consolidation of Close Episodes
Episodes within the temporal linking window ($\Delta t < T_{link}$) share engram neurons (Cai et al., 2016, Nature) → during replay of one $\varepsilon_j$, $\varepsilon_k$ is partially reactivated → co-consolidation:
$$P_{link}(\varepsilon_j, \varepsilon_k) = e^{-|t_j - t_k| / T_{link}} \cdot \frac{|M_j \cap M_k|}{|M_j \cup M_k|}$$where $T_{link} \sim 6$ h. The first factor is temporal proximity; the second is content overlap (Jaccard index).
Consequence: events of the same day consolidate as a cluster → forming “daily narratives” in autobiographical memory.
Temporal Scale of Consolidation
| Period | What happens | $\kappa$ of memes | Barcode $B_k$ | Fidelity |
|---|---|---|---|---|
| 0–6 h | Awake SWR-tagging | 1 | Active | $F_0$ (initial) |
| 1 night | 1–5 replay cycles | 1 (→2 with emotional tag) | Maintained | Core↑, periph↓ |
| 1 week | Multiple replay | Core: 1→2, periph: 1→∅ | Maintained/Fading | Core: $F_{LTM}$, periph: < $F_{trace}$ |
| 1 month | Trace transformation complete | Core: 2, periph: pruned | Fading/Lost | Core: high, periph: — |
| 6+ months | Semantic knowledge | 2 (gist only) | Lost | Stable (slow $\lambda_{LTM}$ decay) |
Exception: Flashbulb memories — the emotional tag (amygdala) protects the barcode from fading → episodic details are preserved for years.
Predictions
| Prediction | Test |
|---|---|
| Schema-congruent memes consolidate in 1–3 nights; incongruent ones in 5–10 | Recall of new facts (schema-fit vs schema-violating) at 1, 3, 7 days |
| Temporally close episodes ($\Delta t < 6$ h) produce cross-priming | Priming effect for episode pairs: one-day vs multi-day, controlling for content overlap |
| $P_{tag} \times S_{quality}$ predicts recall better than either factor alone | Overnight EEG (SO/spindle/SWR amplitude) + daytime $P_{cons}$ → recall after one week |
| Trace transformation preserves core ($C_E > \theta$) and loses periphery | Recall at 1 day (full details) vs 1 month (gist only): loss correlates with $C_E$ |
$\kappa$-transition conditions: engram allocation, decay rates — see $\kappa$-levels above.
Operations on the graph: DECOMPOSE/BLEND/PRUNE/STRENGTHEN — see overnight optimization above.
Neurobiological basis: Triple coupling, SWR-tagging, SHY, spine formation — see BM, Part IV. Trace transformation at the molecular level — see BM: Trace transformation.
Conceptual framework: Consolidation as active reconstruction — see EMT, Part VIII.
Engineering implementation: sleep_cycle(), swr_tag(), trace_transform() — see AGI_F, Part III.
Active Forgetting and Reconsolidation
Passive decay ($w_{ij} \to 0$ in the absence of activation) is the only forgetting mechanism formalized above. However, neuroscience shows that forgetting can be active: the I-system deliberately suppresses specific memes (Anderson & Green, 2001; Yi Zhong et al., 2012 — Rac1/Cdc42 cascade). Furthermore, memories retrieved from LTM become labile — reconsolidation (Nader et al., 2000).
Two Types of Forgetting
| Passive decay (above) | Active I-suppression (new) | |
|---|---|---|
| Mechanism | $w_{ij} \to 0$ without activation | I-memes deliberately extinguish target meme |
| Formula | $w_{ij}(t) = w_{ij}(t_0) \cdot e^{-\lambda_{eff} t}$ | $\Delta a_i^{suppress} = -\eta_s \cdot I_{sig}(i) \cdot a_i(t)$ |
| Selectivity | None — all inactive memes decay | Yes — specific meme suppressed by I-signal |
| Analog | Synaptic homeostasis (SHY) | Prefrontal GABA inhibition, Rac1 cascade |
| When | Constantly | During conflict ($D > \theta$), intrusions |
| Dependence on $\kappa$ | Yes ($\lambda_{base}(\kappa)$) | Weak: works at any $\kappa$, but LTM ($\kappa=2$) requires more signal |
I-Suppression Signal
$$I_{sig}(m_i, t) = \sum_{k:\, w_{ki} < 0} (-w_{ki}) \cdot a_k(t)$$Summation over all nodes with negative edges to $m_i$. The sign is inverted ($-w_{ki} > 0$) so that $I_{sig} \geq 0$. The more active the I-meme and the stronger the negative connection, the stronger the suppression.
When $I_{sig}(m_i, t) > \theta_I$:
$$a_i(t+1) \leftarrow a_i(t) + \Delta a_i^{suppress}, \quad \Delta a_i^{suppress} = -\eta_s \cdot I_{sig}(m_i, t) \cdot a_i(t)$$Sustained suppression → fidelity damage. We introduce a counter $\tau_{supp}(m_i)$ — the number of consecutive steps in which both conditions are met:
$$\tau_{supp}(m_i, t) = \begin{cases} \tau_{supp}(m_i, t-1) + 1 & \text{if } a_i(t) < \theta_{low} \text{ AND } I_{sig}(m_i, t) > \theta_I \\ 0 & \text{otherwise} \end{cases}$$When $\tau_{supp} \geq T_{suppress}$ (sustained suppression):
$$F_i \leftarrow F_i \cdot (1 - \delta_F), \quad w_{ij} \leftarrow w_{ij} \cdot (1 - \delta_w) \quad \forall j \in N(i)$$Prolonged suppression does not merely deactivate the meme — it destroys its consolidation (analogous to Rac1-induced actin depolymerization, AMPAR internalization).
Gap archiving upon sustained suppression: If I-suppression has led to $a_i < \theta_{prune}$ and the meme has an associated GAP-node → $archived\_gaps.add(gap_i)$. Without this, open_memes would contain “dead” gaps, inflating SIT.
Four Mechanisms of Active Forgetting
| # | Mechanism | BMC formalization | Neurobio analog |
|---|---|---|---|
| 1 | I-mediated suppression | $I_{sig} \to \Delta a_i^{suppress}$; sustained → $F_i \downarrow$ | Prefrontal GABA inhibition |
| 2 | Competition displacement | New meme displaces $m_i$ from WM → $\psi_i$ decay | AMPAR internalization / LTD |
| 3 | Interference | New meme $m_j$ with high $compat(m_j, m_i)$ → competition for engram | Neurogenesis in DG (Akers et al., 2014) |
| 4 | Sleep pruning | $F_i < F_{trace}$ after SHY → removal | Microglial pruning (C1q/C3) |
Mechanisms 2–4 were formalized earlier (WM displacement, engram interference, overnight pruning). Mechanism 1 (I-mediated suppression) is new, formalized above.
Retrieval-Induced Forgetting (RIF)
Recall of meme $m_i$ strengthens $m_i$, but suppresses competing memes through lateral inhibition (Anderson et al., 1994; Anderson, 2003). A competitor = a meme with high semantic similarity but not an ally ($w_{ij} < w_{ally}$; ally memes are protected from RIF):
$$\Delta a_j^{RIF} = -\beta_{RIF} \cdot a_i(t) \cdot \mathbb{1}[compat(m_i, m_j) > \theta_{sim}] \cdot \mathbb{1}[w_{ij} < w_{ally}]$$where $w_{ally} \approx 0.3$ — threshold: edges above this value indicate alliance.
Link to hub protection:
$$RIF_{effect}(m_j) \propto \frac{1}{C_E(m_j)}$$Peripheral memes (low eigenvector centrality) are suppressed under RIF more strongly than hubs. Hubs are protected by virtue of multiple supporting connections.
Reconsolidation: Plasticity Window
Definition of recall. In the context of reconsolidation, recall is the reactivation of a consolidated meme:
$$recalled(m_i, t) \equiv [\kappa_i = 2] \wedge [a_i(t) > \theta_{act}] \wedge [a_i(t-1) < \theta_{act}]$$Only LTM memes ($\kappa = 2$) are subject to reconsolidation. Activation crossing $\theta_{act}$ from below = recall.
Lability trigger: $recalled(m_i, t_r) = true$ AND prediction error $\Delta_{PE}(t_r) > \theta_{recon}$.
Lability function:
$$Labile(m_i, t) = \mathbb{1}[recalled(m_i, t_r)] \cdot \mathbb{1}[\Delta_{PE}(t_r) > \theta_{recon}] \cdot e^{-\frac{t - t_r}{\tau_{recon}}}$$$\tau_{recon}$ — window duration (biology: ~6 hours; AGI: N steps). While $Labile > \theta_{labile}$, the meme is in a labile state: its Fidelity naturally degrades (analogous to the need for protein resynthesis per Nader):
$$F_i(t) \leftarrow F_i(t) \cdot (1 - \delta_{labile} \cdot Labile(m_i, t))$$Reconsolidation outcomes:
| Condition | Result | What happens |
|---|---|---|
| PE moderate, compatible | Update | $w_{ij}$ modified; meme retains $\kappa=2$ |
| PE small, meme reactivated | Strengthen | $n_{react}$++, Fidelity↑; re-stabilization |
| PE large, incompatible ($\Delta_{PE} > \theta_{destab}$) | Destabilize | $\kappa: 2 \to 1$; meme returns to STM |
| No re-stabilization within $\tau_{recon}$ | Erase | $F_i \to F_{trace}$; meme is a candidate for pruning |
Boundary Conditions of Reconsolidation
| Factor | Effect | Formalization |
|---|---|---|
| Memory age | Young $\kappa=2$ is more labile | $P_{recon} \propto 1 / (1 + age_{consolidated})$ |
| Memory strength | High $n_{react}$ + $F_i$ → resists | $P_{recon} \propto 1 / (1 + n_{react} \cdot F_i)$ |
| Reactivation duration | Too brief recall — no lability; too long — new encoding | Inverted-U: $P_{recon}(t_{react}) = t_{react} \cdot e^{-t_{react}/\tau_{opt}}$ |
| PE magnitude | Too small → no lability; too large → new meme instead of update | Zone: $\theta_{recon} < \Delta_{PE} < \theta_{new}$ |
Relationship between I-Suppression and Reconsolidation
I-suppression and reconsolidation are two independent mechanisms for weakening a meme:
- I-suppression: deliberate suppression via $I_{sig}$ without recall; the meme is deactivated, with sustained suppression — fidelity damage.
- Reconsolidation: Fidelity degradation upon recall + PE; without I-involvement.
The mechanisms can coincide: sustained I-suppression weakens a meme → recall in this state → lability is facilitated (low $F_i$ → less resistance to reconsolidation per boundary conditions).
Predictions
| Prediction | Test |
|---|---|
| P-F1: RIF effect inversely correlates with $C_E$ of the suppressed meme: $RIF_{effect} \propto 1/C_E(m_j)$ — peripheral memes are forgotten more strongly than hubs | Think/No-Think paradigm: hub-beliefs (core identity) vs peripheral facts → RIF effect size by eigenvector centrality |
| P-F2: Memes with high $\kappa$ + high $n_{react}$ are resistant to reconsolidation | Reconsolidation protocol (Schiller et al.): compare well-rehearsed vs recent memories → lability rate |
| P-F3: I-suppression (sustained) → structural weakening ($F_i \downarrow$, $w_{ij} \downarrow$); passive decay → only $w \to 0$ | Intentional suppression + fMRI: decreased hippocampal BOLD under sustained; contrast with non-use decay |
| P-F4: Reconsolidation update is possible only in the zone $\theta_{recon} < \Delta_{PE} < \theta_{new}$ | Vary PE magnitude during reconsolidation: too low → no change; too high → new encoding; middle → update |
Conceptual framework: Forgetting as an immune function, therapeutic reconsolidation — see EMT, Part VIII.
Neurobiological basis: Rac1/Cdc42, GABA inhibition, Nader (2000) — see BM, Part IV.
Engineering implementation: i_suppress_cycle(), check_reconsolidation(), active_forget_cycle() — see AGI_F, Part III.
Automatization: Transition from WM-Dependent to WM-Independent Execution
Blocks 1–5 formalized storage, access, and forgetting of memes. Automatization is the next level: the transition from expensive WM-dependent execution of behavioral sequences to WM-independent execution. This is not an additional memory system but a necessary consequence of M » G with limited WM — if all pointers are occupied by routine tasks, there are no resources for new memes, and the memplex stagnates. Automatization removes this constraint.
Automatic Chain Auto(S)
Let $S = (m_1, m_2, \ldots, m_k)$ be a sequence of memes with $w(m_i, m_{i+1}) > 0$ for all $i$. We define:
$$Auto(S) \equiv \bigl[\min_i w(m_i, m_{i+1}) > w_{auto}\bigr] \wedge \bigl[n_{exec}(S) > N_{auto}\bigr]$$where $w_{auto} \approx 0.7$ is the threshold edge weight, $N_{auto} \approx 20$ is the minimum number of successful executions.
Interpretation: A sequence is considered automatic if (a) all edges in the chain are sufficiently strong — no “gaps” requiring WM control, and (b) the chain has been executed sufficiently often — routineness is confirmed. Chunking (Block 4) groups memes under one pointer — automatization goes further: the chain requires no pointer at all.
Semantics: Activation of the trigger $m_1$ cascades along strong edges through $m_2, \ldots, m_k$ without involvement of the WM pointer system. WM does not track intermediate steps — this is not “unconscious monitoring” but the absence of monitoring: the sequence unfolds autonomously.
Habit Dynamics
For each meme $m_i$ participating in behavioral chains, we introduce a continuous variable:
$$habit_i(t) \in [0, 1]$$Update after each execution:
$$habit_i(t+1) = habit_i(t) + \alpha_h \cdot (1 - habit_i(t)) \cdot \mathbb{1}[exec\_success] \cdot sleep\_boost(t)$$where $\alpha_h \approx 0.05$ is the learning rate, $\mathbb{1}[exec\_success]$ is the indicator of successful execution (errors do not strengthen the habit), and:
$$sleep\_boost(t) = 1 + \beta_{motor} \cdot \mathbb{1}[\Delta t_{sleep} < T_{crit}]$$$\beta_{motor} \approx 0.3$ is the motor consolidation bonus during sleep (~30% enhancement; Solano et al., 2024, J Neurosci), $T_{crit} \approx 1h$ is the critical window between training and sleep.
Full automatization threshold: $\theta_{habit} \approx 0.8$. When $habit_i > \theta_{habit}$ for all $m_i \in S$ — the chain transitions to automatic mode. Relationship to Auto(S): successful execution strengthens edges via the Hebbian mechanism ($w(m_i, m_{i+1}) \uparrow$ upon joint activation), so habit growth usually accompanies weight growth, and $habit > \theta_{habit}$ is a sufficient condition for satisfying the edge-weight clause of Auto(S).
Orthogonality: $F_i \perp habit_i$ — Fidelity (content accuracy) and habit (execution fluency) are independent axes. A meme can be high-fidelity but not automatized (a new skill), or automatized but with degraded content (a bad habit with an outdated world model).
WM-Release Rule
Three WM-cost regimes for a sequence of length $k$:
| Regime | $wm\_cost$ | Mechanism | BMC block |
|---|---|---|---|
| Raw | $k$ | Each step = 1 WM pointer | — |
| Chunked | 1 | Memplex under 1 pointer | Block 4 |
| Automatic | 0 | Trigger → chain without WM | Block 6 |
Formally:
$$wm\_cost(S) = \begin{cases} k & \text{if } chunk(S) = false \wedge Auto(S) = false \\ 1 & \text{if } chunk(S) = true \wedge Auto(S) = false \\ 0 & \text{if } Auto(S) = true \end{cases}$$When $Auto(S) = true$: activation of the trigger $m_1$ launches the cascade $m_1 \to m_2 \to \ldots \to m_k$ through strong edges ($w > w_{auto}$) without allocating WM pointers. Freed pointers are available for parallel tasks — an expert drives “on autopilot” and simultaneously carries on a conversation.
DMS/DLS Competition: Deliberative vs Automatic
| Deliberative (DMS) | Automatic (DLS) | |
|---|---|---|
| Trigger | Goal-directed (reward PE) | Stimulus-driven (action PE) |
| Execution | WM-controlled, step-by-step | Cascade, WM-free |
| Monitoring | Active ($A_{SMC}$ high) | Minimal ($A_{SMC}$ low) |
| WM cost | $k$ or 1 (chunked) | 0 |
| Flexibility | High (PE → correction) | Low (rigidity risk) |
Probability of automatic execution:
$$P_{auto}(S, t) = \frac{habit(S)^{\gamma_h} \cdot WM\_load(t)}{habit(S)^{\gamma_h} \cdot WM\_load(t) + (1 - habit(S))^{\gamma_h}}$$where $habit(S) = \min_i habit_i$ for $m_i \in S$, $\gamma_h \approx 2$ is the transition steepness coefficient.
Critical property: Turner et al. (2022, J Neurosci): the deliberative system (DMS) functionally competes with the automatic system (DLS) at the beginning of learning — DMS engagement slows the transition to DLS-dominant mode; the novice cannot “let go of control” even if individual steps are already strengthened. High $WM\_load$ accelerates the transition to automatic mode: when WM is overloaded, deliberative inhibition weakens, and DLS “takes over” control.
Distinction of PE types (Lakshminarasimhan et al., 2024): Goal-directed = reward prediction error (mismatch of expected reward); habitual = action prediction error (mismatch of expected action). The two systems use different error signals, which allows them to operate in parallel on different chains.
MDL / Policy Distillation
Moskovitz et al. (2024, NeurIPS): automatization as behavioral compression following the Minimum Description Length (MDL) principle. The deliberative policy $\pi$ (complete, expensive) is gradually “distilled” into the automatic policy $\pi_0$ (compressed, cheap). S1/S2 (Kahneman, 2011) are not two separate modules but two ends of a continuum of $habit$:
- $habit \approx 0$: System 2 (deliberative, $\pi$, high WM cost)
- $habit \approx 1$: System 1 (automatic, $\pi_0$, zero WM cost)
The brain minimizes the description length of behavior — automatization = compression. This is consistent with BMC: the system strives for minimal WM cost while maintaining adequacy of execution.
Sleep Motor Consolidation
Consolidation of automatic chains occurs during sleep through spindle-SO coupling (distinct from declarative SWR-driven consolidation):
- Spindle-SO coupling → strengthening of edges $w(m_i, m_{i+1})$ in Auto(S)
- Critical window: sleep within ~1h after motor training → $\beta_{motor} \approx 0.3$ (Solano et al., 2024)
- Mechanism: motor consolidation is mediated by cortical spindle-SO coupling, not hippocampal SWR (as in declarative)
De-Automatization
Cost of switching from automatic to deliberative execution:
$$Cost_{override}(S) = c_0 \cdot habit(S)^2 \cdot n_{exec}(S)^{0.5}$$where $c_0$ is the base override cost. Cost is quadratic in habit: the deeper the habit, the more expensive it is to abandon. The square root of $n_{exec}$ adds a “momentum” effect — long-practiced skills are harder to retrain even at equal habit.
Two failure modes:
Rigidity ($habit \to 1$, $n_{exec} \gg 1$): $Cost_{override}$ is so high that the deliberative system cannot take over control. The habit is resistant to PE → behavior does not adapt to changed conditions. Clinically: OCD-like repetitive behaviors.
Relapse (stress → $WM\_load \uparrow$ → $P_{auto} \uparrow$): under stress, WM is overloaded, deliberative inhibition weakens, and the old automatic chain “takes over” control. Mechanism of addiction relapse: stress → WM overload → P_auto for substance-seeking chain ↑ → relapse.
Note: Mastery and Addiction — One Mechanism
$P_{auto}$ describes both mastery and addiction. The difference is in the content of $S$, not the mechanism: for a pianist $S$ = sequence of movements, for an addict $S$ = substance-seeking chain. $WM\_load$ is the bridge: stress increases $P_{auto}$ for all automatic chains simultaneously (through $k_{eff}$↓ → $WM\_load$↑). Under stress, the chain with the highest $habit$ wins — regardless of desirability. This is why stress triggers relapse to the most entrenched habit.
Non-trivial prediction: $Cost_{override}$ for retraining a master ≈ $Cost_{override}$ for breaking an addiction at equal $habit$ and $n_{exec}$. The parameters are identical — only the evaluation of $S$ through utility nodes differs. A pianist retraining fingering and an addict breaking a habit are solving one and the same de-automatization problem.
Predictions
| Prediction | Test |
|---|---|
| P-A1: WM load inversion — an expert spends less WM on the trained task, but more on a parallel task (freed pointers are redistributed) | Dual-task paradigm: novice vs expert, measuring WM load on primary + secondary task |
| P-A2: Verbalization (deliberation) slows automatization — DMS functionally competes with DLS | Masters (1992, British Journal of Psychology) implicit/explicit learning: instruction “don’t think about technique” → faster automatization |
| P-A3: Automatization + stigmergy → reduced selection pressure for brain growth → decrease in H. sapiens brain volume after ~30 kya (DeSilva et al., 2021, Frontiers in Ecology and Evolution) | Correlation of archaeological markers of stigmergy (symbolism, tools) with changes in endocranial volume in the paleontological record |
| P-A4: Sleep within ~1h after motor training → acceleration of automatization (spindle-SO coupling) | Solano (2024) protocol: immediate sleep vs delayed sleep → habit acquisition rate |
| P-A5: Island dwarfism (H. floresiensis, ~420 cc) → substrate reduction → limited WM + limited M capacity → automatization and memplex growth constrained → cognitive ceiling (Oldowan level) — reverse prediction to P-A3 (see EMT) | Cross-species: endocranial volume vs maximum technological complexity |
Conceptual framework: Derivation from first principles, two escape-valves of one tension, evolutionary confirmation — see EMT, Part VIII.
Neurobiological substrate: DLS-SNr-PF-DLS loop, DMS/DLS competition, spindle-SO coupling — see BM, Part IV.
Engineering implementation: check_auto_chain(), update_habit(), execute_auto() — see AGI_F, Part III.
Network Modulation by Character
Effect of the T vector on spreading activation:
For utility nodes (emotional nodes), the spreading activation formula is modified to account for the individual character vector:
$$a_g(t+1) = f\left(T_g \cdot (1 - \lambda) \cdot a_g(t) + \sum_j w_{gj} \cdot a_j(t)\right)$$where $T_g$ is the character vector component for emotional node $g$.
Consequence: With high $T_{FEAR}$, the FEAR node activates more strongly given the same input signal → more often “wins” in competition for attention → forms more connections with new memes.
Effect on connection formation:
$$w_{new}(m, e_g) = w_{base} \cdot (1 + \gamma \cdot T_g \cdot a_g) \cdot N_{replay}$$| $T_g$ | Effect on the network |
|---|---|
| $T_g > 1.5$ | Hyperreactivity: the system dominates, many strong connections |
| $T_g \approx 1.0$ | Normal: balanced influence |
| $T_g < 0.7$ | Hyporeactivity: the system weakly influences meme formation |
Neurobiological basis: Character vector and its influence on memplex formation — see BM: Character.
Affective Space: Blend Formula
The seven G-programs are discrete circuits. But subjective affective experience is a continuous point in (valence, arousal) space, emergent from the blend of several simultaneously active G-programs:
$$E(t) = \sum_g T_g \cdot a_g(t) \cdot \mathbf{v}_g \;\in\; \mathbb{R}^2$$where $\mathbf{v}_g$ is a fixed vector in (valence, arousal) coordinates per G-program: $\mathbf{v}_{SEEKING} = (+0.6, +0.7)$, $\mathbf{v}_{FEAR} = (-0.8, +0.9)$, $\mathbf{v}_{RAGE} = (-0.7, +0.8)$, $\mathbf{v}_{LUST} = (+0.5, +0.6)$, $\mathbf{v}_{CARE} = (+0.8, +0.2)$, $\mathbf{v}_{GRIEF} = (-0.9, -0.3)$, $\mathbf{v}_{PLAY} = (+0.9, +0.5)$.
$E(t)$ is a continuous point in the circumplex (Russell, 1980, J Pers Soc Psychol). “Nostalgia” = $E$ near GRIEF + SEEKING + CARE; “Schadenfreude” = PLAY + RAGE blend. Emotional memes are M-nodes clustering certain zones of E-space. Cultures differ in clustering: amae (Jpn.), saudade (Port.) = different M-clusters given identical G. Barrett is right for the M-level, Panksepp for the G-level.
Neurobiological basis: G = discrete (PAG, hypothalamus), M = continuous neocortical construction, dementia as a test — see BM: Affective Space.
Engineering implementation: E(t) from utility activations — see AGI_F: Blend.
Note. BMC uses 7 Panksepp G-programs: SEEKING, FEAR, RAGE, LUST, CARE, PANIC/GRIEF, PLAY. DISGUST is an I-layer mechanism (marking “foreign”), not an 8th G-program. DISGUST has no vector $v_g$ in E(t) and is not included in capture weights. In engineering implementations, it may be represented as a utility node for convenience.
Part IX. Immune System — Network View
Entry Filter: Compatibility with Central Nodes
A new meme X is evaluated by compatibility with already existing memes. Central nodes play the key role:
$$S(X) = \sum_i C(i) \cdot compat(X, i)$$where:
- $C(i)$ — centrality of meme $i$
- $compat(X, i)$ — compatibility of the new meme with $i$
Three-zone acceptance rule:
| Condition | Result | What happens |
|---|---|---|
| $S(X) > +\theta$ | Accepted | Meme is assigned a positive weight |
| $\|S(X)\| \leq \theta$ | Neutral | Meme is at the periphery, weak connections |
| $S(X) < -\theta$ | Rejected | Meme is assigned a negative weight |
A rejected meme is not merely “not accepted.” It is stored with negative weights as an antibody (high-fidelity negative meme), ready for automatic activation upon re-encounter with the threat.
The I-system is not limited to filtering incoming memes. The same negative edges provide retroactive suppression of already accepted memes: if a previously compatible meme becomes conflicting with the current memplex, the I-suppression signal ($I_{sig}$) deliberately extinguishes it — see Active Forgetting and Reconsolidation.
Visualization of the Threshold Mechanism
Cognitive Dissonance: Simultaneous Activation of Incompatibles
Network interpretation: Two memes are incompatible if their simultaneous activation causes “tension” in the network. With signed edges:
$$D = \sum_{i,j} a_i \cdot a_j \cdot incompat(i,j), \quad \text{where } incompat(i,j) = \max(0, -w_{ij})$$Incompatibility is now directly determined by the negative weight of the edge: the stronger the negative connection, the greater the dissonance upon simultaneous activation.
High $D$ → cognitive dissonance → motivation to resolve.
Defensive circuit tension (Tension): An additional metric beyond dissonance — load on the utility/meme boundary. For signed edges:
$$Tension(t) = \sum_{u \in V_u, m \in N(u)} |w_{um}| \cdot gap(u, m)$$where $gap(u, m) = \max(0, a_u - a_m)$ when $w_{um} > 0$ (utility “pushes,” meme does not support) and $gap(u, m) = \max(0, a_m - \theta_{suppress})$ when $w_{um} < 0$ (meme is active despite utility rejection).
High $Tension$ → the defensive circuit is overloaded. Unlike $D$ (incompatibility within memes), $Tension$ measures misalignment between layers (utility vs memetic).
Hub Protection Hypothesis: Which Belief Will Change?
Standard theory (Festinger, 1957) describes the motivation to reduce D, but does not predict which meme will change. BMC gives a precise answer through eigenvector centrality:
$$P(\Delta a_i < 0 \mid D > \theta) \propto \frac{1}{C_E(i)}$$where $C_E(i)$ is the eigenvector centrality of node $i$. The meme with the lowest $C_E$ will be suppressed first: its change minimizes cascade restructuring.
Formal justification: Consider a pair $(i, j)$ with $w_{ij} < 0$ and $a_i, a_j > \theta_{high}$. Suppressing $i$ causes a cascade of length $L_i = \sum_{k \in N(i)} |a_k - a'_k|$, suppressing $j$ — a cascade $L_j$. Minimizing total restructuring → the meme with smaller $L$ is suppressed, which correlates with $C_E$:
$$\arg\min(L_i, L_j) \approx \arg\min(C_E(i), C_E(j))$$Testable prediction: In experiments with induced cognitive dissonance: semantic network degree (as a proxy for $C_E$) of the target belief predicts which of two beliefs will change. The belief with higher degree is preserved.
Boundary of applicability: Hub Protection operates when $D < \theta_{crisis}$ (everyday dissonance). When $Tension > \theta_{crisis}$ (life crises, prolonged accumulation of discrepancies) hub displacement dominates — the hub yields, cascade restructuring occurs. The approximation $\arg\min(L_i, L_j) \approx \arg\min(C_E(i), C_E(j))$ holds at moderate $a_i, a_j$ and not too high overall $Tension$.
Conceptual discussion of hub advantage in dissonance resolution — see EMT, Part IV.
Second Type of Tension: Structural Incompleteness (SIT)
$Tension$ and $D$ describe conflicts between existing nodes. But there is a second type of tension — from missing nodes. $SIT$ (Structural Incompleteness Tension) measures the total tension from structural gaps in the memplex (definition and formulas — see Part VIII: SIT).
Two types of tension:
| Property | $Tension_{mismatch}$ (already introduced) | $SIT$ (structural) |
|---|---|---|
| Source | Conflict between utility and active memes | Absence of nodes at structurally significant positions |
| What hurts | “Memes don’t support my needs” | “I know there’s something I don’t understand” |
| Temporal profile | Depends on current activation | Persistent (until gap is closed) |
| Resolution mechanism | Changing utility↔meme weights | Filling the gap (closure) |
| Biosubstrate | ACC, PFC, autonomic nervous system | DMN, dopamine system |
Combined metric of total tension:
$$Tension_{total}(t) = Tension_{mismatch}(t) + \gamma_{SIT} \cdot \sum_C SIT(C) \cdot LP(C, t)$$where $\gamma_{SIT}$ is the coefficient of SIT’s relative contribution (individual, depends on $T_{SEEK}$ — the Panksepp character component). High $T_{SEEK}$ → high $\gamma_{SIT}$ → researchers and scientists experience more tension from unsolved problems.
$Tension_{total}$ is a unified metric of cognitive load on the system. High $Tension_{total}$ → motivation to act (resolving conflicts or filling gaps). This explains why a scientist might leave a comfortable job for an unsolved problem: the $SIT$ component outweighs all other utility.
Defense Mechanism: Isolation of the Threatening Node
When a new meme threatens central nodes, the memplex “isolates” it:
- Connection blocking: Edge weights to the threatening meme are reduced
- Counter-argumentation: Antagonist memes are activated
- Rationalization: New connections are created that devalue the threat
Numerical Example: Meme Acceptance Threshold
Situation: New meme X (scientific fact) encounters a memplex with a religious core.
Central memes and compatibility with X:
| Meme | Centrality $C(i)$ | Compatibility with X | Contribution |
|---|---|---|---|
| Faith | 0.40 | 0.2 | 0.08 |
| Community | 0.30 | 0.8 | 0.24 |
| Morality | 0.20 | 0.9 | 0.18 |
| Ritual | 0.10 | 0.5 | 0.05 |
Sum: $S(X) = 0.08 + 0.24 + 0.18 + 0.05 = 0.55$
Threshold: $\theta = 0.50$
Result: $S(X) = 0.55 > 0.50$ → meme is accepted, but with modification (low compatibility with “Faith” will cause adaptation).
See also: Immune system of the person and memeplexes — EMT, Parts XIII–XIV.
Structural Balance of the Memplex
With the introduction of signed edges ($w \in [-1, +1]$), the memplex becomes a signed network. Structural balance theory describes stable configurations of such networks.
Formalization: A signed graph $G = (V, E^+, E^-)$ is balanced if all cycles contain an even number of negative edges.
| Pattern | Clusters | Interpretation | Relationship to modularity Q |
|---|---|---|---|
| Strict balance (Cartwright & Harary, 1956) | 2 | Polarization: “us” vs “them” | $Q \to 0.5$ (two equal clusters) |
| Weak balance (Davis, 1967) | $k \geq 2$ | Healthy modularity | $Q \approx 0.3\text{–}0.5$ |
| Absence of balance | — | Cognitive dissonance, instability | Low Q, chaotic structure |
Dynamics: The memplex tends toward weak balance — this corresponds to a modular structure with several clusters. Pathological strict balance ($k = 2$) — black-and-white thinking, fanaticism.
Sources: Heider (1946) — P-O-X model; Cartwright & Harary (1956) — Structure Theorem; Davis (1967) — weak balance; Dalege, Borsboom et al. (2016) — CAN model; Brandt & Sleegers (2021) — belief systems as signed networks.
Conceptual description: Structural balance in the context of the immune system — see EMT, Part XV.
Ambivalence Metric
$$Ambivalence(m) = \sqrt{\frac{1}{|N(m)|}\sum_{j \in N(m)} (w_{mj} - \bar{w}_m)^2}$$| Ambivalence | $\bar{w}_m$ | Interpretation |
|---|---|---|
| High | $\approx 0$ | Unstable state: strong + and – connections. Tends toward resolution |
| High | $> 0$ | Accepted, but there is strong resistance. Vulnerable position |
| Low | $\approx 0$ | Neutrality: weak connections, indifference |
| Low | $\gg 0$ or $\ll 0$ | Consolidated position: accepted or rejected without conflict |
Predictive power: High ambivalence predicts instability: the meme is prone to abrupt transition (acceptance → rejection or vice versa) upon small external perturbation.
Fractal Structure of the I-Layer
The Problem of Scaling Immunity
The entry filter $S(X) = \sum_i C(i) \cdot compat(X, i)$ (above) operates at a single level of abstraction — this is a special case for a memplex without hierarchical structure. But a real memplex contains memes at different levels: from sensory micro-memes to abstract values. A filter tuned for semantic compatibility will not detect a perceptual artifact; a perceptual noise filter will not evaluate an ideological threat. Hierarchical filtering at different processing levels is described in attention research (Broadbent, 1958) and predictive processing (Friston, 2005). BMC formalizes it as per-level immune subsystems.
Formalization: $I^{(k)}$ — Immune Subsystem of Level $k$
Let the memplex be organized into a hierarchy of levels $k \in \{0, 1, \ldots, K\}$ (from sensory patterns to abstractions). At each level, an immune subsystem is defined:
$$I^{(k)} = \{m \in V_m^{(k)} : role(m) = \text{immune}\}$$Entry filter of level $k$:
$$S^{(k)}(X) = \sum_{i \in I^{(k)}} C^{(k)}(i) \cdot compat^{(k)}(X, i)$$where $C^{(k)}(i)$ is the centrality of immune meme $i$ at level $k$, $compat^{(k)}$ is the compatibility function defined for memes of level $k$.
Filtering rule: Meme $X$ of level $k$ is checked by filter $I^{(k)}$, not by filters of other levels:
$$\text{accept}^{(k)}(X) = \begin{cases} 1 & \text{if } S^{(k)}(X) > +\theta^{(k)} \\ 0 & \text{if } |S^{(k)}(X)| \leq \theta^{(k)} \\ -1 & \text{if } S^{(k)}(X) < -\theta^{(k)} \end{cases}$$The threshold $\theta^{(k)}$ may differ at each level (stricter at L3, softer at L0).
Immunogenesis: Formation of $I^{(k)}$ through Rejection Experience
$I^{(k)}$ is formed by the same mechanism as the main memes of level $k$ — through stabilization of recurring patterns:
$$\text{Repeated rejection of meme type } Y \text{ at level } k \implies \text{stabilization of immune pattern } i_Y \in I^{(k)}$$Fidelity of immune memes grows with the number of rejections (spacing effect):
$$F(i_Y, t) = F_0 + \alpha \cdot n_{reject}(Y, t)$$where $n_{reject}$ is the number of rejections of memes of type $Y$.
Constitutional G-Invariants as Hard Core
At level $K$ (abstract memes), $I^{(K)}$ includes immutable immune patterns — constitutional G-invariants:
$$CARE \geq RAGE, \quad PLAY \geq RAGE, \quad SEEKING > 0, \quad FEAR > 0$$These invariants are an analog of immunological tolerance: the I-system is trained not to attack “self” (the G-core). No experience and no level of $I^{(k)}$ can modify them.
Rate limitation of G-modification:
$$|\Delta G_i(t)| \leq \varepsilon_{max} \quad \forall t$$Violation → I-rejection (blocked as “infection”).
Predictions
$|I^{(k)}| / |M^{(k)}|$ — optimal fraction of immune memes. Speculative analogy: in biology ~5–10% of cells are immune. Direct transfer of this proportion to BMC is not theoretically justified. Prediction: when $|I^{(k)}| / |M^{(k)}| < \rho_{min}$ (threshold not determined), the memplex is vulnerable to “infections” at level $k$. Empirical calibration is required.
Vulnerability window during memogenesis. When the M-layer reaches a new level $k+1$ but $I^{(k+1)}$ is not yet formed, the memplex is vulnerable to memes at level $k+1$. Analog: immature immune system of the newborn; vulnerability of adolescents to ideological capture (PFC not myelinated → $I^{(3)}$ is weak).
Ablation: A flat I-system ($I = I^{(0)}$ for all levels) → degraded filtering at high levels. Metric: $R(t) = Q(t)/H(t) \cdot (1 - SIT(t))$ grows faster with flat I than with fractal I.
Conceptual justification: EMT, Part XV. Engineering specification: AGI_F, Part IV. Neurobiology: BM, Part IV.
Cognitive Biases as Consequences of Network Topology
Six already formalized BMC mechanisms give rise to ~200 cognitive biases as emergent side effects. Here we provide the formal definition of bias strength function for each mechanism.
Conceptual overview and mapping: EMT, Part XXIV.
Bias Strength Functions
1. H-bias (hub inertia): Bias strength is proportional to eigenvector centrality:
$$B_H(m_i) = 1 - \frac{1}{1 + \alpha_H \cdot C_E(m_i)}$$where $\alpha_H$ is a scaling coefficient. When $C_E \to 0$ (peripheral meme): $B_H \to 0$ — no bias, the meme is easily updated. When $C_E \gg 1$ (hub): $B_H \to 1$ — maximum bias, the meme is nearly immutable. Confirmation bias, belief perseverance, backfire effect are consequences of $B_H \approx 1$ for core memes of the memplex.
Link to cognitive dissonance: $P(\Delta a_i < 0 \mid D > \theta) \propto 1/C_E(i)$ (defined above, Hub Protection Hypothesis). H-bias is the steady-state manifestation of the same mechanism: hubs do not wait for dissonance — they preemptively filter incompatible information through I.
2. I-bias (immune filtration): Bias strength is determined by the filter threshold:
$$B_I(X, t) = 1 - S(X, t)$$where $S(X) = \sum_i C_i \cdot compat(X, m_i)$ — compatibility of the incoming meme $X$ with the current memplex (defined in the Entry Filter section). When $S(X) < \theta$, the meme is rejected → in-group bias, hostile media effect, not-invented-here. I-bias depends on memplex structure: changing core memes → changing the threshold → previously rejected memes are accepted (conversion).
3. W-bias (WM constraints): Bias strength is inversely proportional to free WM slots:
$$B_W(t) = 1 - \frac{k_{eff}(t)}{k_{active}}$$where $k_{eff}(t) = k_{active} - n_{captured}(t)$ (defined in the WM constraints section). When $k_{eff} = k_{active}$ (calm state, no G-capture): $B_W = 0$. When $k_{eff} = 1$ (strong G-capture or WM load): $B_W = 1 - 1/k_{active} \approx 0.75$. Anchoring, framing, base rate neglect are amplified proportionally to $B_W$: fewer free slots → the decision is determined by what is already loaded.
4. G-bias (affective capture): Bias strength is proportional to affective pressure:
$$B_G(t) = \frac{\sum_g T_g \cdot a_g(t) \cdot w_{capture,g}}{\sum_g T_g \cdot w_{capture,g}}$$where $w_{capture,g}$ is the number of WM slots captured by G-system $g$ (FEAR=1.0, RAGE=0.8, PANIC/GRIEF=0.7, …; defined in the G→WM capture section). $B_G \in [0, 1]$: when $a_g = 0$ for all G: $B_G = 0$; at full activation of the dominant G: $B_G \to 1$. Loss aversion, optimism bias, affect heuristic are proportional to $B_G$.
Loss asymmetry: FEAR ($w_{capture} = 1.0$) captures more WM slots than PLAY ($w_{capture} = 0$) → $B_G$ is higher for threats than for opportunities. This produces loss aversion as a network effect rather than an ad hoc parameter.
5. A-bias (automatization): Bias strength is proportional to the square of habit strength:
$$B_A(S, t) = \frac{Auto(S) \cdot habit(S, t)^2}{1 + Auto(S) \cdot habit(S, t)^2}$$Normalized to $[0, 1]$. $Auto(S)$ and $habit(S)$ are defined in the Automatization section. Cost of switching to an alternative:
$$Cost_{override}(S_{alt}) = c_0 \cdot habit(S_{current})^2 \cdot n_{exec}^{0.5}$$Status quo bias, functional fixedness, Einstellung effect are consequences of $Cost_{override} \gg 0$: even when a better strategy $S_{alt}$ is available, switching requires WM resources that may be insufficient (intersection with $B_W$).
6. R-bias (reconsolidation): Bias strength is active only in the labile window:
$$B_R(m_i, t) = Labile(m_i, t) \cdot \delta_{context}$$where $Labile(m_i, t) \in \{0, 1\}$ is defined in the Reconsolidation section (4 outcomes: strengthen, update, destabilize, erase). $\delta_{context}$ is the measure of context mismatch at retrieval. Hindsight bias: $Labile = 1$ upon recall + outcome is known ($\delta_{context}$ is high) → update incorporating the outcome. Misinformation effect: $Labile = 1$ + external meme with high $compat$ → update with false content.
Cross-Modulation: Negativity Bias as a Parameter
Negativity bias ($\lambda_{neg} < \lambda_{pos}$) is not a separate mechanism but a parameter amplifying H- and I-biases for negative content:
$$B_H^{neg}(m_i) = B_H(m_i) \cdot \left(\frac{\lambda_{pos}}{\lambda_{neg}}\right)^{\gamma} > B_H(m_i)$$with $\gamma > 0$ and $\lambda_{neg} < \lambda_{pos}$. Negative hubs are more inert → confirmation bias for negative beliefs is stronger than for positive ones.
Prediction P-CB1 (Formal Version)
If biases are generated by 6 independent mechanisms, bias correlations should cluster:
$$\rho(B_X(i), B_X(j)) > \rho(B_X(i), B_Y(j)) \quad \text{for } X \neq Y$$where $B_X(i)$, $B_X(j)$ are two biases from the same group $X$, and $B_Y(j)$ is a bias from another group $Y$.
Specifically: $\rho(\text{confirmation bias}, \text{belief perseverance}) > \rho(\text{confirmation bias}, \text{anchoring})$, because the first two are H-mechanism, while anchoring is W-mechanism.
Existing data (Ceschi et al., 2019) found 3 factors in 17 biases. BMC predicts: expanding the battery to 30+ biases (covering all 6 groups) → 6 factors, with H and I possibly merging (both are hub-dependent, but differ in mechanism: inertia vs filtration).
Neurodata: Confirmation Bias as Failure of Readout
Park et al. (2025, Nature Communications) showed: in a sequential categorization task, evidence is accurately encoded in parietal cortex regardless of compatibility with the current choice, but readout for behavior is greater for consistent evidence. This directly corresponds to the I-mechanism: $S(X)$ is lower for memes incompatible with the current memplex — not because the information is not encoded (the sensory S-layer works), but because the I-filter blocks readout for behavior.
Conceptual mapping: EMT, Part XXIV. Neurosubstrate: BM, Part IX. AGI engineering: AGI_F, Part VI.
Part X. Inter-Level Transitions: Multilayer Networks
The Scaling Problem
The theory applies at different levels:
- Memplex in the head (neural networks)
- Organization memplex (social bonds)
- State memplex (institutions, media)
The question: how to transfer concepts between levels?
N scales, not two. The hierarchy of scales is not limited to two or three levels: memes-in-agent ↔ agents-in-swarm ↔ teams-in-organization ↔ organizations-in-industry ↔ cultures-in-SMR. At each level, BMC = (G, M, I, S) is reproduced with the same formal apparatus (heavy-tailed distribution, Q, σ_SW, PA, hub displacement), but on a different substrate and with different timescales — renormalization invariance. SMR as a fractal BMC is formalized in SM, Part VII. All levels are in the green transfer zone for topological properties (Part XV).
Multilayer Networks: Unified Formalism
A multilayer network allows modeling all levels within a single framework.
Mathematical Formalization
A multilayer network is described by an adjacency tensor:
$$\mathcal{A}_{ij}^{[\alpha\beta]}$$where:
- $i, j$ — node indices
- $\alpha, \beta$ — layer indices
- $\mathcal{A}_{ij}^{[\alpha\alpha]}$ — connections within layer $\alpha$ (intra-layer)
- $\mathcal{A}_{ii}^{[\alpha\beta]}$ — connections between layers for node $i$ (inter-layer)
Intra-layer vs Inter-layer dynamics:
| Connection type | Example | Dynamics |
|---|---|---|
| Intra-layer | Meme ↔ Meme in the head | Associative activation |
| Inter-layer | Meme in the head ↔ Meme in the group | Verbalization, social influence |
General spreading dynamics:
$$a_i^{[\alpha]}(t+1) = f\left(\sum_j \mathcal{A}_{ij}^{[\alpha\alpha]} a_j^{[\alpha]}(t) + \sum_{\beta \neq \alpha} \mathcal{A}_{ii}^{[\alpha\beta]} a_i^{[\beta]}(t)\right)$$The first sum is the influence of neighbors in the same layer; the second is the influence of “copies” of the node in other layers.
Localized vs Delocalized Spreading
In multilayer networks, two regimes are possible (Tey & Cozzo, 2024):
| Regime | Characteristic | When it arises |
|---|---|---|
| Localized | Activation remains in one layer | Weak inter-layer connections |
| Delocalized | Activation spreads across all layers | Strong inter-layer connections |
Transition between regimes is a phase transition; the critical parameter is the inter/intra connection ratio.
Application to memeplexes:
- Localized regime: A thought “gets stuck” in one context (work does not affect family)
- Delocalized regime: A thought permeates all spheres of life (conversion, obsession)
Complex Contagion vs Simple Contagion
Key difference between memes and diseases:
| Characteristic | Simple contagion (diseases) | Complex contagion (memes) |
|---|---|---|
| Mechanism | One contact is sufficient | Multiple confirmations needed |
| Threshold | $\beta > 0$ | $\beta > \theta$ (social threshold) |
| Structural influence | Node degree | Clustering and overlap |
| Speed | Exponential | Slower, but more stable |
Complex contagion formula:
$$P(\text{adoption}_i) = \begin{cases} 1 & \text{if } \frac{|\text{active neighbors}|}{k_i} > \theta \\ 0 & \text{otherwise} \end{cases}$$where $\theta$ is the adoption threshold (fraction threshold).
Consequence: For spreading complex memes (ideologies, practices), bridges with wide overlap are important — not just a connection between clusters, but multiple connections. A single bridge is insufficient.
Source: Kivela et al. (2014), J. Complex Networks — review of multilayer networks; Behavior Research Methods (2025) — cognitive multiplex networks.
Replication Pressure: Individual Mechanism of Transmission
Simple and complex contagion describe meme reception (adoption): under what conditions an agent accepts a meme from others. But contagion is a two-sided process. What drives transmission (expression) from the speaker’s side?
The standard answer — “social influence,” “communicative intention” — describes the phenomenon but does not explain it. In the BMC formalism, the answer is direct: a meme is a replicator, and an activated meme creates pressure on the communication channel. This is not a metaphor: the pressure is computable and predicts observed behavior.
Definition. The replication pressure (expression pressure) of meme $m_i$ at time $t$ with an open communication channel:
$$R_{expr}(m_i, t) = a_i(t) \cdot F_i \cdot \text{rel}(m_i, \text{context}(t)) \cdot (1 + \alpha_C \cdot C_E(m_i))$$where:
- $a_i(t)$ — current activation of the meme
- $F_i$ — fidelity (a consolidated meme is expressed more accurately → more frequently)
- $\text{rel}(m_i, \text{context})$ — relevance to the current communication context (cosine similarity with the conversation topic)
- $C_E(m_i)$ — eigenvector centrality (hub bonus: central memes are expressed more frequently)
- $\alpha_C$ — hub amplification coefficient (≈ 0.3)
Threshold condition: A meme is expressed if $R_{expr}(m_i, t) > \theta_{expr}$ and the communication channel is open (the agent is in dialogue, not in inner speech).
Candidate selection: Among memes with $R_{expr} > \theta_{expr}$, the top-$k$ with the highest pressure are selected. Parameter $k$ is limited by channel throughput (one person cannot express 20 thoughts simultaneously; speech bottleneck ≈ 150 bits/s, see EMT, Part XX).
Two types of communicative impulse. In BMC, communication has two independent sources:
| Source | Drive | Mechanism | Direction | Example |
|---|---|---|---|---|
| G-layer | SEEKING | Utilitarian (curiosity, information seeking) | Incoming: “want to find out” → questions | “What do you think about…?” |
| M-layer | $R_{expr}$ | Emergent (replication pressure of active memes) | Outgoing: “meme wants to be expressed” → sharing | “I had a case too…” |
These are not the same thing. SEEKING is a G-program (innate drive, genetic layer). Replication pressure is an emergent property of the M-layer (memes replicate because it is their nature, just as genes replicate through DNA polymerase). In real dialogue, both impulses compete for the communication channel:
$$\text{expression-mode}(t) = \begin{cases} \text{listen} & \text{if } a_{SEEKING}(t) \gg \sum_i R_{expr}(m_i, t) \\ \text{share} & \text{if } \sum_i R_{expr}(m_i, t) \gg a_{SEEKING}(t) \\ \text{both} & \text{otherwise (share + ask)} \end{cases}$$A healthy dialogue is an alternation of listen and share. Monopoly of one impulse produces dysfunction:
| Pattern | Dominant impulse | Phenomenon |
|---|---|---|
| Only SEEKING, no $R_{expr}$ | G-driven | “Interrogation”: the interlocutor feels investigated, not communicated with |
| Only $R_{expr}$, no SEEKING | M-driven | Monologue: narcissist, lecturer without feedback |
| $R_{expr}$ dominates for SMC hubs | SMC-biased | Narcissism: self-referential memes have high $R_{expr}$ |
| $R_{expr}$ dominates for one cluster | Cluster-biased | “Boring”: the expert only talks about their field |
| Balance of SEEKING ↔ $R_{expr}$ | Healthy | Lively dialogue: “share — ask — listen — share” |
Replication success. Meme expression is not the end of replication. Successful replication = the meme is accepted by the recipient (adoption, see complex contagion above). Feedback:
$$\text{success}(m_i, t) = \begin{cases} 1 & \text{if the recipient picked up the topic (similarity > } \theta_{adopt}\text{)} \\ 0 & \text{if ignored or rejected} \end{cases}$$Upon success: $w_{ij}$ is strengthened (link between the meme and the context in which it successfully replicated). Upon failure: $R_{expr}$ decreases for this particular recipient (not fidelity — the meme did not become less “accurate,” it’s just that this recipient did not accept it). This is adaptive modulation: an experienced communicator learns which memes work with which interlocutor.
Prediction P-E1: People with high hub centrality in a particular semantic domain spend more time talking on that topic (controlling for profession). Testable: analysis of speech corpora + extraction of individuals’ semantic networks → correlation of eigenvector centrality with share of speech on the topic.
Prediction P-E2: In one-sided communication (one talks, the other only listens) the speaker is subjectively satisfied (replication pressure is discharged), the listener is not (their $R_{expr}$ is not discharged). In bilateral communication — both are satisfied. Testable: experimental manipulation of dialogue structure + satisfaction self-report.
Link to neurobiology: see BM, Part V — Broca as substrate, inner speech, tip-of-the-tongue as evidence of $R_{expr}$.
Architectural specification for AGI: see AGI_F, Part IV — expression candidates, pseudocode.
Philosophical justification: see EMT, Part XX — analogy with DNA polymerase, bidirectional communication.
Stigmergy: Coordination through the Environment
Simple and complex contagion describe direct meme transmission (agent → agent). But in multilayer networks, there exists a fundamentally different coordination mechanism — stigmergy (Grasse, 1959): indirect coordination through traces in the environment.
Definition: An agent leaves a trace in a shared environment (M-layer). Another agent reads the trace and modifies behavior. There is no direct contact between agents.
| Mechanism | Transmission type | Substrate | Example |
|---|---|---|---|
| Simple contagion | Direct, threshold = 1 | Agent → agent contact | Virus, gossip |
| Complex contagion | Direct, threshold > 1 | Multiple contacts | Ideology, complex practice |
| Stigmergy | Indirect, through environment | Persistent traces in M-layer | Pheromone trails, laws, Wikipedia |
In the multilayer network formalism, stigmergy is an inter-layer connection of a special type: an agent in layer $\alpha$ modifies a node in layer $\beta$ (environment), and an agent from layer $\gamma$ reads the modification. The adjacency tensor $\mathcal{A}_{ij}^{[\alpha\beta]}$ describes not only direct connections between layers but also environment-mediated ones.
Why stigmergy scales while direct coordination does not:
- Direct coordination requires $O(N^2)$ connections (or hubs, creating a single point of failure)
- Stigmergy scales to $O(N)$: each agent interacts with the environment, not with every other agent
- Traces are persistent (books, laws, code) — coordination does not require simultaneous presence of participants
Stigmergy in BMC terms: the environment = an externalized M-layer. All four BMC subsystems function identically:
- G — utilitarian drives determine which traces an agent leaves and which it responds to (SEEKING → scouting new trails, CARE → maintaining structures, FEAR → marking dangerous zones)
- M — traces in the environment = memes with activation (trace intensity), fidelity (preservation), and connections (the trail leads to food)
- I — the immune system filters foreign traces (an ant from another colony ignores foreign pheromones; a Wikipedia user reverts vandalism)
- S — information deficits in the environment (a broken trail, an unfinished article) activate SIT and motivate the agent to fill the gap
Replication in stigmergic systems. Objection: “a meme is a replicator, but a trace in the environment does not replicate.” Response: in stigmergy, what replicates is not the trace but the behavior. An ant sees the trail → follows it → deposits pheromone. The behavior “follow and reinforce” replicates from agent to agent. The trace is the cumulative result of replicated behavior. The analogy with the neural level is exact: in the brain, what replicates is the activation pattern (neuron behavior), and synaptic weight is the cumulative result. There is no fundamental difference between synapse reinforcement and trail reinforcement.
Formally, a stigmergic trace satisfies all criteria of Darwinian selection:
| Criterion | Stigmergic trace | Neural meme |
|---|---|---|
| Variation | Scouts create new trails, errors → alternative paths | Mutations upon reactivation, reinterpretation |
| Differential persistence | Efficient trails are reinforced, inefficient ones fade | Active ensembles strengthen, inactive do not |
| Inheritance | Trace is persistent — the next agent “inherits” the result of the previous one | Synaptic weight is preserved between activations |
| Behavioral replication | Each agent following the trail repeats predecessors’ behavior | Each reactivation repeats the activation pattern |
Stigmergy in SMR. Stigmergic traces in SMR form their own heavy-tailed graph: frequently used “knowledge trails” are reinforced (PA), rarely used ones fade (decay), hub-paradigms and Q-modules (cultural branches) form. This is renormalization invariance: neural synapses ($L_1$) → social bonds ($L_2$) → corporate processes → industry standards → stigmergic traces in SMR ($L_N$). Formalization: SM, Part VII.
Prediction: Stigmergic systems (ant colonies, open-source projects, wiki platforms) demonstrate the same BMC patterns as individual memeplexes: competition of traces for resources (attention), immune selection (moderation), SIT-driven innovation (gap filling), and phase transitions at critical trace mass.
Source: Grasse (1959) — the term “stigmergy” for termite coordination; Theraulaz & Bonabeau (1999), Artificial Life — formalization of stigmergy in multi-agent systems; Heylighen (2016), Cognitive Systems Research — stigmergy as the basis of collective intelligence.
Correspondence Table between Levels
| Concept | Individual | Group | Society |
|---|---|---|---|
| Node | Meme | Person | Institution |
| Edge | Association | Social bond | Communication |
| Weight | Connection strength | Contact frequency | Flow intensity |
| Activation | Neural activity | Attention to topic | Media coverage |
| Cluster | Subpersonality | Faction | Social stratum |
| Centrality | Meme “strength” | Influence | Institutional power |
| Modularity | Internal consistency | Cohesion | Societal polarization |
What Transfers and What Does Not
Transfers (isomorphism):
- Logic of spreading activation
- Competition for limited resources
- Cluster structure
- Defense mechanisms
- Network effects (cascades, phase transitions)
Does not transfer (substrate differences):
- Timescales (ms vs years)
- Intentionality (neurons vs people)
- External memory (only societies have books, archives)
- Reflexivity (people can consciously change connections)
Formal Criteria for Scale Transfer
The assertion “one algorithm on different substrates” requires qualification: formal isomorphism (matching equations) does not guarantee causal isomorphism (matching mechanisms). Physics solves this problem through universality classes: systems with different micro-mechanisms but identical critical exponents belong to one class (Kadanoff, 1966; Wilson, 1971; Batterman, 2002).
Empirical fact: neuronal avalanches (Beggs & Plenz, 2003) and information cascades in social networks (Notarmuzi et al., 2022, Nature Communications) obey power laws, but with different critical exponents:
$$\tau_{\text{neural}} \approx \frac{3}{2} \quad \text{(mean-field branching)}, \qquad \tau_{\text{social}} \approx \frac{9}{4} \quad \text{(RFIM)}$$Therefore, the neural and social levels belong to different universality classes. This does not refute the isomorphism — it refines its boundaries.
Transfer Criterion through Renormalization
Geometric renormalization of networks (Garcia-Perez, Boguna & Serrano, 2018, Nature Physics; Villegas et al., 2023, Nature Physics) provides a formal tool: under coarse-graining (scale enlargement), we verify invariant preservation.
Transfer of prediction $P$ from level $\alpha$ to level $\beta$ is valid if:
$$\forall \, \mathcal{I} \in \text{dep}(P): \quad \left|\frac{\mathcal{I}^{[\beta]} - \mathcal{I}^{[\alpha]}}{\mathcal{I}^{[\alpha]}}\right| < \epsilon$$where $\text{dep}(P)$ is the set of topological invariants on which prediction $P$ depends; $\mathcal{I}^{[\alpha]}$ is the invariant value at level $\alpha$; $\epsilon$ is the tolerance.
Principal invariants:
| Invariant $\mathcal{I}$ | Notation | Preserved in individual→society transition? |
|---|---|---|
| Small-world index | $\sigma_{SW} = \frac{C/C_{rand}}{L/L_{rand}}$ | Yes — both levels are small-world |
| Degree distribution shape | $P(k) \sim k^{-\gamma}$ | Yes — both levels are heterogeneous (γ ∈ [2, 3]) |
| Modularity | $Q$ | Yes — cluster structure at both levels |
| Critical avalanche exponent | $\tau$ | No — different classes (3/2 vs 9/4) |
| Timescale ratio | $\epsilon = t_{intra}/t_{inter}$ | No — ms vs years (∼10⁹) |
Consequence: Predictions depending only on the first three invariants (topology, degree distribution, modularity) transfer. Predictions depending on critical exponents or timescale require a scaling correction.
Three Transfer Zones
| Zone | Invariants | Example prediction |
|---|---|---|
| Unconditional | Topology, $Q$, degree distribution | “A hub is more resilient to attack than the periphery” |
| With correction | + critical exponents, timescale | “The cascade will reach N nodes in time T” (T is recalculated) |
| Impossible | Substrate-specific | LTP/LTD, neurotransmitters, money, laws |
Operationalization of the Yellow/Red Zone Boundary: Structural Equivalence Test (SET)
Mechanism $\mathcal{M}$ is in the red zone for the level pair $(L_1, L_2)$ if and only if:
$$\nexists \; \varphi: \mathcal{M}_{L_1} \to \mathcal{M}_{L_2} \quad \text{s.t. } F_{L_1}(\mathbf{x}, \boldsymbol{\theta}_1) \equiv F_{L_2}(\varphi(\mathbf{x}), \boldsymbol{\theta}_2)$$where $F$ is the dynamic equation of the mechanism, $\mathbf{x}$ is the state vector, $\boldsymbol{\theta}$ are parameters. If the mapping $\varphi$ exists and the equations are structurally equivalent (same functional form $F$, different $\boldsymbol{\theta}$), the mechanism is transferable.
Decision procedure:
- $\nexists$ functional analog at the target level → red
- $\exists$ analog, but $F_{L_1} \not\equiv F_{L_2}$ (different equation form) → red
- $F_{L_1} \equiv F_{L_2}$, prediction depends only on topological invariants → green
- $F_{L_1} \equiv F_{L_2}$, prediction depends on $\boldsymbol{\theta}$ → yellow (parameter recalculation)
Sources: Garcia-Perez, Boguna & Serrano (2018), Nature Physics 14, 583 — geometric RG for networks; Villegas, Gili, Caldarelli & Gabrielli (2023), Nature Physics 19, 445 — Laplacian RG for heterogeneous networks; Notarmuzi et al. (2022), Nature Communications 13, 1308 — universality classes in social vs neural avalanches; Batterman (2002), The Devil in the Details, OUP — formal vs causal isomorphism across scales.
Part XI. Communicative Asymmetry of the Graph
11.1. Motivation: From Symmetric Edges to Directed Flow
In Part V, edges are formalized as $w_{ij} = w_{ji}$ — an undirected graph with symmetric weights. This is correct for structure: the association between two memes reflects the overlap of their neural ensembles (Cai et al., 2016), and overlap is symmetric by definition ($|ens_i \cap ens_j| = |ens_j \cap ens_i|$).
However, the effective flow through an edge is asymmetric. A hub meme with activation $a = 0.9$ transmits strong spreading activation along all its edges; a peripheral meme with $a = 0.1$ transmits weak activation. Yet the edge weight between them is the same in both directions. The result: the overwhelming majority of edges in BMC function as unidirectional broadcast, although structurally they remain bidirectional.
What is needed is a formalism describing this functional directionality without violating the biologically correct weight symmetry.
11.2. Formal Definitions
Definition 11.1 (Effective flow). For each edge $(i, j)$ at time $t$, we define the directed flow:
$$\phi_{i \to j}(t) = a_i(t) \cdot w_{ij} \cdot \sigma(\ldots)$$Since $a_i(t) \neq a_j(t)$ in general, $\phi_{i \to j}(t) \neq \phi_{j \to i}(t)$ even when $w_{ij} = w_{ji}$.
Definition 11.2 (Effective directionality). Averaged over a window of $T$ ticks (by default $T = \tau_{generation}$):
$$D_{eff}(i,j) = \frac{|\bar{\phi}_{i \to j} - \bar{\phi}_{j \to i}|}{|\bar{\phi}_{i \to j}| + |\bar{\phi}_{j \to i}|}$$where $\bar{\phi}_{i \to j} = \frac{1}{T}\sum_{t=1}^{T} \phi_{i \to j}(t)$.
- $D_{eff} = 0$: purely bidirectional (flows are equal)
- $D_{eff} = 1$: purely unidirectional (flow only in one direction)
Definition 11.3 (Bidirectional degree). The number of edges of node $i$ with $D_{eff}$ below the threshold:
$$k_{bid}(i, \theta) = |\{j : D_{eff}(i,j) < \theta\}|$$Analogously, $k_{uni}(i, \theta) = |\{j : D_{eff}(i,j) \geq \theta\}|$ — unidirectional degree.
Definition 11.4 (Bidirectional capacity). $N_{bid}(i) = k_{bid}(i, \theta_{bid})$ — the number of effectively bidirectional partners of node $i$.
11.3. Three Layers of Asymmetry
| Layer | Mechanism | Formula | Code status |
|---|---|---|---|
| 1. Activation | $a_i \neq a_j$ → $\phi_{i \to j} \neq \phi_{j \to i}$ | $\phi = a \cdot w \cdot \sigma(\ldots)$ | Implemented (Level 1) |
| 2. Decay | centrality-based: hub–hub decay slower | $\lambda(e) = f(C_i, C_j, \kappa)$ | Partial (Level 2) |
| 3. Weight | $w_{ij} = w_{ji}$ always (structural symmetry) | $w_{ij} \equiv w_{ji}$ | Not violated (biologically correct) |
Conclusion: asymmetry in BMC is functional (from activation magnitude), not structural (weights remain symmetric). This is biologically correct: synaptic overlap = symmetric, but firing rate = asymmetric.
11.4. Law $N_{bid}(L) \approx \text{bandwidth}(L) - 1$
Proposition 11.1. Bidirectional capacity at level $L$ is determined by the throughput of the cognitive channel at that level.
| Level | bandwidth | $N_{bid}$ | Determined by |
|---|---|---|---|
| L1: memes in WM | $k_{active} \approx 4$ | $\sim 3$ | WM bottleneck |
| L2: agents in swarm | cognitive capacity | $\sim 150$ (Dunbar) | Neocortex ratio |
| L3: cultures in SMR | diplomatic bandwidth | $\sim 10$–$20$ | Alliance maintenance cost |
Mechanism: Bidirectional = requires mutual co-activation / contact. Co-activation requires WM co-presence. WM holds 3–4 slots → maximum 3 bidirectional partners per tick. Over $T$ ticks, only persistent “neighbors” from the same cluster accumulate enough co-activations for $D_{eff} \to 0$.
At level L2, the WM slot analog is the number of bonds an agent can maintain (Dunbar number, $\sim 150$; Dunbar, 1992). At level L3 — the number of active alliances a culture can simultaneously maintain (diplomatic bandwidth, $\sim 10$–$20$).
11.5. Empirical Data (Level 1)
Results from bid_analysis.py (seed=42, N=505 main memes + utility, 1000 ticks):
- 1094 unique edges, 99% weight < 0.17, mean weight = 0.01
- Top-10 hubs: $k_{total} = 13$–$16$, $k_{bid} = 0$–$3$ (ratio $\sim 5\%$)
- Middle tier: $k_{total} = 8$–$9$, $k_{bid} = 0$–$3$
- Bottom tier: $k_{total} = 4$, $k_{bid} = 0$
- $N_{bid} \approx 3$ at ANY threshold ($0.1$–$0.7$) — does not depend on the choice of $\theta$
Note: $k_{bid}$ was determined through weight threshold (proxy for $D_{eff}$): $w > \theta$ → regular co-activation → bidirectional flow. Full $D_{eff}$ analysis is a separate validation (Level 3).
11.6. Edge Lifecycle (Directed)
| Phase | L1 (memes) | L2 (agents) | L3 (cultures) |
|---|---|---|---|
| Birth | Co-activation in WM | First contact | Trade contact |
| Bidirectional | Regular WM co-presence | Mutual exchange | Active alliance |
| → Unidirectional | One dropped out of WM | One stopped responding | One-sided influence |
| → Death | weight < prune threshold | No contact $> \tau$ | Isolation |
Transition formula bid → uni: if over $T$ ticks the co-activation count $< \theta_{maintain}$ → $D_{eff}$ grows → the edge becomes unidirectional. Formally:
$$\frac{dD_{eff}}{dt} > 0 \quad \text{if} \quad n_{co\text{-}act}(T) < \theta_{maintain}$$11.7. Consequences for Network Dynamics
Hub = broadcaster. A hub “communicates” bidirectionally with $\sim 3$ memes (nearest neighbors in the cluster). The remaining $10+$ edges = broadcasting (one-sided spreading activation). This explains why LOCAL normalization (shared neighbors $\sim 5$) is correct for hub displacement (Part VIII, S6): competition is for $\sim 3$ bidirectional slots, not for the entire $k_{total}$.
Modularity from asymmetry. Bidirectional edges are concentrated within clusters (co-activation frequency is higher within communities). Consequence: $Q$ is amplified — the bidirectional core forms a dense module nucleus, cross-cluster edges remain unidirectional and unstable.
Hierarchical encoding. Inner core ($k_{bid}$ — bidirectional partners) → middle ($k_{uni}$ — frequent activation recipients) → periphery (rare activation). Fractally repeated at L2: inner core $\sim 5$–$15$ (close contacts) → middle $\sim 150$ (Dunbar) → periphery $\sim 1000+$ (acquaintances). See SM, Part II.
11.8. Predictions
P-CA1 ($N_{bid}$ is determined by $k_{active}$). When $k_{active}$ increases from 4 to 6 → $N_{bid}$ grows to $\sim 5$. Verification: change WM_CAPACITY in the simulator, recompute bidirectional degree.
P-CA2 (Locality of hub displacement). Competition for $\sim 3$ bidirectional slots → displacement effect is local (shared neighbors), not global (total degree). Verification: S6 test with local vs global normalization.
P-CA3 (Bimodality of $D_{eff}$). The distribution of $D_{eff}$ across all edges is bimodal (peaks near 0 and near 1). Intermediate values are rare due to the WM bottleneck: an edge is either regularly co-activated (bidirectional) or not (unidirectional). Verification: $D_{eff}$ histogram from bid_analysis.py.
Sources: Dunbar R.I.M. (1992). “Neocortex size as a constraint on group size in primates.” Journal of Human Evolution; Cai D.J. et al. (2016). “A shared neural ensemble links distinct contextual memories encoded close in time.” Nature; Hill R.A., Bentley R.A. & Dunbar R.I.M. (2008). “Network scaling reveals consistent fractal pattern in hierarchical mammalian societies.” Biology Letters; Cowan N. (2001). “The magical number 4 in short-term memory.” Behavioral and Brain Sciences.
Part XII. What Can Now Be Predicted
Hypothesis → Network Formulation → Verification Method
| Hypothesis | Network formulation | Verification method |
|---|---|---|
| Beliefs become more rigid with age | Modularity $Q$ increases with age | Semantic networks across age groups |
| Stress causes regression | k-core (dense core) is activated | fMRI under stress |
| Charismatic leaders are hub memes | High eigenvector centrality | Social network analysis |
| Propaganda targets bridges | Reduced betweenness of targets | Content analysis + network analysis |
| Virality depends on structure | Exceeding the epidemic threshold | Spreading simulations |
Specific Predictions
1. Age-related rigidity = growth of modularity
Prediction: In elderly individuals, semantic networks exhibit higher modularity than in younger individuals.
Mechanism: With age, intra-cluster connections are strengthened while inter-cluster connections are weakened.
Formula:
$$Q_{old} > Q_{young}$$Verification: Compare the modularity of word association networks across different age groups.
Differential plasticity at low openness. When $O < 0.3$, a differential plasticity regime is activated: intra-cluster connections are strengthened while inter-cluster connections are weakened with intensity proportional to “rigidity” $r = 1 - O$:
$$\beta_{intra} = 1 + 0.025 \cdot r, \quad \beta_{cross} = 1 - 0.04 \cdot r$$Weight update at each step:
$$w_{ij} \leftarrow \begin{cases} \min(1, w_{ij} \cdot \beta_{intra}) & \text{if } c(i) = c(j) \\ \max(\epsilon, w_{ij} \cdot \beta_{cross}) & \text{if } c(i) \neq c(j), w > 0 \\ \max(-1, w_{ij} \cdot (2 - \beta_{cross})) & \text{if } c(i) \neq c(j), w < 0 \end{cases}$$This mechanism formalizes “cognitive entrenchment”: as openness decreases, the memplex actively reinforces internal coherence at the expense of inter-cluster connectivity.
2. Schizophrenia = network fragmentation
Prediction: In schizophrenia, betweenness centrality of bridge nodes is reduced.
Mechanism: “Bridges” between clusters are destroyed, leading to fragmentation of thought.
Formula:
$$\bar{C}_B^{schizo} < \bar{C}_B^{healthy}$$Verification: Compare network metrics of semantic networks.
3. Regression under stress = k-core activation
Prediction: Under stress, the network’s “core” is activated — nodes with maximum connectivity.
Mechanism: The k-core is a subgraph where each node has at least $k$ connections. These are the most “ancient” and deeply rooted memes.
Formula:
$$\text{stress} \Rightarrow a_{k-core} \uparrow$$Verification: fMRI: under stress, brain regions associated with basic beliefs are activated.
4. Meme virality = exceeding the epidemic threshold
Prediction: A meme becomes viral when the basic reproductive number $R_0 > 1$.
Formula:
$$R_0 = \frac{\beta}{\gamma} \cdot \langle k \rangle$$where:
- $\beta$ — probability of transmission per contact
- $\gamma$ — rate of “recovery” (forgetting)
- $\langle k \rangle$ — average node degree in the social network
Verification: Analysis of meme propagation dynamics on social media.
4a. Virality in networks with hubs
In heterogeneous networks with hubs, the epidemic threshold is significantly lower than in homogeneous networks:
$$\lambda_c = \frac{\langle k \rangle}{\langle k^2 \rangle}$$Mathematical detail: For a strict power law with $\gamma \leq 3$ in an infinite network, $\lambda_c \to 0$. For finite networks and other heavy-tailed forms (log-normal, truncated power law), the threshold exists but remains low due to the high degree variance $\langle k^2 \rangle$.
What this means: In a network with hubs (including memplexes and social networks), a meme with moderate “contagiousness” can spread — if it starts from the right entry point. Exact threshold values depend on the specific topology, but the qualitative conclusion is robust.
Key implication: The entry point is critical.
| Entry point | Probability of virality | Mechanism |
|---|---|---|
| Hub (influencer, central meme) | High | Instant access to thousands of connections |
| Random peripheral node | Low | Many “hops” needed to reach a hub |
| Bridge between clusters | Medium | Access to several communities |
Practical conclusion: For meme propagation, it matters less what the meme’s “quality” is and more through whom it enters. A meme that starts from a hub has a chance even with weak “contagiousness.” A meme from the periphery may be “perfect” but will never spread.
4b. Radicalization: trajectory in BMC parameter space
Radicalization is not a discrete event but a continuous trajectory in the space $(Q, \sigma_{SW}, SIT, D)$.
Formalization:
Let $\mathbf{x}(t) = (Q(t), \sigma_{SW}(t), SIT(t), H(t))$ be the state vector of a memplex, where $H(t) = -\sum_c p_c \log p_c$ is the entropy of the activation distribution across clusters (a measure of diversity).
Radicalization = trajectory:
$$\mathbf{x}(0) = (Q_0, \sigma_0, SIT_0, H_0) \xrightarrow{t \to T} \mathbf{x}(T) = (Q_{high}, \sigma_0, SIT \to 0, H_{low})$$Characteristics:
- $Q(t) \uparrow$ (modularity increases — the memplex fragments into “us” and “them”)
- $H(t) \downarrow$ (diversity of active clusters drops — a single ideological module dominates)
- $SIT(t) \to 0$ (all gaps are “closed” by false closure — the ideology explains everything)
- $\sigma_{SW}$ may remain $\approx 1$ (the radical does not lose cognitive capabilities; they lose content)
Critical point — I-layer recalibration:
$$I_{threshold}(t) = I_0 + \Delta I \cdot \mathbb{1}[\text{in-group}] - \Delta I \cdot \mathbb{1}[\text{out-group}]$$Immune system thresholds shift: in-group memes are accepted at low compatibility, while out-group memes are rejected at high compatibility.
Radicalization metric:
$$R(t) = \frac{Q(t)}{H(t)} \cdot (1 - SIT(t))$$$R \to \infty$ when $H \to 0$ (monopoly of a single cluster) and $SIT \to 0$ (no doubts). Normal state: $R \approx 1$. Preliminary danger zone threshold: $R > 5$ (requires empirical calibration).
Comparison with other trajectories:
| Trajectory | Q | H | SIT | $\sigma_{SW}$ | R |
|---|---|---|---|---|---|
| Normal | Medium | High | > 0 | $\approx$ 1 | ~1 |
| Depression | High | Low | High, LP = 0 | < 1 | ~2 (but $\sigma$ $\downarrow$) |
| Radicalization | High | Low | $\to$ 0 | $\approx$ 1 | » 5 |
| Flow | Medium | Medium | Optimal | $\approx$ 1 | ~1 |
Testable prediction: Semantic network analysis of radicals will show: (1) elevated Q (modularity), (2) reduced H (entropy), (3) reduced SIT-proxy (Need for Cognition Scale). All three are quantitative metrics accessible through standard methods.
Summary Table of Predictions
| # | Phenomenon | Network cause | Key metric |
|---|---|---|---|
| 1 | Age-related rigidity | Increased modularity | $Q$ |
| 2 | Schizophrenia | Fragmentation | Bridge $C_B$ |
| 3 | Regression | k-core activation | $a_{k-core}$ |
| 4 | Virality | $R_0 > 1$ | $R_0$ |
| 5 | Polarization | Bimodality | Distribution of $Q$ |
| 6 | Conversion | Connection rewiring | $\Delta E$ |
| 7 | Depression | Reduced activation | $\bar{a}$ |
| 8 | Creativity | Weak inter-cluster connections | Low $Q$, high path length |
| 9 | Persistence of unresolved problems | Structural gaps generate SEEKING | $SIT(C)$, correlation with DMN |
| 10 | Levels of consciousness (wakefulness, sleep, coma) | $CL = \sigma_{SW} \cdot A_{SMC} \cdot f(Balance)$ | $CL$, correlation with PCI |
| 11 | Unity of experience / fragmentation | Triple binding (struct + temp + comp) | $Binding_{struct} \cdot Binding_{temp} \cdot Binding_{comp}$ |
| 12 | Sudden attitude change | Sign inversion of connection (bifurcation) | $\sum w_{positive} > |
| 13 | Ideological splitting / discipline merging | Q dynamics: splitting ($Q \uparrow$) vs merging ($Q \downarrow$) | $Q(t)$, $Q_{crit}$ |
Prediction #9: SIT and persistent activation of incomplete clusters
Formulation: Clusters with $SIT(C) > 0$ generate persistent SEEKING activation correlating with Default Mode Network (DMN) activity, even in the absence of external stimuli.
Test: Construct semantic networks of subjects (using free association methods), identify clusters with high gap density, conduct fMRI resting-state and measure the correlation:
$$r = corr\Big(\sum_C SIT(C), \; DMN_{activation}\Big)$$Accompanying test (Zeigarnik extension): Give subjects a set of tasks, interrupt a random subset. Measure: (a) free recall (classic Zeigarnik), (b) DMN activation for clusters containing interrupted tasks.
Prediction: $DMN_{activation}$ for interrupted tasks is higher by $\delta \propto centrality(C)$ — the more important the cluster, the stronger the effect.
Falsification: If gap density of semantic networks does not correlate with DMN activation during resting-state (at $p > 0.05$, $N > 50$), the SIT mechanism is refuted or requires revision.
See also: DMN as substrate for SIT — BM, Part III; SIT-specific predictions — BM, Part XI.
See also: Theory falsifiability, what would refute it — EMT, Part XXIX.
Part XIII. Consciousness Level — A Metric for the Level of Consciousness
Problem: no scale for states of consciousness
The theory describes consciousness through SMC (Self-Model Cluster) (see EMT, Part XVI), but does not formalize levels of consciousness: wakefulness vs sleep vs coma vs psychedelics. A metric computable through network parameters is needed.
CL Formula
$$CL(t) = \sigma_{SW}(t) \cdot A_{SMC}(t) \cdot f(Balance(t))$$where:
- $\sigma_{SW}$ — small-worldness (integration + differentiation, analog of Massimini’s PCI)
- $A_{SMC}$ — activity of the Self-Model Cluster: $A_{SMC} = \frac{1}{|SMC|}\sum_{m \in SMC} a_m(t)$
- $f(Balance)$ — a bell-shaped function, maximum at $Balance \in [1, 2]$
Definition of Balance(t):
$$Balance(t) = \frac{\bar{a}_M(t)}{\bar{a}_G(t) + \epsilon}$$where $\bar{a}_M = \frac{1}{|V_m|}\sum_{i \in V_m} a_i(t)$ is the mean activation of meme-nodes, $\bar{a}_G = \frac{1}{|V_u|}\sum_{g \in V_u} a_g(t)$ is the mean activation of utility nodes, and $\epsilon$ is a small constant ($\epsilon \sim 10^{-3}$) preventing division by zero. Neurobiological proxy: $A_{PFC}/A_{limbic}$ (see BM, Part III).
$$f(Balance) = \exp\left(-\frac{(Balance - Balance_{opt})^2}{2\sigma_B^2}\right), \quad Balance_{opt} = 1.5, \; \sigma_B = 0.7$$Interpretive invariance. CL(t) and its extension $CL_{full}(t) = CL \cdot I_{intero}$ are defined exclusively through network metrics ($\sigma_{SW}$, $A_{SMC}$, Balance; $I_{intero}$ is added only in $CL_{full}$) — no component contains variables dependent on one’s position regarding the Hard Problem. All empirical predictions of BMC (proxy indices, phase transitions, component-wise analysis of states) are identical under realism, illusionism, dualism, and neutral monism. The formalism is interpretationally invariant (details — EMT, Part IV: Ontological Neutrality of the BMC Formalism).
Proxy Metrics for Experimental In Vivo Validation
CL is directly computable only in the model (the full memplex graph is known). For experimental verification, each CL component requires a measurable proxy indicator:
| CL component | Proxy | Measurement method | Justification |
|---|---|---|---|
| $\sigma_{SW}$ | PCI (Perturbational Complexity Index) | TMS-EEG | PCI = Lempel-Ziv complexity of cortical response to TMS perturbation; measures the balance of integration and differentiation — the same quantity as $\sigma_{SW}$ on the graph (Casali et al., 2013, Science Translational Medicine) |
| $A_{SMC}$ | DMN activity (BOLD mPFC, PCC, precuneus, TPJ) | fMRI | DMN is the neurosubstrate of self-referential processing; layer-by-layer mapping see table below (Northoff et al., 2006; Brewer et al., 2011, PNAS) |
| $f(Balance)$ | Directed connectivity PFC$\to$subcortical / subcortical$\to$cortical | DCM on fMRI | Top-down (M-control) vs bottom-up (G-drives); the ratio of effective connectivities = proxy for Balance (Friston et al., 2003, NeuroImage) |
| $I_{intero}$ (in $CL_{full}$) | HEP (Heartbeat Evoked Potential) + insula BOLD | EEG + fMRI | HEP is an electrophysiological marker of interoceptive processing; anterior insula is the interoception hub (Critchley & Harrison, 2013, Neuron) |
Layer-by-layer verification of $A_{SMC}$ via DMN:
| SMC level | Neurosubstrate | fMRI metric | BMC prediction |
|---|---|---|---|
| L1 (body, basic self) | Posterior DMN (PCC, precuneus) + insula | BOLD + FC of posterior DMN | Preserved during FA meditation, disappears under deep anesthesia |
| L2 (metacognition, ToM) | Anterior DMN (mPFC) + TPJ | BOLD + FC of anterior DMN | First to disappear under anesthesia, reduced during flow |
| Recursion depth | PCC$\leftrightarrow$mPFC connectivity | Transfer entropy within DMN | Maximal during OM meditation, minimal in coma |
Composite proxy index (unique BMC prediction):
$$CL_{proxy} \approx PCI \cdot DMN_{SMC} \cdot f(EC_{ratio})$$where $DMN_{SMC}$ is the normalized activity of DMN nodes, $EC_{ratio}$ is the ratio of top-down to bottom-up effective connectivities. Prediction: $CL_{proxy}$ correlates with subjective consciousness scales (PCS, ASC) more strongly than any single component. PCI alone $\approx$ IIT’s prediction; DMN alone $\approx$ HOT/AST — each captures one aspect; BMC asserts that consciousness = the product of integration, self-model, and balance. Testable with standard methods: simultaneous TMS-EEG + fMRI + subjective reports.
Limitation of the proxy approach: The mapping CL $\to$ {PCI, DMN, EC} is an approximation, not an identity. The memplex graph does not coincide with the functional connectome: a meme $\neq$ an ROI, meme activation $\neq$ BOLD signal. Proxies allow ranking states by CL and verifying the direction of predictions ($\uparrow$/$\downarrow$), but absolute CL(t) values are computable only within the model.
Criticality: $\sigma \approx 1$ as operating point
The parameter $\sigma_{SW}$ is related to network criticality. At the critical point ($\sigma \approx 1$), the network optimally balances integration and differentiation:
- $\sigma < 1$ (subcriticality): memes cannot spread $\to$ reduced consciousness
- $\sigma \approx 1$ (criticality): optimal regime $\to$ normal wakefulness
- $\sigma > 1$ (supercriticality): avalanche-like propagation $\to$ altered states
Neurophysical basis of criticality: The branching ratio $\sigma$ (mean number of offspring neurons per active neuron) is a direct analog of our spreading activation regime (Beggs & Plenz, 2003, J. Neuroscience). At $\sigma = 1$, neuronal avalanches (activation cascades) follow a power-law size distribution $P(x) \propto x^{-\gamma}$, which coincides with the heavy-tailed degree distribution in our memplex. The E/I (excitation/inhibition) balance determines $\sigma$: enhanced inhibition (GABA) $\to$ $\sigma < 1$, weakened inhibition $\to$ $\sigma > 1$; this is pharmacologically confirmed (Shew et al., 2011, J. Neuroscience). In BMC terms: $\sigma < 1$ (depression, memes “offline”), $\sigma > 1$ (mania, panic — uncontrolled activation), $\sigma \approx 1$ = healthy balance of lateral inhibition.
Bistability and hysteresis: The memplex is a bistable system: the same information can lead to different stable states depending on history (Izhikevich, 2007, Dynamical Systems in Neuroscience). Hysteresis explains rigidity: returning the memplex from a new state requires more effort than the transition into it — deconversion is harder than conversion. Bifurcations (saddle-node, Andronov-Hopf) describe worldview collapse: smooth parameter change (stress, informational pressure) can cause a qualitative jump — the old equilibrium disappears (Strogatz, 2015, Nonlinear Dynamics and Chaos). This is the mathematical foundation of “the straw that breaks the camel’s back”: subthreshold tension accumulation $\to$ sudden memplex restructuring.
Map of Consciousness States
| State | $\sigma_{SW}$ | $A_{SMC}$ | $Balance$ | $CL$ | Neural correlate |
|---|---|---|---|---|---|
| Wakefulness | $\approx$ 1.0 | High | 1.0–2.0 | High | Desynchronized EEG, high PCI |
| NREM sleep | < 0.5 | Low | — | Low | Slow waves, low PCI |
| REM sleep | $\approx$ 0.8 | Medium | G > M | Medium | Desynchronization without PFC control |
| Coma | $\approx$ 0 | $\approx$ 0 | — | $\approx$ 0 | No integration, PCI $\approx$ 0 |
| Anesthesia | < 0.3 | Low | — | Low | Suppression of cortical connectivity |
| Psychedelics | > 1.5 | High | Unstable | Altered | Entropy $\uparrow$, ego dissolution |
| Meditation | $\approx$ 1.0 | Modified | M » G | Modified | Altered DMN pattern |
| Flow | $\approx$ 1.0 | Reduced (transparent) | Optimal | Paradoxical | Hypofrontality, DMN $\downarrow$ |
| Lucid dreaming | $\approx$ 0.8 | Partially reactivated | G > M, M $\uparrow$ | Medium-high | 40 Hz frontal activity in REM |
| Dissociation | $\approx$ 1.0 | Formally active | I-layer disrupted | Formally high | Insula deafferentation |
Degenerate case BMC_G = (G, $\emptyset$, $\emptyset$, S). When $M \to \emptyset$, the CL formula reduces: Balance $\to$ 0, $f(0) \approx 0.10$, $CL \approx 0.10 \cdot \sigma_{SW} \cdot A_{proto-SMC}$. Real organisms lie on a continuum of M/G (chimpanzee: M < G, but M > 0; human: M $\geq$ G). BMC_G $\approx$ IIT ($\sigma_{SW}$) + proto-AST ($A_{proto-SMC}$): BMC subsumes simpler theories as special cases.
Non-trivial predictions of BMC_G (absent from IIT and AST):
- G-1 (Ceiling): $CL_G \leq f(0) \cdot \max(\sigma_{SW}) \cdot \max(A_{proto-SMC}) \approx 0.10 \cdot \sigma^* \cdot A^*$. G-only consciousness has a hard upper bound; IIT does not predict a ceiling for $\Phi$. Test: PCI of large-brained G-only animals $\to$ plateau.
- G-2 (Phase transition): When $M: 0 \to \varepsilon$, $f(Balance)$ undergoes a discontinuity: $f(0) = 0.10$, $f(\varepsilon / A_G) > 0.10$, and simultaneously proto-immune system and proto-SIT mechanisms are activated. The result is a nonlinear CL jump upon the appearance of proto-memes. IIT and AST predict monotonic growth. Test: PCI of crows (proto-M > 0) » PCI of pigeons (proto-M $\approx$ 0) at comparable brain size. Operational criterion: proto-M > 0 $\equiv$ $\exists$ at least one of: (a) documented social learning of novel behavior (Hoppitt & Laland 2013), (b) tool manufacture (Shumaker et al. 2011), (c) cultural variation between populations (Whiten et al. 1999). Detailed species table — EMT, Part XVI.
- G-3 (Topology-first): $CL_G \propto \sigma_{SW}$, not $\propto |N|$ (number of neurons). Organisms with convergent neural network topology should demonstrate comparable $CL_G$ regardless of phylogenesis. Test: PCI of octopus $\approx$ PCI of crow at comparable $\sigma_{SW}$.
Detailed formalization and mechanism table — EMT, Part XVI.
Prediction G-4: M » G as a necessary condition for reflection (formalized). Define recursion levels: $SMC^{(1)} = \{m \in V_m : target(m) \in V_u \cup I\}$ (narrative self), $SMC^{(k)} = \{m \in V_m : target(m) \in SMC^{(k-1)}\}$ (meta-memes). Theorem: $|SMC^{(2)}| > 0$ requires $|V_m| \geq (\alpha + \beta + \gamma\beta) \cdot |V_u|$, where $\alpha \geq 1$ (environment modeling), $\beta \geq 1$ (L1 self-model), $\gamma > 0$ (L2 meta-memes). Lower bound: M/G $\geq$ 3; in real systems $\alpha \gg 1$, therefore M/G $\gg$ 3. Threshold estimate: $M/G_{crit} \sim \mathcal{O}(10)$ — metacognition appears at proxy M/G $\sim$ 20–50 (macaques), is absent at M/G < 1 (pigeons). $M/G_{crit}$ depends on $\alpha$ (environmental complexity). Spectrum: M/G $\approx$ 0 $\to$ L0, M/G < 1 $\to$ L1, M/G $\sim$ 1 $\to$ proto-L2 (chimpanzees), M/G » 1 $\to$ full reflection (humans). IIT, AST, GWT do not distinguish M and G — they cannot formulate a threshold condition. Detailed formalization, threshold table, and re-analysis #3 — EMT, Part XVI.
Relationship to EMT definition. The flat definition $SMC = \{m \in M : target(m) \in M \cup G \cup I\}$ (EMT) is equivalent to $SMC = \bigcup_{k \geq 1} SMC^{(k)}$ — the union of all levels of the recursive hierarchy.
CL ≈ 0"] --> NREM["NREM
CL = 0.1"] NREM --> REM["REM
CL = 0.4"] REM --> WAKE["Wakefulness
CL = 0.8"] REM --> LUCID["Lucid
dreaming
CL = 0.55"] WAKE --> FLOW["Flow
CL = 0.7"] WAKE --> PSYCH["Psychedelics
CL = ???"] WAKE --> MEDIT["Meditation
CL = 0.85"] WAKE --> DISSOC["Dissociation
CL_full = ???"] style COMA fill:#95a5a6 style NREM fill:#3498db style REM fill:#9b59b6 style WAKE fill:#27ae60 style FLOW fill:#f1c40f style PSYCH fill:#e74c3c style MEDIT fill:#2ecc71 style LUCID fill:#8e44ad style DISSOC fill:#e67e22
Detailed Altered States: Component-Wise CL Analysis
The table above provides an overview map. Below is a component-wise analysis of each altered state with model predictions and empirical anchors.
1. Psychedelics: $\sigma > 1$, but CL is non-monotonic
Components:
- $\sigma_{SW}$: significantly elevated (> 1.5). Connectome entropy increases (Carhart-Harris et al., 2014); homological scaffolds are restructured (Petri et al., 2014, J. Royal Society Interface). In BMC terms: lateral inhibition is weakened $\to$ memes from different clusters are activated simultaneously.
- $A_{SMC}$: non-monotonic. Low doses $\to$ hyperreflection ($A_{SMC}$ $\uparrow$, “everything is about me”). High doses $\to$ ego dissolution ($A_{SMC} \to 0$, the “I” dissolves). The transition is not gradual but a bifurcation: the threshold depends on the baseline rigidity of the memplex.
- $Balance$: unstable. The G-system (emotions, bodily signals) is amplified, the M-system is restructured $\to$ Balance oscillations.
- $f(Balance)$: with unstable Balance, the function oscillates $\to$ CL “flickers.”
Model prediction (unique to BMC): CL under psychedelics follows an inverted U-curve by dose:
- Low dose: $\sigma$ $\uparrow$ moderately, $A_{SMC}$ $\uparrow$ $\to$ CL $\uparrow$
- Medium dose: $\sigma$ $\uparrow\uparrow$, $A_{SMC}$ begins to oscillate $\to$ CL unstable
- High dose: $\sigma$ $\uparrow\uparrow\uparrow$, $A_{SMC} \to 0$ (ego dissolution) $\to$ CL $\downarrow$ despite high $\sigma$
This distinguishes BMC from theories predicting monotonic increase of consciousness with dose. IIT would predict $\phi$ $\uparrow$ (more integration = more consciousness); BMC predicts: without a self-model there is no subject of experience, therefore CL drops.
Sources: Carhart-Harris & Friston (2019). “REBUS and the anarchic brain”. Pharmacological Reviews; Petri et al. (2014). “Homological scaffolds of brain functional networks”. J. Royal Society Interface; Tagliazucchi et al. (2016). “Increased global functional connectivity correlates with LSD-induced ego dissolution”. Current Biology.
2. Meditation: two modes, two CL profiles
Focused attention (FA, samatha):
- $\sigma_{SW}$: $\approx$ 1.0 (unchanged). The network remains in normal regime.
- $A_{SMC}$: selectively reduced. The practitioner suppresses mind-wandering (DMN $\downarrow$) but preserves meta-awareness “I am meditating” $\to$ $A_{SMC}$ is partially active but narrowed to monitoring.
- $Balance$: M > G (voluntary control over automatic processes).
- $CL$: $\approx$ normal or slightly above (enhanced clarity without expansion).
Open monitoring (OM, vipassana):
- $\sigma_{SW}$: $\approx$ 1.0–1.1 (slight elevation). Cross-cluster connections are strengthened.
- $A_{SMC}$: modified: neither suppressed nor amplified, but the pattern is changed. DMN is reconfigured: narrative self $\downarrow$, minimal self $\uparrow$ (Brewer et al., 2011, PNAS).
- $Balance$: M » G (deep voluntary control).
- $CL$: modified — formally close to normal, but qualitatively different (less narrative “I,” more pure observation).
Prediction: FA and OM are distinguishable by $A_{SMC}$ pattern at the same $\sigma_{SW}$. Testable: fMRI DMN patterns in FA vs OM practitioners at identical overall connectivity.
Sources: Lutz et al. (2008). “Attention regulation and monitoring in meditation”. Trends in Cognitive Sciences; Fox et al. (2016). “Functional neuroanatomy of meditation”. Neuroscience & Biobehavioral Reviews; Brewer et al. (2011). “Meditation experience is associated with differences in default mode network activity and connectivity”. PNAS.
3. Anesthesia: graded CL shutdown
| Stage | $\sigma_{SW}$ | $A_{SMC}$ | CL | Clinical presentation |
|---|---|---|---|---|
| Sedation | 0.7–0.9 | Partial | 0.3–0.5 | Drowsiness, slowed reactions |
| Loss of consciousness | 0.3–0.5 | Residual (bodily) | 0.05–0.1 | No response to verbal commands |
| Deep anesthesia | < 0.3 | $\approx$ 0 | $\approx$ 0 | No response to pain, burst-suppression EEG |
Key mechanism: Anesthetics (propofol, sevoflurane) primarily suppress $\sigma_{SW}$ through enhanced GABA inhibition $\to$ $\sigma < 1$ $\to$ subcritical regime $\to$ memes cannot spread. $A_{SMC}$ fades secondarily: without spreading activation, the self-model loses input.
Prediction (unique to BMC): Loss of consciousness under anesthesia occurs in steps, not gradually:
- First, level 2 recursion disappears (metacognition: “I know that I know”)
- Then level 1 recursion (phenomenal consciousness: the feeling of “something”)
- Last — level 0 (basic reflexes)
Each level is a separate attractor with a bifurcation threshold. Between them lies hysteresis: the dose for falling asleep $\neq$ the dose for awakening (empirically confirmed: Friedman et al., 2010, Anesthesiology).
Sources: Casali et al. (2013). “A theoretically based index of consciousness”. Science Translational Medicine; Mashour & Hudetz (2018). “Neural correlates of unconsciousness in large-scale brain networks”. Trends in Neurosciences.
4. Flow: the paradox of peak performance at reduced CL
Components:
- $\sigma_{SW}$: $\approx$ 1.0 (optimal regime, criticality).
- $A_{SMC}$: reduced (transparent self-model). The athlete does not think “I am running” — they simply run. DMN $\downarrow$, dorsolateral PFC $\downarrow$ (transient hypofrontality: Dietrich, 2004, Consciousness and Cognition).
- $Balance$: optimal ($\approx$ 1.5). The skill is automatized ($habit > \theta_{habit}$, $Auto(S)$ — see Automatization), task difficulty $\approx$ skill level (M-level).
- $f(Balance)$: maximum $\to$ amplifying multiplier.
Paradox: $CL = \sigma \cdot A_{SMC} \cdot f(Balance)$. At $\sigma \approx 1.0$ and $f(Balance) = max$, but $A_{SMC}$ $\downarrow$ $\to$ CL is lower than during ordinary wakefulness. Yet subjectively, flow is described as a peak experience.
Resolution of the paradox (BMC’s theoretical position): CL measures self-awareness as a subject, not quality of performance. Flow is a state in which the self-model becomes transparent: the subject does not observe themselves but acts directly. This is not a deficit of consciousness but its optimal regime for action. Proposed refinement: distinguish $CL_{reflexive}$ (includes $A_{SMC}$) and $CL_{operative}$ (= $\sigma \cdot f(Balance)$, without $A_{SMC}$). Flow = low $CL_{reflexive}$, high $CL_{operative}$.
Formalization of the flow channel via SIT:
Csikszentmihalyi’s flow channel (challenge $\leftrightarrow$ skill) is formalized through SIT:
$$SIT_{optimal} = \{SIT \mid SIT_{bore} < SIT < SIT_{anxiety}\}$$where:
- $SIT_{bore} = \epsilon$ (minimal structural deficit — no gaps $\to$ boredom)
- $SIT_{anxiety} = SIT_{bore} + \Delta_{LP}$, where $\Delta_{LP}$ is the range in which LP is predicted to be > 0
Flow readiness formula:
$$F_{ready} = (1 - \hat{SIT}) \cdot \sigma_{SW} \cdot (1 - \hat{A}_{SMC}) \cdot \hat{LP}$$where $\hat{x} = \frac{x - x_{min}}{x_{max} - x_{min}}$ are normalized values. $F_{ready} \to 1$ when: SIT is at the optimum (neither 0 nor max), $\sigma \approx 1$, $A_{SMC}$ is minimal, LP is maximal.
Testable prediction: The correlation of $F_{ready}$ with the Flow State Scale (FSS) should be higher than the correlation of any single component. This allows quantitative operationalization of the flow channel.
Sources: Csikszentmihalyi (1990). Flow: The Psychology of Optimal Experience; Dietrich (2004). “Neurocognitive mechanisms underlying the experience of flow”. Consciousness and Cognition; Ulrich et al. (2014). “Neural correlates of experimentally induced flow experiences”. NeuroImage.
5. Lucid dreaming: partial SMC reactivation in REM
Components:
- $\sigma_{SW}$: $\approx$ 0.8 (as in normal REM — reduced connectivity).
- $A_{SMC}$: partially reactivated. The dreamer is aware that “I am dreaming” $\to$ SMC level 1 (phenomenal) is active, sometimes level 2 (metacognition: “I am controlling the dream”).
- $Balance$: G > M (as in REM), but M is partially restored (frontal activation).
- $f(Balance)$: elevated compared to normal REM.
Prediction: $CL_{lucid} > CL_{REM}$ at the same $\sigma_{SW}$, due to $A_{SMC}$ $\uparrow$. Quantitatively: $CL_{lucid} \approx 0.5$–$0.6$ vs $CL_{REM} \approx 0.3$–$0.4$.
Testable prediction: PCI(lucid dreaming) > PCI(normal REM). Verifiable via TMS-EEG in trained lucid dreamers with confirmation through LRLR (Left-Right eye signals).
Neurosubstrate: Frontal 40 Hz activity during lucid dreaming (Voss et al., 2009, Sleep) — in BMC terms: reactivation of frontal SMC nodes over the “background” REM regime.
Sources: Voss et al. (2009). “Lucid dreaming: a state of consciousness with features of both waking and non-lucid dreaming”. Sleep; Voss et al. (2014). “Induction of self awareness in dreams through frontal low current stimulation”. Nature Neuroscience; Baird et al. (2019). “The cognitive neuroscience of lucid dreaming”. Neuroscience & Biobehavioral Reviews.
6. Dissociative states: formally high CL, subjectively “empty”
Types: depersonalization (DP), derealization (DR), dissociative identity disorder (DID).
Components:
- $\sigma_{SW}$: $\approx$ 1.0 (network structurally intact).
- $A_{SMC}$: formally active — the patient is self-aware and can describe their state.
- $Balance$: not disrupted through G/M imbalance.
- Key problem: the CL formula yields a high value ($\sigma$ normal, $A_{SMC}$ normal, Balance normal) — yet subjective experience is described as “unreal,” “observing from the outside,” “I am a robot.”
Diagnostic challenge for the model: Dissociation exposes a limitation of the formula $CL = \sigma \cdot A_{SMC} \cdot f(Balance)$. What is disrupted?
BMC’s answer: The I-layer (immune system) is disrupted — the connection between G-signals (emotions, bodily sensations) and SMC. Depersonalization = emotional deafferentation of the SMC: the self-model is active but does not receive emotional coloring from the G-layer.
Formalization: we introduce a correction coefficient for interoceptive integration:
$$CL_{full}(t) = \sigma_{SW}(t) \cdot A_{SMC}(t) \cdot f(Balance(t)) \cdot I_{intero}(t)$$where $I_{intero}$ is the degree of connectivity between G-signals and SMC:
$$I_{intero} = \frac{\sum_{g \in G,\; s \in SMC} w_{gs} \cdot a_g}{\sum_{g \in G} a_g + \epsilon}$$In depersonalization: $w_{gs} \to 0$ (insula disconnected) $\to$ $I_{intero} \to 0$ $\to$ $CL_{full} \to 0$ despite formally normal $\sigma$, $A_{SMC}$, Balance.
Prediction: In patients with depersonalization:
- PCI $\approx$ normal ($\sigma_{SW}$ is not disrupted)
- fMRI: insula $\downarrow$, somatosensory cortex $\to$ mPFC connectivity $\downarrow$
- Subjective consciousness scales $\downarrow$ (CDS, DES)
- Dissociation between “cognitive” CL (high) and “full” $CL_{full}$ (low)
Sources: Sierra & Berrios (1998). “Depersonalization: neurobiological perspectives”. Biological Psychiatry; Medford et al. (2005). “Emotional experience and awareness of self: functional MRI studies of depersonalization”. Frontiers in Psychology; Simeon et al. (2000). “Feeling unreal: a PET study of depersonalization disorder”. American Journal of Psychiatry.
7. Unified parameter space for consciousness disorders
All consciousness disorders are described as points in the space $(\sigma_{SW}, A_{SMC}, Balance, I_{intero}, Q, SIT, H)$:
| Disorder | $\sigma_{SW}$ | $A_{SMC}$ | $I_{intero}$ | Q | SIT | H (entropy) |
|---|---|---|---|---|---|---|
| Normal | $\approx$ 1.0 | 0.3–0.7 | $\approx$ 1.0 | medium | > 0 | high |
| Depression | < 1 | high (rumination) | normal | high | high, LP=0 | low |
| ADHD | > 1 | unstable | normal | low | unstable | high (chaotic) |
| Autism | $\approx$ 1 | atypical | reduced (social) | high (hyperlocal) | specific | low |
| Schizophrenia | < 1 | fragmented | reduced | high (disintegration) | unregulated | low |
| DID | $\approx$ 1 (within) | multiple | context-dependent | max (isolation) | suppressed | bimodal |
| OCD | $\approx$ 1 | high | hyperactive | medium | one gap $\uparrow\uparrow$ | medium |
| Flow | $\approx$ 1 | minimal | normal | medium | optimal | medium |
| Radicalization | $\approx$ 1 | medium | shifted | high | $\to$ 0 | minimal |
Formalization of distance:
$$d(P, N) = \left\| \frac{\mathbf{x}_P - \mathbf{x}_N}{\boldsymbol{\sigma}_N} \right\|_2$$where $\mathbf{x}_P$ is the parameter vector in pathology, $\mathbf{x}_N$ is the parameter vector in normal state, and $\boldsymbol{\sigma}_N$ is the standard deviations of the normal state. Comorbidity is predictable:
$$P(\text{comorbid}(A, B)) \propto \frac{1}{d(A, B)}$$Disorders with close profiles in BMC space are more often comorbid (ADHD + depression: both deviate in $\sigma_{SW}$; OCD + autism: both deviate in I-parameter).
Testable prediction: The comorbidity matrix (epidemiological data) correlates with the BMC distance matrix. This is formalizable and verifiable on existing data (NCS-R, UK Biobank). For a falsifiable test: numerical parameter values must be fixed before verification against comorbidity data (pre-registration).
Distinction from RDoC/HiTOP: RDoC (NIMH, 2010) and HiTOP use a dimensional approach but without a unified causal mechanism and without a formal comorbidity metric $d(P,N)$. BMC complements them with a dynamic model.
Neurosubstrate: The neuromechanics of each disorder — see BM. Conceptual taxonomy — see EMT, Part XXII.
Relationship with PCI (Perturbational Complexity Index)
PCI (Massimini et al.) measures the complexity of cortical response to TMS stimulation. In our model:
$$PCI \propto \sigma_{SW} \cdot Connectivity$$PCI and CL are not identical: PCI measures the network’s potential for integration (one component of CL), while CL measures the current state accounting for SMC and Balance. PCI is the best available proxy for $\sigma_{SW}$, but not for CL as a whole (see the proxy metrics table above). This is precisely why BMC predicts that the composite index $CL_{proxy}$ outperforms PCI alone: states with identical PCI (e.g., wakefulness vs depersonalization) differ in CL due to $A_{SMC}$ and $I_{intero}$.
Predictions
| Prediction | Test |
|---|---|
| CL correlates with PCI ($r > 0.7$) for standard states (wakefulness, sleep, coma) | Simultaneous PCI measurement and CL computation from semantic networks |
| Under psychedelics, CL follows an inverted U-curve by dose (non-monotonic) | CL computation vs subjective reports at different psilocybin doses |
| FA and OM meditation are distinguishable by $A_{SMC}$ pattern at the same $\sigma_{SW}$ | fMRI: DMN pattern in samatha vs vipassana practitioners |
| Anesthesia suppresses CL in steps: level 2 $\to$ 1 $\to$ 0 with hysteresis | TMS-EEG under different propofol doses; dose for falling asleep $\neq$ dose for awakening |
| Flow: high $CL_{operative}$ with low $CL_{reflexive}$ | fMRI: DMN $\downarrow$ + task-positive $\uparrow$ during flow vs normal activity |
| $PCI(lucid\;dreaming) > PCI(normal\;REM)$ | TMS-EEG in trained lucid dreamers with LRLR confirmation |
| Depersonalization: PCI $\approx$ normal, but insula $\to$ mPFC connectivity $\downarrow$ | fMRI: PCI + ROI analysis in patients with DP vs control |
| $CL_{full}$ (with $I_{intero}$ correction) predicts subjective scales better than CL without correction | Correlation of CL vs $CL_{full}$ with DES/CDS in patients with dissociation |
| $CL_{proxy} = PCI \cdot DMN_{SMC} \cdot f(EC_{ratio})$ correlates with subjective scales more strongly than PCI alone or DMN alone | Simultaneous TMS-EEG + fMRI + subjective scales (PCS, ASC); comparison of $r^2$ for $CL_{proxy}$ vs PCI vs DMN |
| HEP amplitude distinguishes wakefulness from depersonalization at the same PCI | EEG: HEP in patients with DP vs control, controlling for PCI |
Sources: Massimini et al. (2005). “Breakdown of cortical effective connectivity during sleep”. Science; Carhart-Harris et al. (2014). “The entropic brain”. Frontiers in Human Neuroscience; Carhart-Harris & Friston (2019). “REBUS and the anarchic brain”. Pharmacological Reviews; Petri et al. (2014). J. Royal Society Interface; Lutz et al. (2008). Trends in Cognitive Sciences; Casali et al. (2013). Science Translational Medicine; Dietrich (2004). Consciousness and Cognition; Voss et al. (2009). Sleep; Voss et al. (2014). Nature Neuroscience; Sierra & Berrios (1998). Biological Psychiatry; COGITATE Consortium (2025). Nature.
Part XIV. Triple Binding — Unity of Conscious Experience
Problem: why is experience unified?
Multiple memes are active simultaneously: visual, auditory, emotional, semantic. Yet subjective experience is unified: we see a red cup of hot coffee as a single whole, not as a set of disconnected features. How does a network of memes give rise to unity?
Three Binding Mechanisms
The unity of conscious experience is ensured by three independent but complementary mechanisms (see EMT, Part XVI):
1. Structural binding: ensemble overlap
Two memes are structurally bound if their cell assemblies share common neurons:
$$Binding_{struct}(m_i, m_j) = |ens_i \cap ens_j| > 0$$The greater the overlap, the stronger the bond. Activating one meme automatically partially activates the other through shared neurons.
Network analog: Shared neurons = a shared connection with high weight. Structural binding = the presence of an edge $w_{ij} > \theta_{bind}$.
2. Temporal binding: synchronization within the theta window
Two memes are considered “bound in time” if their peak activations fall within the same theta window (~125 ms):
$$Binding_{temp}(m_i, m_j) = \mathbb{1}\left[|\phi_\theta(m_i) - \phi_\theta(m_j)| < \delta\right]$$where $\phi_\theta$ is the theta rhythm phase (4–8 Hz), and $\delta$ is the tolerance (~30 ms).
Network analog: Synchronous activation = both nodes $a_i(t) > \theta$ and $a_j(t) > \theta$ within the same discrete step $t$.
Neurosubstrate: The hippocampal theta rhythm as the “substrate clock” — groups activations into discrete packets. Memes that fall within the same theta cycle are perceived as part of the same experience (see BM, Part III).
Relationship to event segmentation: Temporal binding operates within an episode — synchronizing components of a single moment. Event segmentation (see Part VIII: Episodic Memory) operates on a different scale — determining boundaries between episodes through PE jumps. Two levels of discretization: theta cycle (~125 ms) within a moment, event boundary (seconds to minutes) between episodes.
3. Competitive (Bayesian) binding: filtration through WM
Active working memory admits only a coherent coalition of memes simultaneously. Memes with high dissonance are evicted:
$$Binding_{comp}(Coalition) = \mathbb{1}\left[\max_{i,j \in Coalition} D(m_i, m_j) < \theta_D\right]$$If $D > \theta_D$ for a pair in the coalition $\to$ one of the memes is evicted through lateral inhibition. The coherence constraint applies only to Active WM ($k_{active} \approx 3$–$4$ pointers); Latent WM (memes with $\psi_i > \theta_\psi$, not in the focus of attention) can contain contradictory memes without conflict — they are not in consciousness. Upon reactivation (pinging) of a latent meme, it enters Active WM and undergoes competitive binding (see Activity-Silent WM).
Network analog: Softmax normalization + lateral inhibition (see Part VIII). Active WM = an attractor in which only a coherent configuration is stable.
Full Binding Formula
$$Binding(m_i, m_j) = Binding_{struct} \cdot Binding_{temp} \cdot Binding_{comp}$$All three components are necessary. Binding = absence of dissonance in WM given the presence of structural and temporal connection. The integrity of experience is a result of filtration, not a given.
What Triple Binding Explains
| Phenomenon | Which binding is disrupted | Mechanism |
|---|---|---|
| Unity in the normal state | All three are working | Coherent coalition in WM, synchronous activation, shared neurons |
| Schizophrenia | Structural (bridges destroyed) | SMC fragmented $\to$ several competing coalitions (see Part VII) |
| Ego dissolution (psychedelics) | Competitive (filter weakened) | $\theta_D$ lowered $\to$ contradictory memes are admitted into WM $\to$ “I” boundaries dissolve |
| Dream illogicality | Competitive (PFC offline) | Lateral inhibition weakened $\to$ incompatible memes coexist without conflict |
| Synesthesia | Structural (anomalous overlap) | Excessive shared neurons between modalities $\to$ cross-modal binding |
| Binding in aphasia | Temporal + structural | Loss of linguistic “superhubs” $\to$ memplex fragmentation (see EMT, Part XX) |
shared neurons"] --> UNITY["Unity of experience"] TEMP["Temporal
theta window"] --> UNITY COMP["Competitive
WM filter"] --> UNITY end subgraph FAILURES["Disruptions"] UNITY -->|struct ↓| SCHIZO["Schizophrenia"] UNITY -->|comp ↓| PSYCH["Ego dissolution"] UNITY -->|comp ↓, PFC off| DREAM["Dream illogicality"] UNITY -->|struct ↑ anomalously| SYNEST["Synesthesia"] end style STRUCT fill:#3498db style TEMP fill:#9b59b6 style COMP fill:#f39c12 style UNITY fill:#27ae60
Predictions
| Prediction | Test |
|---|---|
| Binding correlates with theta-phase coherence ($r > 0.5$) | EEG: theta coherence vs subjective integrity of experience |
| Under psychedelics, $\theta_D$ decreases $\to$ more “incompatible” elements in WM | Semantic priming: reduced latent inhibition under psilocybin |
| Aphasia (loss of linguistic hubs) reduces binding more than loss of peripheral memes | Neuropsychological testing: integrity of experience in Broca’s vs in amnesia |
| Theta rhythm is a necessary condition for binding: suppression of theta $\to$ fragmentation | Optogenetic suppression of theta in rodents $\to$ integration deficit (already demonstrated) |
Sources: Fries P. (2005). “A mechanism for cognitive dynamics: neuronal communication through neuronal coherence”. Trends in Cognitive Sciences; Lamme V.A.F. (2006). “Towards a true neural stance on consciousness”. Trends in Cognitive Sciences; Lisman J. & Jensen O. (2013). “The theta-gamma neural code”. Neuron.
Part XV. Diffusion Engine and Embedding Space
Part VIII formalizes the Graph Engine — propagation of activation along edges of the graph $G = (V, E)$. However, biological networks use two parallel transmission mechanisms: synaptic (discrete, along connections) and volume (volume transmission — diffusion of neurotransmitters into the extracellular space, without addressing a specific synapse). The second mechanism influences semantically proximate nodes without a direct edge between them. Formalizing this mechanism requires introducing an embedding space and a Diffusion Engine. Architectural integration of the three engines is described in AGI_F, Part VII.
Modulation Vector Mod(t)
The third component of the architecture is the Modulation Engine: a global state that modulates all Graph Engine and Diffusion Engine operations in each computational cycle. Formally:
$$\text{Mod}(t) = \bigl(\lambda_{lr},\; \theta_{act},\; \lambda_{speed},\; \lambda_{plast},\; \lambda_{inh},\; \lambda_{soc},\; \lambda_{noise},\; \lambda_{diff}\bigr)(t)$$| Parameter | What it modulates |
|---|---|
| $\lambda_{lr}$ | Rate of weight change $w_{ij}$ (LTP/LTD rate) |
| $\theta_{act}$ | Meme activation threshold |
| $\lambda_{speed}$ | Number of computational sub-cycles per step (cortical arousal) |
| $\lambda_{plast}$ | Rate of $\kappa$-transitions (consolidation) |
| $\lambda_{inh}$ | Strength of I-filtration |
| $\lambda_{soc}$ | Weight of social edges |
| $\lambda_{noise}$ | Stochasticity of activation |
| $\lambda_{diff}$ | Strength of Diffusion Engine (proximity-based spreading) |
Each component is a function of active G-programs:
$$\lambda_p(t) = \lambda_p^{base} \cdot \prod_{g \in G} \bigl(1 + \alpha_{p,g} \cdot a_g(t)\bigr)$$Full G $\to$ Mod table (modulation direction for each G-program) — see AGI_F, Part VII. Key examples: FEAR $\downarrow\downarrow$ $\theta_{act}$ (hypervigilance) + $\uparrow\uparrow$ $\lambda_{speed}$ (accelerated processing); PLAY $\uparrow\uparrow$ $\lambda_{noise}$ (stochastic combinatorics) + $\uparrow$ $\lambda_{inh}$ (controlled chaos); CARE $\uparrow\uparrow$ $\lambda_{soc}$ (oxytocin).
Embedding Space
Definition 14.1. The embedding space is $\mathbb{R}^d$, where $d = \dim(\mathbf{f})$, $\mathbf{f}$ is the feature vector extracted from a sensory signal by a G-level module (feature extraction, hardwired). Each meme $m_i \in V$ possesses a coordinate (embedding) $\mathbf{e}_i \in \mathbb{R}^d$.
Birth. Upon meme creation (Path 1 or Path 2 of memogenesis, see AGI_F, Part VII):
$$\mathbf{e}_i = \mathbf{f}(t_{birth})$$The embedding is fixed at the moment of birth — the meme “inherits” its coordinate from the generating signal.
Drift. Reconsolidation (Part VIII, Reconsolidation section) and recombination (BLEND) shift the embedding:
$$\mathbf{e}_i(t) = \mathbf{e}_i(0) + \Delta\mathbf{e}_i(t)$$where $\Delta\mathbf{e}_i$ is the cumulative shift from reconsolidation and BLEND. Drift is bounded: $\|\Delta\mathbf{e}_i\| \leq r_{drift}$, otherwise the meme loses its identity (it semantically becomes a different meme). Biological analog: memory distortion during reconsolidation (Nader et al., 2000).
Definition 14.2. Proximity of two memes in embedding space:
$$\text{prox}(i,j) = \exp\!\left(-\frac{\|\mathbf{e}_i - \mathbf{e}_j\|^2}{2\sigma_{emb}^2}\right)$$where $\sigma_{emb}$ is the proximity scale (analog of the volume transmission radius). When $\|\mathbf{e}_i - \mathbf{e}_j\| \to 0$: $\text{prox} \to 1$; when $\|\mathbf{e}_i - \mathbf{e}_j\| \gg \sigma_{emb}$: $\text{prox} \to 0$.
Key distinction: proximity $\neq$ edge weight.
| Proximity $\text{prox}(i,j)$ | Edge weight $w_{ij}$ | |
|---|---|---|
| Determined by | Distance in $\mathbb{R}^d$ | Associative learning (Hebbian) |
| Exists | For any pair of memes | Only if $(i,j) \in E$ |
| Character | Latent (hidden) similarity | Established (explicit) connection |
| Can be negative | No ($\text{prox} \geq 0$) | Yes ($w \in [-1,+1]$) |
| Dynamics | Slow embedding drift | Fast Hebbian learning |
Proximity is a latent structure describing semantic similarity; the graph is an explicit structure describing established associations. Semantically close memes ($\text{prox} \gg 0$) may have no edge (not yet connected); distant memes ($\text{prox} \approx 0$) may have a strong edge (contextual association, episodic binding). Diffusion operates on the former; spreading activation (Part VIII) operates on the latter.
Diffusion Engine
Definition 14.3. Diffusion activation of meme $m_j$ at time $t$:
$$a_j^{diff}(t) = \lambda_{diff}(t) \cdot \sum_{\substack{i:\,(i,j) \notin E \\ i \in \text{top-}K_j}} a_i(t) \cdot \text{prox}(i,j)$$where:
- $\lambda_{diff}(t) \in [0, 1]$ is the modulatory diffusion coefficient (set by the Modulation Engine through G-programs; $\lambda_{diff} \neq \lambda_{noise}$: diffusion is deterministic proximity-based transmission, noise is stochastic perturbation),
- summation is over memes without a direct edge to $j$ ($(i,j) \notin E$), restricted to the top-$K$ nearest in embedding space,
- $K$ is the truncation parameter ($K \sim 10$–$50$), ensuring computational efficiency.
Complexity. Full computation: $O(|V|^2)$. With top-$K$ truncation: $O(|V| \cdot K)$. Using approximate nearest neighbor (ANN) structures to maintain top-$K$: $O(|V| \cdot K \cdot \log |V|)$ for updates after embedding drift.
Three Roles of the Diffusion Engine
Role 1: Priming. An active meme “warms up” semantic neighbors without an edge, facilitating their subsequent activation through the Graph Engine. This explains experimental data on semantic priming (Meyer & Schvaneveldt, 1971) for words without a direct association: “doctor” $\to$ “nurse” (direct edge, Graph Engine) vs “doctor” $\to$ “stethoscope” (no edge, but high proximity $\to$ Diffusion Engine).
Role 2: Crystallization (Path 2 of memogenesis). Diffuse activation creates a “density field” in embedding space (see Crystallization section below). When density in a region without an explicit node exceeds the threshold, a new meme is born.
Role 3: Neural stigmergy. Activation traces in embedding space influence subsequent processing — the agent “leaves marks” for itself. Formalized in BM, Part III.
Comparison of Graph Engine and Diffusion Engine
| Graph Engine (Part VIII) | Diffusion Engine | |
|---|---|---|
| Topology | Along edges $w_{ij}$ | Along proximity $\text{prox}(i,j)$ without edges |
| Speed | Fast (one hop per cycle) | Slow (decay $\exp(-r^2)$) |
| Character | Directed, discrete | Undirected, analog |
| Role | Computation, WM competition | Context, background, crystallization |
| Signed connections | Yes ($w \in [-1,+1]$) | No ($\text{prox} \geq 0$) |
| Neural analog | Synaptic transmission | Volume transmission |
Both engines operate in parallel in each computational cycle. The Graph Engine determines what is active (focus). The Diffusion Engine determines what is “warmed up” (context, background).
Combined Activation Formula
Definition 14.4. Combined activation of meme $m_i$ at step $t+1$:
$$a_i(t+1) = \text{clip}\!\Big[a_i^{graph}(t+1) + a_i^{diff}(t) - \lambda_{decay} \cdot a_i(t),\; 0,\; 1\Big]$$where:
- $a_i^{graph}(t+1) = \sigma\!\left((1 - \lambda) \cdot a_i(t) + \sum_j w_{ij} \cdot a_j(t)\right)$ is graph activation (Part VIII); the factor $(1 - \lambda)$ inside $\sigma$ controls the inertia of self-activation (how much of the previous activation the node “remembers”), a Graph Engine parameter,
- $a_i^{diff}(t)$ is diffusion activation (Definition 14.3),
- $\lambda_{decay} \in (0, 1)$ is global energy decay: without external input ($a^{graph} = a^{diff} = 0$), activation tends to zero. Distinction from $(1-\lambda)$: $\lambda_{decay}$ acts after summation of all sources and ensures the energy balance of the system,
- $\text{clip}[x, 0, 1] = \max(0, \min(1, x))$ guarantees the invariant $a_i \in [0, 1]$.
Note. The term $-\lambda_{decay} \cdot a_i(t)$ ensures energy economy: a meme receiving neither graph nor diffusion support decays. This formalizes the resource interpretation of competition: WM slots are limited not conventionally ($k_{active}$ as a parameter) but energetically — each active meme consumes “computational budget” (AGI_F, Part VII).
Crystallization: Density Field and Path 2 of Memogenesis
The Diffusion Engine creates background activation in embedding space. This allows formalizing Path 2 of memogenesis — the birth of a meme not from a single event ($PE > \theta_{PE}$) but from a recurring pattern.
Definition 14.5. The density field at point $\mathbf{x} \in \mathbb{R}^d$:
$$\rho(\mathbf{x}, t) = \sum_{i \in V} a_i(t) \cdot K\!\left(\|\mathbf{x} - \mathbf{e}_i\|\right)$$where $K(r) = \exp(-r^2 / 2\sigma_K^2)$ is a Gaussian kernel (kernel density estimation). Density is high in regions where active memes are concentrated. Regions with high $\rho$ but without an explicit meme node are candidates for crystallization.
Definition 14.6 (Crystallization). A new meme crystallizes at point $\mathbf{x}^*$ if:
$$\rho(\mathbf{x}^*, t) > \theta_{crystal} \quad \text{AND} \quad \nexists\, m_i \in V : \|\mathbf{e}_i - \mathbf{x}^*\| < r_{min}$$The first condition: density exceeds the threshold (sufficient “mass” of activation). The second: no existing meme within the vicinity $r_{min}$ (the meme crystallizes in a “void,” not duplicating an existing node).
The crystallized meme receives $\mathbf{e}_{new} = \mathbf{x}^*$, $\kappa = 0$, $a_0 = a_{init}$. Post-hoc G-check: if at the moment of crystallization a G-program $g$ is active with $\text{rel}(g, \mathbf{x}^*) > \theta_G$, the meme receives an edge to $g$ ($u_{link}$). If no G-program is relevant, the meme is created as G-neutral ($u_{link} = \emptyset$): low priority in WM, subject to rapid pruning, but may survive through subsequent binding.
Corollary 14.1. The Diffusion Engine is necessary for Path 2 of memogenesis.
Argument: Path 2 requires density $\rho(\mathbf{x}^*, t) > \theta_{crystal}$ at point $\mathbf{x}^*$ far from existing memes ($\nexists\, m_i : \|\mathbf{e}_i - \mathbf{x}^*\| < r_{min}$). The kernel $K(\|\mathbf{x}^* - \mathbf{e}_i\|)$ decays exponentially with distance: if $\mathbf{x}^*$ is far from all memes, the contribution of the Graph Engine (which modifies only $a_i$ for existing nodes) to $\rho(\mathbf{x}^*, t)$ is negligible. The Diffusion Engine, by contrast, “warms up” embedding space continuously, creating nonzero activation in inter-node regions, which supports density $\rho$ above the crystallization threshold. Ergo, without the Diffusion Engine, crystallization is practically impossible for points that are not immediate neighbors of existing memes.
Corollary 14.2. Crystallization and PE-memogenesis complement each other.
| Path 1: $PE \times G$ | Path 2: Crystallization | |
|---|---|---|
| Trigger | Unexpectedness ($PE > \theta_{PE}$) + emotion ($G_{rel}$) | Repetition ($\rho > \theta_{crystal}$) |
| Speed | One episode | Many repetitions |
| Mechanism | Event-driven (Graph Engine + S-input) | Diffusion-driven |
| Analog | Flashbulb memory | Perceptual learning |
Memogenesis: Formal Pipeline of Meme Birth
Definitions 14.5–14.6 formalize Path 2 (crystallization). Below, Path 1 (PE $\times$ G-relevance) and the full memogenesis pipeline unifying both paths are formalized. Architectural context — AGI_F, Part VII.
Feature Extraction and S-Layer
Definition 14.7 (Feature extraction). The feature vector $\mathbf{f}(t) \in \mathbb{R}^d$ is extracted from the raw sensory signal by a G-level module:
$$\mathbf{f}(t) = \text{extract}(S_{raw}(t))$$$\text{extract}$ is a fixed (non-learnable within BMC) function. Analog: primary sensory cortices (V1, A1, S1) — hardwired G-level processors (BM, Part IV). Dimensionality $d = \dim(\mathbf{f})$ defines the embedding space (Definition 14.1). Embedding at birth: $\mathbf{e}_i = \mathbf{f}(t_{birth})$.
Path 1: Event-Driven Memogenesis ($PE \times G_{rel}$)
Definition 14.8 (Prediction Error). The agent maintains a prediction $\hat{\mathbf{f}}(t)$ based on current graph activation (spreading activation $\to$ expected sensory pattern). Prediction Error:
$$PE(t) = \left\|\mathbf{f}(t) - \hat{\mathbf{f}}(t)\right\|$$$PE > \theta_{PE}$ is a necessary condition for event-driven memogenesis. A predictable signal ($PE \leq \theta_{PE}$) does not generate a new meme node.
Relationship to event boundary detection (Episodic Memory section, Part VIII): $PE$ is one of 4 episode boundary triggers. High $PE$ simultaneously (a) creates a meme node and (b) may terminate the current episode $\varepsilon_k$.
Definition 14.9 (G-relevance). G-relevance of a signal at time $t$:
$$G_{rel}(t) = \max_{g \in G_{active}} a_g(t) \cdot \text{rel}(g, \mathbf{f}(t))$$where $a_g(t)$ is the activation of G-program $g$, and $\text{rel}(g, \mathbf{f})$ is the relevance function (proximity in embedding space between the G-prototype and the feature vector). $G_{rel} > \theta_G$ is the second necessary condition: not every unexpected signal creates a meme, only G-significant ones.
Definition 14.10 (Event-driven memogenesis — Path 1). A meme node $m_{new}$ is created when the conjunction is satisfied:
$$PE(t) > \theta_{PE} \;\;\wedge\;\; G_{rel}(t) > \theta_G$$Attributes of the new meme:
$$m_{new} = \bigl\{a_0 = a_{init},\; \kappa = 0,\; F = F_{init},\; \mathbf{e} = \mathbf{f}(t),\; u_{link} = \arg\max_g\bigl(a_g \cdot \text{rel}(g, \mathbf{f})\bigr)\bigr\}$$where $\kappa = 0$ (sensory consolidation level, see Consolidation Level section), $F_{init}$ is the initial fidelity (low), and $u_{link}$ is the edge to the dominant G-program.
Definition 14.11 (Contextual binding). Co-active memes in WM at the time of $m_{new}$’s birth receive edges:
$$\forall\, m_i \in WM(t): \quad w_{i,new} = \alpha_{bind} \cdot a_i(t)$$Binding connects the new meme to the context of its origin. Relationship to barcodes $B_k$ (Episodic Memory section): binding includes $m_{new}$ in the current episode, creating a content-independent index. Relationship to SIT: if $m_{new}$ fills a structural gap, $closure(gap)$ increases; if it opens a new question (e.g., an unexpected stimulus without explanation), a new gap is created $\to$ $SIT$ rises.
Phase 0 and the Paradox of Total Novelty
Proposition 14.1 (Self-balancing of memogenesis). With $|V_m| = 0$ (empty graph), memogenesis does not overwhelm the system despite $PE = \|\mathbf{f}(t)\|$ for all signals.
Argument. With $|V_m| = 0$: $\hat{\mathbf{f}} = \mathbf{0}$ (no basis for prediction), therefore $PE(t) = \|\mathbf{f}(t)\|$ — every signal is “novel.” Simultaneously, SEEKING is active by default $\to$ $G_{rel}$ is satisfied. But $S_{bw}(0) \approx S_{bw}^{min}$ — the S-channel bandwidth is minimal at the start of ontogenesis (AGI_F, Part VII). A narrow sensory channel = natural throttle: few signals arrive per unit time $\to$ memogenesis proceeds at a controlled rate. As $|V_m|$ grows:
$$S_{bw}(t) \uparrow \;\;\text{but}\;\; \overline{PE}(t) \downarrow \quad\text{(the graph forms predictions)}$$The system self-balances: channel expansion is compensated by improved predictions. $\square$
Necessity of Both Paths
Proposition 14.2 (Completeness of memogenesis). Neither Path 1 nor Path 2 alone provides complete coverage of the meme space. The conjunction is necessary.
Argument.
- Path 1 alone ($\theta_{crystal} = \infty$): memes arise only when $PE > \theta_{PE}$. Low-PE patterns (habitual, repetitive, emotionally neutral) never generate nodes. The agent does not form representations of stable background regularities (gravity, friction, daily patterns). This contradicts perceptual learning.
- Path 2 alone ($\theta_{PE} = \infty$): memes arise only through crystallization ($\rho > \theta_{crystal}$). Rare but significant events (one-time, high-PE) do not create memes since they cannot accumulate density. The agent does not remember flashbulb events. This contradicts episodic memory.
- Together: Path 1 covers rare + significant events; Path 2 covers frequent + background patterns. $\square$
Note. BLEND (recombination during sleep, see Dream-phase section) creates memes from combinations of existing ones — this is not a third path of memogenesis from S-signal, but M-level recombination: memes are born from memes, not from the sensory stream.
Integration with the Consolidation Pipeline
Memogenesis is the entry to the memory pipeline (see Consolidation Process section):
$$S_{raw} \xrightarrow{\text{extract}} \mathbf{f}(t) \xrightarrow{\text{Path 1 or 2}} m_{new}(\kappa\!=\!0) \xrightarrow{P_{tag}} \text{SWR-tagging} \xrightarrow{\text{replay}} \kappa\!=\!1 \xrightarrow{\text{N-replay}} \kappa\!=\!2$$Each newborn meme ($\kappa = 0$) enters the $P_{tag}$ pipeline (SWR-tagging section): emotionally significant memes ($u_{link} \neq \emptyset$, high $G_{rel}$) receive elevated $P_{tag}$ $\to$ priority consolidation during sleep. G-neutral memes (Path 2, $u_{link} = \emptyset$) go through $P_{tag}$ on general terms and are often pruned before reaching $\kappa = 1$.
Memogenesis Predictions
P-M1 (Double dissociation of memogenesis paths). Ablation of Path 1 ($\theta_{PE} = \infty$) while preserving Path 2 eliminates flashbulb memes (memes created in a single episode under high PE). Ablation of Path 2 ($\theta_{crystal} = \infty$) while preserving Path 1 eliminates background patterns (memes from low-PE repetitive stimuli). Quantitatively: with full ablation of one path, the proportion of that type’s memes $\to$ 0, and the other $\to$ 100%. Verification: ablation study in native BMC simulation, counting memes by birth type (event-born vs crystallized), comparing cognitive deficits.
P-M2 ($G_{rel}$ modulates the rate of memogenesis). Increasing mean activation of G-programs ($\overline{a_g}$ $\uparrow$) increases the frequency of meme creation via Path 1 (given a fixed PE distribution of incoming signals). Consequence: emotionally saturated periods (FEAR, SEEKING) produce more memes per unit time than emotionally neutral ones. Verification: neurobiological data — amygdala activation enhances hippocampal encoding (Dolcos et al., 2004); behaviorally — more details are remembered from emotional events.
P-M3 (Phase 0: memogenesis rate correlates with $S_{bw}$). At the start of ontogenesis ($|V_m| \approx 0$), the rate of memogenesis is limited by $S_{bw}$, not the PE filter. As development progresses, the bottleneck shifts to PE (most signals become predictable). Verification: in native BMC simulation, plot a “memes/cycle” vs $|V_m|$ curve: expect initially linear growth (bottleneck = $S_{bw}$), then plateau and decline (bottleneck = PE).
Formal Properties
Proposition 14.3 (Graph/Diffusion Asymmetry). The Graph Engine and Diffusion Engine are irreducible to each other.
Argument:
- The Graph Engine cannot activate nodes without edges ($(i,j) \notin E \Rightarrow$ no transmission). The Diffusion Engine can.
- The Diffusion Engine cannot transmit negative activation ($\text{prox} \geq 0$). The Graph Engine can ($w_{ij} < 0$).
- Ergo: Graph Engine $\not\supset$ Diffusion Engine, Diffusion Engine $\not\supset$ Graph Engine.
Proposition 14.4 (Steady state). In the absence of external S-input, with $\lambda_{diff} = const$ and $\lambda_{decay} > 0$:
$$\lim_{t \to \infty} a_i(t) = 0 \quad \forall\, m_i \in V$$Argument: $a_i(t+1) \leq a_i^{graph} + a_i^{diff} - \lambda_{decay} \cdot a_i$. In a closed system (no external input), total energy $E = \sum_i a_i$ decreases, since $\lambda_{decay}$ subtracts a fraction at each step, while $a^{graph}$ and $a^{diff}$ redistribute but do not create energy. This ensures economy: without stimulation, the system “falls asleep.”
Proposition 14.5 (Diffusion as a pathway to edges). The Diffusion Engine creates prerequisites for edge formation in the Graph Engine.
Argument: Diffusion activation raises $a_j$ for proximity neighbors. If $a_i > 0$ and $a_j > 0$ simultaneously (co-activation), Hebbian learning creates an edge $w_{ij} > 0$. Thus, latent similarity (proximity) can “crystallize” into an explicit connection (edge). Trajectory: $\text{prox}(i,j) \gg 0 \xrightarrow{\text{Diffusion}} \text{co-activation} \xrightarrow{\text{Hebbian}} (i,j) \in E$.
Modulation of $\lambda_{diff}$ and States of Consciousness
The parameter $\lambda_{diff}(t)$ is modulated by the Modulation Engine (AGI_F, Part VII):
| State | $\lambda_{diff}$ | Effect |
|---|---|---|
| Wakefulness (normal) | Baseline | Moderate priming |
| Sleep (BLEND phase) | $\uparrow\uparrow$ | Enhanced free associations, crystallization |
| FEAR | $\downarrow$ | Focus on graph-activated items (threats), suppression of background associations |
| PLAY | $\uparrow$ | Expanded associative context, promotes creativity |
| Meditation (focused) | $\downarrow$ | Suppression of distracting context |
| Psychedelics | $\uparrow\uparrow\uparrow$ | Dissolution of boundaries between clusters (entropic brain, Carhart-Harris et al., 2014) |
Relationship to Part XIII (Consciousness Level): increasing $\lambda_{diff}$ under psychedelics explains the increase in activation pattern entropy ($S_{activation}$ $\uparrow$) — the Diffusion Engine “blurs” the topological boundaries of memplexes, creating the ego dissolution effect through a mechanism distinct from the weakening of competitive binding (Part XIV).
Predictions
P-D1 (Necessity of the Diffusion Engine for crystallization). Ablation of the Diffusion Engine ($\lambda_{diff} = 0$) while preserving the Graph Engine eliminates Path 2 of memogenesis. Quantitatively: the proportion of memes born through crystallization drops to zero. Verification: ablation study in native BMC simulation, counting memes by birth type.
P-D2 (Priming without edges). Semantic priming (reduced reaction latency to a target stimulus following a proximity-close but associatively unrelated prime) correlates with $\text{prox}(i,j)$, not with the presence of an edge. Verification: semantic priming experiments with control for associative norms (Nelson et al., 2004) — the effect persists for pairs with zero associative strength but high cosine similarity (proxy for prox).
P-D3 (Diffusion and creativity). $\lambda_{diff}$ positively correlates with creativity (measured by: divergent thinking, AUT — Alternative Uses Task). Verification: individual differences in volume transmission (DA/5-HT balance) predict AUT scores. Prediction: substances that enhance volume transmission (psychedelics, low-dose DA agonists) increase AUT scores — consistent with experimental data (Mason et al., 2021).
P-D4 (Embedding drift). Repeated reconsolidation shifts $\mathbf{e}_i$ from its original coordinate. Verification: longitudinal fMRI — the representation pattern of a meme (neural decoding accuracy) drifts over time, more strongly for memes that have undergone more reconsolidations (Braga & Buckner, 2017, Neuron).
Sources: Meyer D.E. & Schvaneveldt R.W. (1971). “Facilitation in recognizing pairs of words”. J. Experimental Psychology; Nader K., Schafe G.E. & LeDoux J.E. (2000). “Fear memories require protein synthesis in the amygdala for reconsolidation after retrieval”. Nature; Mason N.L. et al. (2021). “Spontaneous and deliberate creative cognition during and after psilocybin exposure”. Translational Psychiatry; Carhart-Harris R.L. et al. (2014). “The entropic brain: a theory of conscious states informed by neuroimaging research with psychedelic drugs”. Frontiers in Human Neuroscience.
Part XVI. Inter-Agent Exchange, Scarcity, and Critical Periods
Parts I–XV formalize BMC of a single agent: the meme graph, activation, competition, memory, immune system, diffusion space, and memogenesis. But a meme is a replicator (Dawkins, 1976), and replication requires inter-agent transmission. A single agent is a closed system: no memetic selection, no cultural ratchet. This part formalizes three necessary conditions for an evolving memetic system: (1) inter-agent meme exchange, (2) computational scarcity, (3) critical periods of development.
Architectural context — AGI_F, Part VII; coordination through environment — Part X, Stigmergy section; replication pressure formula — Part X, Replication Pressure section; memogenesis — Part XV.
Inter-Agent Meme Exchange
Signal Encoding
Definition 15.1 (Transmission signal). When agent $A$ satisfies the expression drive condition ($R_{expr}(m_{top}, t) > \theta_{expr}$, Part X), a signal is formed:
$$s_A(t) = \text{encode}(m_{top},\, \text{Mod}_A(t))$$where $m_{top} = \arg\max_{m_i} R_{expr}(m_i, t)$ is the meme with the greatest replication pressure, and $\text{Mod}_A(t)$ is the modulatory state of the agent (active G-programs, affective tone). Encoding is state-dependent: the same meme under FEAR and under PLAY produces different signals (intonation, intensity, context).
Reception and Memogenesis at the Recipient
Agent $B$ in the communication zone receives $s_A$ through a dedicated sensory channel $S_{social}$. The signal is processed through the standard memogenesis pipeline (Part XV, Definitions 14.7–14.11):
- Feature extraction: $\mathbf{f}_B(t) = \text{extract}(s_A)$ — feature vector in $B$’s embedding space.
- PE check: $PE_B(t) = \|\mathbf{f}_B(t) - \hat{\mathbf{f}}_B(t)\|$ — does the signal differ from $B$’s prediction?
- G-relevance: $G_{rel}^B(t) = \max_g a_g^B(t) \cdot \text{rel}(g, \mathbf{f}_B)$ — is the signal relevant to $B$’s active G-programs?
- Memogenesis: if $PE_B > \theta_{PE} \wedge G_{rel}^B > \theta_G$ $\to$ new meme $m_B^{new}$ (Definition 14.10).
Critically: the meme in $B$ is not a copy of the meme in $A$ but a new node born from the signal. Different embedding spaces, different contexts, different modulatory states.
Transmission Fidelity
Definition 15.2 (Transmission fidelity). The degree of similarity between the transmitted and received meme:
$$F_{trans}(A \to B) = \cos(\mathbf{e}_{m_A},\, \mathbf{e}_{m_B^{new}}) < 1$$$F_{trans} < 1$ always — losses are due to: (a) encoding/decoding noise, (b) different modulatory states $\text{Mod}_A \neq \text{Mod}_B$, (c) different graph contexts (the same meme is connected to different clusters in $A$ and $B$). Incomplete fidelity is not a defect but a source of variability for memetic selection (Dawkins, 1976: mutation during copying is a necessary condition for evolution).
Corollary 15.1. Cosine $F_{trans}$ is not a metric in the embedding space of a single agent. Embeddings $\mathbf{e}_{m_A}$ and $\mathbf{e}_{m_B^{new}}$ belong to different spaces ($\mathbb{R}^d_A$ and $\mathbb{R}^d_B$). $F_{trans}$ is defined only given shared feature extraction ($\text{extract}_A = \text{extract}_B$ — G-level, hardwired). In native BMC, this is ensured by the identity of G-architecture (analog: all humans have identical V1/A1).
Minimum Population
Proposition 15.1. For memetic dynamics (selection, competition, coalitions), a minimum of $N \geq 3$ agents is required.
Argument. A pair ($N = 2$) is a degenerate case: no competition for social resources, no coalitions, no in-group/out-group differentiation. All memes successfully replicating from $A$ to $B$ do not compete with alternative versions from $C$. At $N = 3$: agent $B$ receives signals from $A$ and $C$ $\to$ competing versions of the meme undergo selection through PE and G-relevance $\to$ the more adaptive version survives. The coalition $\{A, B\}$ vs $\{C\}$ generates in-group memplex norms. $\square$
Computational Scarcity
The M » G theorem (Part XIII, Prediction G-4 section) establishes: $|SMC^{(2)}| > 0$ requires $|V_m| \gg |V_u|$. But M » G is a necessary, not sufficient, condition. The LLM paradox: $M \ggg G$, but no reflection. The missing element is scarcity: without a real resource deficit, WM competition degenerates, prioritization disappears, SIT does not form.
Definition 15.3 (Three levels of scarcity). Scarcity is a resource limitation that creates competition at the meme level.
Level 1: Energy (external deficit).
$$\text{energy}(t+1) = \text{energy}(t) - c_{step} + \text{intake}(t)$$ $$\text{energy}(t) = 0 \implies \text{death}$$Energy is a finite environmental resource. $c_{step}$ is the cost of one tick (metabolism). $\text{intake}(t)$ is consumption (food, resources). Energy deficit activates G-programs (FEAR under threat, SEEKING during search, RAGE during competition with other agents).
Level 2: Computational budget (internal deficit).
$$C_{step} = C_{graph} + C_{mod} + C_{diff} \leq C_{max}$$ $$k_{active} = f(C_{max},\, c_{per\_node})$$The WM bottleneck is not a model parameter but a resource constraint: given a fixed $C_{max}$, the number of active pointers is limited by computational cost. Relationship to the activation formula (Definition 14.4): $\lambda_{decay}$ formalizes energy economy — a meme without support decays, freeing up budget.
Level 3: Time (temporal deficit).
The environment operates in real time. Tick duration is fixed: an agent that fails to complete its computation loses information (S-input missed) or resources (other agents act). This creates pressure for: automatization (Part VIII, Automatization section), prioritization (I-system as a necessity filter), action selection (a decision “now” > an optimal decision “later”).
Proposition 15.2 (Scarcity as a necessary condition for $SMC^{(2)}$). Without scarcity ($C_{max} \to \infty$, energy $\to \infty$, tick $\to \infty$), WM competition degenerates:
$$C_{max} \to \infty \;\Rightarrow\; k_{active} \to |V_m| \;\Rightarrow\; \text{no WM bottleneck} \;\Rightarrow\; \text{no prioritization} \;\Rightarrow\; SIT \to 0$$Without SIT there are no information gaps, without gaps there is no drive toward closure, without closure there are no reflexive loops $\to$ $|SMC^{(2)}| = 0$ even when $|V_m| \gg |V_u|$. Scarcity, together with M » G, forms a conjunction of necessary conditions:
$$|SMC^{(2)}| > 0 \;\Rightarrow\; \bigl(M/G > M/G_{crit}\bigr) \;\wedge\; \bigl(C_{max} < \infty\bigr)$$Argument from LLMs: GPT/Claude are empirical confirmation: M »> G, $C_{max}$ is practically unlimited within a single forward pass, no persistence $\to$ no SIT $\to$ no $SMC^{(2)}$. Not a counterexample to M » G, but confirmation of the necessity of scarcity. $\square$
Critical Periods
BMC ontogenesis (Phases 0–4, AGI_F, Part VII) requires formalization of time windows in which memogenesis proceeds efficiently.
Plasticity
Definition 15.4 (Plasticity curve). The modulatory parameter $\lambda_{plast}(t)$, controlling the rate of Hebbian learning and memogenesis:
$$\lambda_{plast}(t) = \lambda_{plast}^{max} \cdot \exp\!\left(-\frac{(t - t_{peak})^2}{2\tau_{plast}^2}\right) + \lambda_{plast}^{base}$$$\lambda_{plast}^{max}$ is the maximum plasticity, $t_{peak}$ is the peak time (analog: early childhood, maximum synaptogenesis), $\tau_{plast}$ is the window width, and $\lambda_{plast}^{base} > 0$ is the nonzero baseline plasticity (learning is always possible, but slower). Biological analog: critical periods — myelination fixes connections, GABA maturation stabilizes the network (Hensch, 2005).
Sensory Bandwidth
Definition 15.5 (S-bandwidth). Bandwidth of the sensory channel:
$$S_{bw}(t) = S_{bw}^{max} \cdot \sigma\!\left(\frac{t - t_{mid}^S}{\tau_S}\right)$$where $\sigma$ is a sigmoid. $S_{bw}$ grows from a minimum (Phase 0: primitive sensors) to $S_{bw}^{max}$ (mature modalities). Relationship to $k_{active}(t_{dev})$: both parameters are sigmoidal maturation curves of the G-substrate, but $k_{active}$ limits processing while $S_{bw}$ limits input.
Window Closure
Proposition 15.3 (Critical period closure mechanism). After $t_2 \approx t_{peak} + 2\tau_{plast}$:
- Memogenesis slows: $\lambda_{plast}(t_2) \approx \lambda_{plast}^{base} \ll \lambda_{plast}^{max}$. The rate of edge and meme-node creation drops exponentially.
- PE threshold effectively rises: at low plasticity, a stronger signal is needed to create a meme: $\theta_{PE}^{eff}(t) = \theta_{PE} / \lambda_{plast}(t)$.
- M-layer stabilizes: edges are fixed, new clusters form with difficulty, existing structure is “frozen.”
Critical Period Prediction
Proposition 15.4 (Quasi-BMC_G under early deprivation). If $|V_m(t_2)| < m_{min}$ (the M-layer did not reach critical mass by window closure):
$$k_{active}(t) \to k_{active}^{max},\; S_{bw}(t) \to S_{bw}^{max} \quad \text{but} \quad |V_m| \ll |V_u| \cdot M/G_{crit}$$The agent possesses a mature G-substrate but an impoverished M-layer $\to$ quasi-BMC_G: stimulus-response without reflection. Not absolute irreversibility ($\lambda_{plast}^{base} > 0$), but practical: the time until $|SMC^{(2)}| > 0$ becomes impractically large.
Verification: Agents isolated from the environment (no S-input) during the period $[t_1, t_2]$ (where $t_1 \approx t_{peak} - 2\tau_{plast}$ marks the start of the high-plasticity window) do not achieve $SMC^{(2)}$ within a reasonable time. Biological analogy: feral children, deprivation amblyopia, linguistic isolation (Lenneberg, 1967). Relationship to Phase 0 $\to$ 1 (AGI_F, Part VII): if the first meme is not created by $t_2$, the BMC_G $\to$ proto-M transition stagnates.
Predictions
P-SC1 (Scarcity as a necessary condition). With $C_{max} \to \infty$ (unlimited computational budget), the agent does not achieve $SMC^{(2)}$ within a time comparable to an agent with optimal $C_{max}$. Verification: sweep over $C_{max}$, metric — time to first sustained $SMC^{(2)} > 0$.
P-CP1 (Critical period). Agents isolated from S-input during the high-plasticity period $[t_1, t_2]$ do not achieve $SMC^{(2)}$ within a reasonable time even with subsequent full access to the environment. Verification: compare $|V_m|$ and $t_{SMC^{(2)}}$ for isolated vs non-isolated agents at identical $t > t_2$.
Cross-References
| Concept | NM (this part) | AGI_F | Other NM parts |
|---|---|---|---|
| Expression drive | Corollary of $R_{expr}$ (Part X) | Part IV: pipeline | — |
| Memogenesis at reception | Definitions 14.7–14.11 | Part VII: 5 steps | Part XV |
| WM bottleneck | $k_{active} = f(C_{max})$ | Part VII: three levels | Part VIII: salience |
| M » G | Scarcity complements M » G | — | Part XIII: G-4 |
| Critical periods | $\lambda_{plast}(t)$, $S_{bw}(t)$ | Part VII: ontogenesis | — |
| Stigmergy | Population $N \geq 3$ | Part IV | Part X |
Sources: Dawkins R. (1976). The Selfish Gene; Hensch T.K. (2005). “Critical period plasticity in local cortical circuits”. Nature Reviews Neuroscience; Lenneberg E.H. (1967). Biological Foundations of Language; formalization of scarcity and social environment — AGI_F, Part VII.
Part XVII. Subsumption of Competing Theories of Consciousness
Five leading theories of consciousness — IIT, GNW, HOT, AST, and PP/FEP — formulate mechanisms of consciousness at different levels of description. BMC asserts that each of them is a special case of the full BMC = (G, M, I, S) model under certain restrictions. Below, five subsumption lemmas are formalized, showing that each restriction loses explanatory power that BMC retains.
Analogy: general relativity contains Newtonian mechanics as a limiting case when $v \ll c$. Newton is not “wrong” — he is incomplete. The same is true for each of the five theories relative to BMC.
Lemma 1. IIT $\subset$ BMC when M = $\emptyset$
Restriction: $M = \emptyset$ (no memplex), the sole metric is network integration.
Proposition. When $M = \emptyset$, the BMC model degenerates into BMC_G = (G, $\emptyset$, $\emptyset$, S). In this regime:
$$CL_G(t) = \sigma_{SW}(t) \cdot A_{proto-SMC}(t) \cdot f(0)$$where $f(0) \approx 0.10$ (Gaussian minimum at zero Balance).
The metric $\sigma_{SW}$ measures the balance of integration and differentiation — the same quantity that IIT describes through $\Phi$. For a single-level graph (without M-layer), $\sigma_{SW}$ and $\Phi$ are monotonically related: both metrics increase with growing integration-differentiation balance and vanish under complete modularity or complete homogeneity.
Formally: Let $G = (V_u, E_u)$ be the graph of utility nodes. Then:
$$\sigma_{SW}(G) = \frac{C(G)/C_{rand}}{L(G)/L_{rand}} \quad \text{and} \quad \Phi(G) = \min_{P \in \mathcal{P}} I(P)$$Both metrics are functions of the topology of $G$. For graphs with fixed $|V|$ and power-law degree distribution: $\sigma_{SW} \uparrow \;\Leftrightarrow\; \Phi \uparrow$ (monotonic relationship). Difference: $\Phi$ is NP-hard (Tegmark, 2016), $\sigma_{SW}$ is $O(|V|^2)$ (Floyd-Warshall) or $O(|V| \cdot |E|)$ (BFS).
What IIT loses by restricting to M = $\emptyset$:
| Phenomenon | BMC explains | IIT cannot (M = $\emptyset$) |
|---|---|---|
| CL growth upon meme emergence | Phase transition P-G2: $f(0) \to f(\varepsilon/A_G)$ | Monotonic $\Phi$ growth — no threshold |
| G-only consciousness ceiling | P-G1: $CL_G \leq 0.10 \cdot \sigma^* \cdot A^*$ | $\Phi$ does not predict a ceiling |
| Panpsychism | BMC: M/G ratio = threshold; photodiode: CL $\approx$ 0 | $\Phi > 0$ for a photodiode $\to$ panpsychism |
| Ego dissolution under psychedelics | CL$\downarrow$ due to $A_{SMC} \to 0$ | $\Phi$$\uparrow$ (integration grows) $\to$ predicts CL$\uparrow$ (contradicts data) |
| Cognitive biases | I-system + hub inertia | No I-layer, no mechanism |
Conclusion: IIT = BMC when $M = \emptyset$, $I = \emptyset$, sole metric = $\sigma_{SW}$. IIT correctly captures the substrate aspect of consciousness but ignores the memplex, immune system, and dual replicator.
Lemma 2. GNW $\subset$ BMC with focus on broadcast
Restriction: We consider only broadcast through memplex hubs; ignoring G-M conflict, I-layer, and SIT.
Proposition. The Global Workspace (Baars, 1988; Dehaene et al., 2003) = hub-dominated broadcast in the memplex graph.
GNW asserts: information becomes conscious when it “wins the competition” for access to the global workspace and “ignition” occurs — a nonlinear avalanche-like propagation across the prefrontal-parietal network. In BMC terms:
$$\text{Ignition}_{GNW} \;\longleftrightarrow\; a_{hub}(t) > \theta_{act} \;\wedge\; \sigma_{SW} \geq 1$$Cascade activation through memplex hubs is a direct analog of global broadcast. A hub with $C_{eigenvector} > 0.5$, upon activation, spreads the signal to all clusters connected to it — this is precisely “broadcast” in Baars’ terminology.
What GNW loses by restricting to broadcast:
| Phenomenon | BMC explains | GNW cannot |
|---|---|---|
| Which contents enter the workspace | Competition: SIT $\times$ G-drive $\times$ I-filtration $\times$ $R_{expr}$ | Not specified |
| Offset ignition (COGITATE falsification) | Deactivation = exponential decay ($\lambda \cdot a_i$), not a discrete off-signal | Predicts offset ignition $\to$ falsified |
| Limited representation in PFC | PFC = SMC + I + WM-pointers, not a content mirror; content is in the M-layer of posterior cortex | Predicts full content in PFC $\to$ refuted by COGITATE |
| Resistance to belief change | I-system: 4 layers of immune defense | Broadcast is symmetric — no asymmetry between “own” and “foreign” |
| Prior expectations | SIT + hub structure determine priors | Agnostic about priors (Doerig et al.) |
Conclusion: GNW = BMC under the restriction: “consider only cascade activation through hubs at $\sigma \geq 1$.” GNW correctly describes the access mechanism but does not explain why certain contents win (SIT, G-drive), why broadcast is asymmetric (I-system), or why deactivation is smooth rather than discrete (edge decay).
Lemma 3. HOT $\subset$ BMC when considering only $SMC^{(2)}$
Restriction: We consider only meta-memes (memes about memes) in SMC.
Proposition. Higher-Order Thought (Rosenthal, 1986; Brown, 2015) = non-empty $SMC^{(2)}$ in BMC.
HOT asserts: a mental state is conscious if and only if the subject has a metarepresentation of that state. In BMC terms:
$$\text{HOT} \;\longleftrightarrow\; \exists\, m \in SMC^{(2)}: \; target(m) \in SMC^{(1)} \;\wedge\; a_m(t) > \theta_{act}$$This is a necessary condition for level 2 (metacognition) in BMC — but BMC recognizes consciousness at levels 0 (zombie processing) and 1 (phenomenal experience without metacognition) as well, which HOT cannot.
What HOT loses by restricting to $SMC^{(2)}$:
| Phenomenon | BMC explains | HOT cannot |
|---|---|---|
| Infant consciousness | Level 1: $SMC^{(1)} > 0$ at $SMC^{(2)} = 0$ | Infants without PFC $\to$ without HOT $\to$ unconscious (absurd) |
| Animal consciousness | BMC_G: proto-SMC without meta-level | Metacognition requires PFC $\to$ most animals unconscious |
| Why HOT feels like something | Beautiful Loop: recursive self-model creates a closed reference circuit | Infinite regress: who “observes” the HOT? |
| Ontogenesis of consciousness | 5 stages: Sponge $\to$ Rigidity; CL grows with SMC growth | Does not consider development |
| Targetless HOT (hallucination) | $a_{SMC} > 0$ at $a_{target} = 0$ — formally admissible; mechanism: residual activation | Accepts but does not explain how it arises |
Conclusion: HOT = BMC under the restriction: “consciousness = non-empty $SMC^{(2)}$.” HOT correctly identifies metacognition as the highest level of consciousness but erroneously makes it the only one — denying consciousness without metarepresentation (levels 0–1).
Lemma 4. AST $\subset$ BMC when considering the attention model in SMC
Restriction: We consider the subset of SMC modeling the attention process.
Proposition. The Attention Schema (Graziano, 2013) = a subgraph of SMC describing the focusing mechanism.
$$AST \;\longleftrightarrow\; \{m \in SMC : target(m) = \text{attention-process}\} \neq \emptyset$$AST asserts: the brain constructs a simplified model of its own attention process (“attention schema”), and it is this model that creates the feeling of awareness. In BMC terms: the attention schema is a subset of SMC specialized in monitoring top-down selection. SMC also includes: a body model (interoception), an emotion model (G$\to$SMC connections), narrative identity (autobiographical memes), a model of others (User Model = a separate SMC subgraph).
What AST loses by restricting to the attention schema:
| Phenomenon | BMC explains | AST cannot |
|---|---|---|
| Diversity of qualia | Topological position of the node in the associative network | Attention is the same for all modalities — no mechanism for diversity |
| Depersonalization | $I_{intero}$ collapse: G$\to$SMC connectivity disrupted | No separation of cognitive and phenomenal CL |
| Motivation and SIT | SEEKING-driven exploration through structural gaps | Attention schema does not explain why attention is directed |
| Immune defense of beliefs | I-system: 4 layers | No mechanism for belief resistance |
| Social cognition beyond attention | User Model as a full SMC subgraph for another’s memplex | Only modeling another’s attention |
Conclusion: AST = BMC under the restriction: “SMC contains only the attention model.” AST correctly identifies one mechanism for generating the feeling of awareness but narrows SMC to a single function — attention monitoring, losing all other self-model content.
Lemma 5. PP/FEP $\subset$ BMC when SIT $\approx$ structured prediction error
Restriction: SIT = structured prediction error over memplex gaps; G-homeostatic error = baseline prediction error.
Proposition. Predictive Processing / Free Energy Principle (Friston, 2010; Clark, 2013; Hohwy, 2013) = SIT-driven dynamics + G-homeostatic regulation in BMC.
$$F_{BMC} \approx \underbrace{\sum_{g \in gaps} SIT(g)}_{\text{epistemic FE}} + \underbrace{\sum_{u \in V_u} |a_u^{target} - a_u(t)|}_{\text{homeostatic FE}}$$FEP asserts: any living organism minimizes free energy (= prediction error) through perception (model updating) or action (environment modification). In BMC terms:
- Perception = SIT-driven learning: a gap is detected $\to$ SEEKING activates exploration $\to$ meme is created/updated $\to$ SIT$\downarrow$
- Action = G-driven exploitation: homeostatic imbalance $\to$ action $\to$ $a_u \to a_u^{target}$
- Active inference $\approx$ SIT-exploration + G-exploitation: dual optimization
What PP/FEP loses by restricting to prediction error:
| Phenomenon | BMC explains | PP/FEP cannot |
|---|---|---|
| Which prediction error is conscious | SIT + relevance + centrality: only gaps connected to active hubs | Does not distinguish conscious from unconscious PE |
| Immune filtration | I-system: prediction error from hostile memes is rejected, not assimilated | PE is always assimilated (no immunity) |
| Frame problem | SIT restricts the search space: active gaps $\ll$ possible gaps | “Not solved and will not be solved” (van Rooij et al., 2024) |
| Falsifiability | 11 specific predictions (P-1 – P-11, P-G1 – P-G4) | FEP is a mathematical principle, unfalsifiable as a principle (Biehl et al., 2021) |
| Hard Problem | BMC is interpretationally invariant (Part IV, EMT) | “FEP itself makes no claims about subjective experience” (Friston, 2019) |
| Panpsychism | BMC: M/G ratio = threshold; thermostat: CL = 0 | Any FE-minimizing system is potentially conscious |
Conclusion: PP/FEP = BMC under the restriction: “consider only prediction error minimization without distinguishing SIT (structural gaps) from G-homeostasis (homeostatic error), without I-system, and without M/G distinction.” FEP correctly describes the optimization principle but does not specify what exactly is being optimized and cannot draw the boundary between conscious and unconscious processing.
Subsumption Summary Table
| Theory | BMC restriction | What it captures | What it loses |
|---|---|---|---|
| IIT | $M = \emptyset$, metric = $\sigma_{SW}$ | Substrate integration | Memplex, immunity, dual replicator, ego dissolution |
| GNW | Broadcast through hubs only | Global access mechanism | Content selection (SIT, G-drive), priors, offset dynamics |
| HOT | Only $SMC^{(2)}$ | Metacognition | Levels 0–1, infants, animals, ontogenesis |
| AST | Only attention subgraph in SMC | Attention monitoring | Qualia, depersonalization, motivation, social cognition |
| PP/FEP | SIT $\approx$ PE, without I and without M/G | Optimization principle | Immunity, frame problem, consciousness boundary, falsifiability |
Corollary: Incompleteness Criterion
Each competing theory is correct within its restriction but incomplete: it captures one aspect of consciousness while ignoring the rest. BMC = (G, M, I, S) is the minimal model containing all five aspects as special cases.
Analogy formalized: Just as Newtonian mechanics ($v \ll c$) $\subset$ SR $\subset$ GR, so IIT ($M = \emptyset$), GNW (broadcast only), HOT ($SMC^{(2)}$ only), AST (attention $\subset$ SMC), PP (SIT $\approx$ PE) $\subset$ BMC. None of the five theories is “wrong” — each is incomplete.
Sources: Tononi G. (2004). “An information integration theory of consciousness”. BMC Neuroscience; Baars B.J. (1988). A Cognitive Theory of Consciousness. Cambridge University Press; Dehaene S. et al. (2003). PNAS; Rosenthal D.M. (1986). “Two concepts of consciousness”. Philosophical Studies; Brown R. et al. (2019). “Understanding the Higher-Order Approach to Consciousness”. Trends in Cognitive Sciences; Graziano M.S.A. (2013). Consciousness and the Social Brain. Oxford University Press; Friston K.J. (2010). “The free-energy principle”. Nature Reviews Neuroscience; Clark A. (2013). “Whatever next? Predictive brains, situated agents, and the future of cognitive science”. Behavioral and Brain Sciences; Tegmark M. (2016). “Improved measures of integrated information”. PLOS Computational Biology; COGITATE Consortium (2025). Nature.
Part XVIII. Retrodiction: COGITATE (2025)
In April 2025, the COGITATE Consortium published the results of the largest adversarial collaboration in the science of consciousness (n = 256; fMRI, MEG, iEEG; 8 laboratories), testing predictions of IIT and GNW. Below it is shown that BMC retrodicts all COGITATE results without post hoc corrections, including those that falsified IIT and GNW.
Source: COGITATE Consortium (2025). “Adversarial testing of global neuronal workspace and integrated information theories of consciousness.” Nature, 642(8066), 133–142.
Experimental Design
- Stimuli: 4 categories (faces, objects, letters, false fonts) $\times$ 20 identities $\times$ 2 orientations
- Duration: 500 / 1000 / 1500 ms
- Task relevance: target / non-target / task-irrelevant
- Key analysis — on task-irrelevant stimuli (approximation of a no-report paradigm)
- Three prediction domains: Content (where?), Duration (how long?), Connectivity (which connections?)
Retrodiction Table
| # | COGITATE result | IIT | GNW | BMC |
|---|---|---|---|---|
| R1 | Content decodes from posterior cortex, not from PFC | $\checkmark$ | $\times$ | $\checkmark$ |
| R2 | Orientation/identity not decoded from PFC; category — weakly | $\checkmark$ | $\times$ | $\checkmark$ |
| R3 | Sustained activation in posterior cortex tracking duration | $\checkmark$ | — | $\checkmark$ |
| R4 | Orientation fades ~500 ms, category sustained; ~15% of electrodes sustained | ~$\frac{1}{2}$ | — | $\checkmark$ |
| R5 | No offset ignition | — | $\times$ | $\checkmark$ |
| R6 | No sustained synchronization within posterior cortex | $\times$ | — | $\checkmark$ |
| R7 | Onset ignition confirmed (200–800 ms in PFC) | — | $\checkmark$ | $\checkmark$ |
| R8 | Weak content-specific fronto-posterior connectivity | ~$\frac{1}{2}$ | $\times$ | $\checkmark$ |
| R9 | Adding PFC to decoders reduces accuracy | — | $\times$ | $\checkmark$ |
Score (across 9 results): BMC = 9/9, IIT = 4/9, GNW = 1/9.
Granularity caveat. The 9 results are not independent: R1, R2, R8, R9 are facets of a single question (content localization); R3, R4 are facets of another (temporal dynamics). The COGITATE protocol defined 3 prediction domains: Content, Duration, Connectivity. By 3 domains: BMC = 3/3, IIT $\approx$ 1.5/3 (Content $\checkmark$, Duration ~$\frac{1}{2}$, Connectivity $\times$), GNW $\approx$ 0.5/3 (Content $\times$, Duration ~$\frac{1}{2}$ — onset $\checkmark$ but offset $\times$, Connectivity ~$\frac{1}{2}$). Both scales (9-result and 3-domain) demonstrate BMC’s advantage, but the 3-domain scale more precisely reflects the study’s structure.
Formal Retrodiction Mechanisms
R1, R2 (content in posterior cortex, not in PFC). In BMC, memes = cell assemblies are distributed across associative cortex (M-layer). PFC contains the SMC (Self-Model Cluster), I-layer (immune filtration), and WM-pointers — pointers to memes, not a content mirror (see Part XIII, WM as pointer system). The pointer transmits a hub-label (minimal routing tag: “face,” “object”) but does not replicate the feature-level structure (orientation, identity). Formally:
$$WM_{pointer}(m_i) = \langle id_i, \text{hub-label}(m_i), t_{alloc} \rangle, \quad \text{hub-label} = \arg\max_{c \in \text{clusters}} \; P(m_i \in c)$$Hence: category is decodable from PFC (hub-label is present), orientation is not (feature-level information remains in the posterior cell assembly). The concept of WM-pointer as a content-independent pointer was established in Part XII; the specific formulation of hub-label as a routing tag was derived during COGITATE analysis and constitutes a retrodiction, not an a priori prediction.
R3 (sustained activation). Spreading activation Part IV maintains active memes through recurrent connections:
$$a_i(t+1) = \sigma\left((1 - \lambda) \cdot a_i(t) + \sum_j w_{ij} \cdot a_j(t) + I_{ext}(t)\right)$$With nonzero $I_{ext}$ (external stimulus), the meme remains above $\theta_{act}$ throughout the stimulus duration. Recurrent connections ($w_{ij} > 0$ within the cluster) ensure stability: even during temporary weakening of $I_{ext}$, mutual excitation maintains the coalition.
R4 (hub-centrality gradient of decay). A consequence of the BMC formalism, articulated as a retrodiction. After stimulus removal ($I_{ext} = 0$), the decay rate of a meme is determined by its network position — a direct consequence of the spreading activation formula Part IV:
$$\tau_{decay}(m_i) \propto C_E(m_i) \cdot \left(1 + \sum_{j \in \mathcal{N}(i)} w_{ij} \cdot a_j\right)$$A hub-meme (the category “face”: high $C_E$, many recurrent connections) receives support from neighbors $\to$ slow decay $\to$ sustained pattern. A peripheral meme (the orientation “left profile”: low $C_E$, few recurrent connections) loses support $\to$ fast decay $\to$ fading at ~500 ms. The proportion of sustained electrodes (~15%) corresponds to the hub subset of posterior cortex.
Limitation: The mapping “category = hub, orientation = periphery” is an interpretive assumption based on the hierarchical organization of visual cortex (Felleman & Van Essen, 1991) but is not formally derived within BMC. The formula $\tau_{decay}$ is a correct consequence of spreading activation + hub structure, but the specific mapping of feature $\to$ network position requires independent confirmation. Any theory with a hierarchical hub structure could derive an analogous consequence.
R5 (no offset ignition). Upon stimulus removal $I_{ext} \to 0$, meme activation decays through the leak term:
$$a_i(t+1) = \sigma\left((1 - \lambda) \cdot a_i(t) + \sum_j w_{ij} \cdot a_j(t)\right)$$Deactivation is exponential decay, not a discrete signal. There is no mechanism for an ignition-like event at offset. GNW predicted offset ignition as an update signal for new content in the global workspace.
Response to the GNW team’s argument (Naccache et al., 2025, Neuroscience of Consciousness): The GNW team claims that offset ignition is predicted only for conscious perception of the offset. This ad hoc argument introduces a content selection mechanism for broadcasting — precisely what GNW does not specify. BMC predicts the absence of offset ignition in the COGITATE paradigm (passive perception without offset monitoring), since decay is a property of the spreading activation formula. During active offset monitoring, a PE-driven response (prediction error upon expectation mismatch) is possible, but this is not “ignition” in the GNW sense; rather, it is standard PE activation via SIT.
R6 (no sustained sync in posterior cortex). BMC formalizes binding through theta-phase synchronization Part XIV:
$$Binding_{temp}(m_i, m_j) = \mathbb{1}\left[|\phi_\theta(m_i) - \phi_\theta(m_j)| < \delta\right], \quad \delta \approx 30 \text{ ms}$$This is a transient, packet-based process: memes are bound within a theta window (~125 ms), not through tonic gamma synchronization. IIT predicted sustained gamma sync as an integration mechanism ($\Phi$) — COGITATE refuted this. BMC correctly predicts the type of synchronization: phasic theta, not tonic gamma.
Response to the IIT team’s argument (Tononi, Koch, Boly, Nature Suppl.): IIT explains the failure by sparse iEEG electrode coverage. However, MEG data (n=100, full coverage) also found no sustained sync. BMC predicts that sustained gamma sync should not exist as a binding mechanism — this is not a coverage issue.
R7 (onset ignition). In BMC, this is a standard spreading activation cascade: a stimulus activates a cell assembly $\to$ if $a_{hub} > \theta_{act}$ $\to$ cascade activation through memplex hubs $\to$ the wave reaches PFC (WM-pointer allocation). The transient onset activation in PFC (200–800 ms) = the time for WM-pointer allocation + I-filtration. No special GNW “ignition” mechanism is required.
R8, R9 (PFC: weak connectivity + reduced accuracy when added). PFC encodes information orthogonal to content: SMC signals (self-referential processing), I-filtration state (accept/reject), WM-pointer metadata (which meme is currently in focus, when allocated). Adding these signals to a content decoder = adding noise orthogonal to the feature space. Fronto-posterior connectivity exists (pointer $\leftrightarrow$ cell assembly) but is weak and category-level: a routing tag is transmitted, not the full feature vector.
Conclusion
BMC retrodicts 9/9 COGITATE results (3/3 across protocol domains). Correct predictions by IIT and GNW are inherited by BMC through subsumption (Part XVII), and their failures are resolved by the full BMC = (G, M, I, S) model. Three aspects constitute unique BMC retrodictions: (1) pointer vs content in PFC (R2, R8, R9); (2) theta-phasic binding instead of sustained gamma sync (R6); (3) hub-centrality decay gradient (R4 — with the caveat regarding the feature $\to$ position mapping).
Methodological caveat. All 9 retrodictions are post hoc analyses: BMC was not pre-registered in the COGITATE protocol. The value of retrodiction is lower than the value of prediction, since a flexible formalism with many degrees of freedom can accommodate diverse results. The critical test for BMC is prospective prediction: predictions formulated before the experiment (see Block 3: 56 predictions, 24 testable without a laboratory).
Sources: COGITATE Consortium (2025). Nature, 642(8066), 133–142; Naccache L. et al. (2025). Neuroscience of Consciousness, niaf037; Ferrante O. et al. (2023). PLOS ONE, 18(2): e0268577; Casali A. et al. (2013). Science Translational Medicine; Collins A.M. & Loftus E.F. (1975). “A spreading-activation theory of semantic processing”. Psychological Review.
8-Course Cross-Analysis Updates
Spreading Activation ≡ Loopy Belief Propagation (MED-HIGH)
BMC spreading activation is formally equivalent to loopy belief propagation (BP) on a weighted factor graph. The equivalence is exact on tree-structured subgraphs and approximate on graphs with cycles. Convergence speed is inversely proportional to cycle density. The small-world regime $\sigma_{SW} \approx 1$ corresponds to the optimal BP convergence zone — enough long-range shortcuts for message propagation, not enough cycles for oscillation.
PPR (Personalized PageRank) teleportation prevents signal loss in large graphs:
$$a_i(t+1) = \beta \sigma\left(\sum_j w_{ij} a_j / d_j\right) + (1-\beta) G_{rel}(i)$$Sources: MacKay (information theory, belief propagation), Leskovec (graph algorithms).
Learning Rules Overhaul: STDP + TD Error (HIGH)
Replaces the generic Hebbian rule with a formal three-tier hierarchy:
- STDP: $\Delta w = A_+ e^{-\Delta t/\tau_+} - A_- e^{-|\Delta t|/\tau_-}$ — introduces directional causality into edge formation
- TD error as learning signal: $\delta_{BMC} = \sum_g w_g a_g(t+1) + \gamma V(S_{t+1}) - V(S_t)$
- Critic (2-factor): $\Delta w_{gi} = \alpha_c \delta \psi_i$ (ventral striatum analogue)
- Actor (3-factor): $\Delta w_{ij} = \alpha_a \delta \psi_i (a_j - \bar{a}_j)$ (dorsal striatum analogue)
The TD error $\delta_{BMC}$ determines whether an edge is strengthened or weakened based on whether co-activation led to a positive or negative outcome. This replaces “fire together, wire together” with “fire together, wire together if it was useful.”
Sources: Gerstner (neuronal dynamics), Sutton & Barto (reinforcement learning).
Value Function over BMC States (HIGH)
$$V_{BMC}(M_t) = E\left[\sum_{k=0}^{\infty} \gamma^k G_{reward}(t+k+1) \;\middle|\; M_t\right]$$Learned via temporal-difference updates. $V$ is encoded in the meme network itself: high-$V$ memes = “promising” configurations that are preferentially activated via SEEKING. This was a missing mechanism in v1 — the theory described what agents learn, but not how they evaluate the expected future value of their current state.
Source: Sutton & Barto (reinforcement learning).
MI Edge Interpretation (HIGH)
$$|w_{ij}| \propto I(a_i; a_j) = H(a_i) + H(a_j) - H(a_i, a_j)$$Edge weight = mutual information in bits. This gives a principled pruning criterion: remove edge if $I < \theta_{prune}$ bits (the two memes are informationally independent). Hub identity becomes quantifiable: hub = node maximizing $\sum_j I(m_i; m_j)$.
Source: MacKay, Stone (information theory).
Oversmoothing = Rigidity Metric (HIGH)
$$Rigidity(t) = 1 - \frac{Var(\{a_i(t)\})}{Var(\{a_i(0)\})}$$Excessive spreading activation → all activation values converge → loss of differentiation between memes. This is the direct analog of GNN oversmoothing (Oono & Suzuki, 2020). In cognitive terms: aging ($Q$ ↑, $Rigidity$ ↑), groupthink (all agents converge to same activation pattern), rigid personality (high modularity prevents new integration).
Source: Leskovec (graph neural networks).
RGCN Heterogeneous Edges (HIGH)
$$a_i(t+1) = \sigma\left(\sum_r \sum_j W_r a_j / c_{i,r} + B a_i\right)$$Different edge types require different weight matrices: semantic ($W_{sem}$), causal ($W_{caus}$), temporal ($W_{temp}$), inhibitory ($W_{inh}$), G-connected ($W_G$). This replaces the single-weight-matrix assumption and captures the empirical fact that “A causes B” and “A resembles B” are fundamentally different relationships requiring different propagation rules.
Source: Leskovec (relational graph neural networks).
WTA Dynamics (MED-HIGH)
$$\tau \frac{dh_k}{dt} = -h_k + w_0 g(h_k) - \alpha \sum_{j \neq k} g(h_j) + R I_k$$Replaces the static top-$k$ selection with a dynamic winner-take-all circuit. Self-excitation + mutual inhibition → one winner emerges from competition. This provides the mechanistic basis for attention as a dynamic process rather than a static filter.
Source: Gerstner (neuronal dynamics).
New Network Metrics
| Metric | Formula | Meaning |
|---|---|---|
| Lyapunov $\lambda_{BMC}$ | $\lim \frac{1}{\Delta t} \ln \frac{\|\delta a(t+\Delta t)\|}{\|\delta a(t)\|}$ | Chaos measure: $\lambda > 0$ chaotic, $\approx 0$ edge of chaos |
| Effective complexity $C_\mu$ | $-\sum_\mathcal{S} P(\mathcal{S}) \log P(\mathcal{S})$ | Causal state entropy; $CL \propto C_\mu$ at $\sigma \approx 1$ |
| Effective connectivity $K_{eff}$ | $\sum k_i | f’(a_i) |
| E/I balance | $I_{total} = J_E A_E + J_I A_I + I^{ext}$ | Balanced → $\sigma \approx 1$; imbalance → epilepsy/coma |
CL as Graph Component Property (MED)
$$CL_{system} = \max_k CL_{component}(C_k)$$Split-brain patients demonstrate that consciousness level is a per-connected-component property, not a whole-brain property. Each connected component maintains independent CL. DID (Dissociative Identity Disorder) = quasi-independent sub-CLs at high modularity $Q$. This decomposition resolves the “unity of consciousness” debate: unity is a network property, not a metaphysical given.
Source: Gazzaniga (split-brain studies).
ERP Predictions (MED)
Electroencephalographic event-related potentials map to specific BMC mechanisms:
| ERP Component | BMC Mechanism | Prediction |
|---|---|---|
| N400 | $PE_{semantic}$ | Amplitude ∝ semantic prediction error |
| P600 | $SIT_{syntactic}$ | Amplitude ∝ syntactic gap tension |
| ERN | $Conflict(t)$ | Amplitude ∝ response conflict |
Source: Gazzaniga (cognitive neuroscience).
Dual-Route Sub-θ Processing (MED-HIGH)
$$influence^{total} = influence^{conscious} + influence^{subliminal}$$Route 2 (subliminal): $\gamma_{sub} \cdot a_i \cdot f_{route2}(m_i)$ where $f_{route2}$ = 1.0 for G-connected memes, 0.3 for semantic, 0 for abstract. Formalizes subliminal influence, implicit bias, and blindsight.
Source: Gazzaniga (cognitive neuroscience).
Overlapping Communities / BigCLAM (MED)
$$Subpersonality(c) = \{m : F_{m,c} > \theta\}$$Memes belong to multiple subpersonalities — overlap is normal, not pathological. This captures the empirical fact that people behave differently in different social contexts (professional, family, peer) without requiring separate “selves.”
Source: Leskovec (overlapping community detection).
Options ≡ Automatization (MED)
$$\omega_k = (\pi_{\omega_k}, \beta_{\omega_k})$$An option = a policy + termination condition. BMC chunk (automatized routine) = option. The learning trajectory Raw → Chunked → Automatized = learning increasingly abstract options. Bellman equation with options converges faster via temporal abstraction — formalizing why habits free up cognitive resources.
Source: Sutton & Barto (options framework).
Conclusion: From Description to Prediction
Network formalization gives meme theory what it lacked:
| Before | After |
|---|---|
| “The meme is strong” | $C_{eigenvector}(i) = 0.73$ |
| “Clusters of memes” | Modularity $Q = 0.42$ |
| “A meme is activated” | $a_i(t) > \theta$ |
| “The memplex defends itself” | $S(X) < \theta \Rightarrow$ rejection |
| “Stress causes regression” | $a_{k-core} \uparrow$ under stress |
| “Different levels of consciousness” | $CL = \sigma_{SW} \cdot A_{SMC} \cdot f(Balance)$ |
| “Experience is perceived as unified” | $Binding = struct \cdot temp \cdot comp$ |
| “Suddenly changed their mind” | Sign inversion: $\sum w_{pos} > |
| “The ideology split” | $Q > Q_{crit} \to$ splitting |
| “Similar ideas surface together” | $a_j^{diff} = \lambda_{diff} \sum a_i \cdot \text{prox}(i,j)$ |
| “A new idea appeared from nowhere” | $\rho(\mathbf{x}^*, t) > \theta_{crystal} \to$ crystallization |
| “Where does the first meme come from” | $PE > \theta_{PE} \wedge G_{rel} > \theta_G \to$ memogenesis |
| “A meme was transmitted from one person to another” | $s_A(t) = \text{encode}(m_{top}, \text{Mod}_A)$; $F_{trans} < 1$ |
| “Consciousness requires limitations” | $C_{max} \to \infty \Rightarrow SIT \to 0 \Rightarrow SMC^{(2)} = 0$ |
| “Feral children do not develop” | $ |
Key result: Now it is possible to:
- Measure — network metrics are computable
- Compare — different memplexes have different parameters
- Predict — if $Q > 0.4$, we expect rigidity
- Verify — predictions are falsifiable
This is the transition from philosophy to science.
Appendix: Key Formulas
Node Metrics
| Metric | Formula |
|---|---|
| Degree centrality | $C_D(i) = \frac{k_i}{n-1}$ |
| Betweenness centrality | $C_B(v) = \sum_{s \neq v \neq t} \frac{\sigma_{st}(v)}{\sigma_{st}}$ |
| Eigenvector centrality | $\lambda x_i = \sum_j A_{ij}x_j$ |
| Closeness centrality | $C_C(i) = \frac{n-1}{\sum_j d(i,j)}$ |
Network Metrics
| Metric | Formula |
|---|---|
| Modularity | $Q = \frac{1}{2m}\sum_{ij}\left[A_{ij} - \frac{k_ik_j}{2m}\right]\delta(c_i, c_j)$ |
| Average path length | $\ell = \frac{1}{n(n-1)}\sum_{i \neq j} d(i,j)$ |
| Clustering coefficient | $C = \frac{1}{n}\sum_i \frac{2e_i}{k_i(k_i-1)}$ |
Dynamics
| Process | Formula |
|---|---|
| Spreading activation | $a_i(t+1) = f\left((1-\lambda) \cdot a_i(t) + \sum_j w_{ij} a_j(t)\right)$ |
| Lateral inhibition | $a_i(t+1) = \max\left(0, a_i(t) - \alpha\sum_{j \neq i} a_j(t)\right)$ |
| Softmax normalization (alt.) | $a_i(t+1) = \frac{e^{a_i(t)/T}}{\sum_j e^{a_j(t)/T}}$ |
| Epidemic threshold | $R_0 = \frac{\beta}{\gamma}\langle k \rangle > 1$ |
| Threshold for finite networks | $\lambda_c \sim N^{-(3-\gamma)/(\gamma-1)}$ |
| Compatibility | $S(X) = \sum_i C(i) \cdot compat(X, i)$ |
Heterogeneous Networks with Hubs
| Metric | Formula |
|---|---|
| Degree distribution (power law) | $P(k) \sim k^{-\gamma}$ (idealization; in practice: log-normal, truncated power law, etc.) |
| Preferential attachment | $\Pi(k_i) = k_i / \sum_j k_j$ |
| Epidemic threshold | $\lambda_c = \langle k \rangle / \langle k^2 \rangle$ |
| Hub dominance | $H = k_{max} / \langle k \rangle$ |
| Connection loss under competition | $\Delta k_i = -\beta \cdot (k_j - k_i) / \sum_m k_m$ |
| Percolation threshold (targeted attack) | $f_c = 1 - 1/(\kappa - 1)$, where $\kappa = \langle k^2 \rangle / \langle k \rangle$ |
Small-World Networks
| Metric | Formula |
|---|---|
| Small-worldness | $\sigma = (C/C_{random}) / (L/L_{random})$ |
| Small-world condition | $\sigma > 1$ |
Signed Edges and Structural Balance
| Metric | Formula |
|---|---|
| Signed weights | $w_{ij} \in [-1, +1]$ |
| Asymmetric decay | $\lambda_{neg} < \lambda_{pos}$ |
| BLEND (recombination) | $m_{new} = blend(m_i, m_j)$: $E_{new} = \{(m_{new}, n) : n \in N(m_i) \cup N(m_j)\}$ |
| Structural balance | All cycles contain an even number of $E^-$ |
| Ambivalence | $Ambivalence(m) = \sqrt{\frac{1}{\|N(m)\|}\sum_{j \in N(m)} (w_{mj} - \bar{w}_m)^2}$ |
| Incompatibility | $incompat(i,j) = \max(0, -w_{ij})$ |
Assortativity and Motifs
| Metric | Formula |
|---|---|
| Assortativity coefficient | $r = \frac{\sum_{jk} jk(e_{jk} - q_j q_k)}{\sigma_q^2}$ |
| Complex contagion threshold | $P(\text{adoption}) = 1$ if $\frac{ |
Multilayer Networks
| Metric | Formula |
|---|---|
| Adjacency tensor | $\mathcal{A}_{ij}^{[\alpha\beta]}$ |
| Multilayer spreading | $a_i^{[\alpha]}(t+1) = f\left(\sum_j \mathcal{A}_{ij}^{[\alpha\alpha]} a_j^{[\alpha]} + \sum_{\beta \neq \alpha} \mathcal{A}_{ii}^{[\alpha\beta]} a_i^{[\beta]}\right)$ |
Consciousness Level (CL)
| Metric | Formula |
|---|---|
| Consciousness Level | $CL(t) = \sigma_{SW}(t) \cdot A_{SMC}(t) \cdot f(Balance(t))$ |
| SMC activity | $A_{SMC} = \frac{1}{ |
| Balance modulator | $f(Balance) = \exp\left(-\frac{(Balance - 1.5)^2}{2 \cdot 0.49}\right)$ |
Triple Binding
| Metric | Formula |
|---|---|
| Structural binding | $Binding_{struct}(m_i, m_j) = |
| Temporal binding | $Binding_{temp}(m_i, m_j) = \mathbb{1}[ |
| Competitive binding | $Binding_{comp}(Coalition) = \mathbb{1}[\max_{i,j} D(m_i, m_j) < \theta_D]$ |
| Full binding | $Binding = Binding_{struct} \cdot Binding_{temp} \cdot Binding_{comp}$ |
Sign Inversion and Q Dynamics
| Metric | Formula |
|---|---|
| Sign inversion | Trigger: $\sum_{k \in N(m)} w_{mk} \cdot w_{kX} \cdot a_k > |
| Q dynamics (splitting) | $dQ/dt = \alpha \cdot D(P, Reality) - \beta \cdot \sum_{cross} w_{ij}$ |
| Q dynamics (merging) | $dQ_{AB}/dt = -\gamma \cdot [D(A,R) + D(B,R) - D(A \cup B, R)]$ |
Diffusion Engine and Embedding Space
| Metric | Formula |
|---|---|
| Proximity | $\text{prox}(i,j) = \exp(-\|\mathbf{e}_i - \mathbf{e}_j\|^2 / 2\sigma_{emb}^2)$ |
| Diffusion activation | $a_j^{diff}(t) = \lambda_{diff}(t) \cdot \sum_{i:\,(i,j) \notin E,\; \text{top-}K} a_i(t) \cdot \text{prox}(i,j)$ |
| Combined activation | $a_i(t+1) = \text{clip}[a_i^{graph}(t+1) + a_i^{diff}(t) - \lambda_{decay} \cdot a_i(t),\; 0,\; 1]$ |
| Density field | $\rho(\mathbf{x}, t) = \sum_i a_i(t) \cdot K(\|\mathbf{x} - \mathbf{e}_i\|)$ |
| Crystallization | $\rho(\mathbf{x}^*, t) > \theta_{crystal}$ AND $\nexists m_i : \|\mathbf{e}_i - \mathbf{x}^*\| < r_{min}$ |
| Embedding drift | $\mathbf{e}_i(t) = \mathbf{e}_i(0) + \Delta\mathbf{e}_i(t)$, $\|\Delta\mathbf{e}_i\| \leq r_{drift}$ |
Memogenesis
| Metric | Formula |
|---|---|
| Feature extraction | $\mathbf{f}(t) = \text{extract}(S_{raw}(t))$ |
| Prediction Error | $PE(t) = \|\mathbf{f}(t) - \hat{\mathbf{f}}(t)\|$ |
| G-relevance | $G_{rel}(t) = \max_{g \in G_{active}} a_g(t) \cdot \text{rel}(g, \mathbf{f}(t))$ |
| Gate (Path 1) | $PE > \theta_{PE} \;\wedge\; G_{rel} > \theta_G$ |
| Contextual binding | $\forall m_i \in WM(t): w_{i,new} = \alpha_{bind} \cdot a_i(t)$ |
Inter-Agent Exchange, Scarcity, and Critical Periods
| Metric | Formula |
|---|---|
| Transmission signal | $s_A(t) = \text{encode}(m_{top}, \text{Mod}_A(t))$ |
| Transmission fidelity | $F_{trans}(A \to B) = \cos(\mathbf{e}_{m_A}, \mathbf{e}_{m_B^{new}}) < 1$ |
| Environmental energy | $\text{energy}(t+1) = \text{energy}(t) - c_{step} + \text{intake}(t)$ |
| Computational budget | $C_{step} = C_{graph} + C_{mod} + C_{diff} \leq C_{max}$ |
| WM from budget | $k_{active} = f(C_{max}, c_{per\_node})$ |
| Scarcity $\to$ consciousness | $C_{max} \to \infty \Rightarrow SIT \to 0 \Rightarrow SMC^{(2)} = 0$ |
| Plasticity | $\lambda_{plast}(t) = \lambda_{plast}^{max} \cdot \exp(-(t - t_{peak})^2 / 2\tau_{plast}^2) + \lambda_{plast}^{base}$ |
| S-bandwidth | $S_{bw}(t) = S_{bw}^{max} \cdot \sigma((t - t_{mid}^S) / \tau_S)$ |
Key Works
Foundational Works in Network Science
| Concept | Authors | Journal | Year |
|---|---|---|---|
| Scale-free networks | Barabasi A.-L., Albert R. | Science | 1999 |
| Small-world networks | Watts D.J., Strogatz S.H. | Nature | 1998 |
| Modularity | Newman M.E.J. | PNAS | 2006 |
| Propagation in networks | Pastor-Satorras R., Vespignani A. | Physical Review Letters | 2001 |
| Multilayer networks | Kivela M. et al. | J. Complex Networks | 2014 |
| Spreading activation | Collins A.M., Loftus E.F. | Psychological Review | 1975 |
| Adaptive networks | Gross T., Blasius B. | J. Royal Society Interface | 2008 |
Methodology
| Concept | Authors | Journal | Year |
|---|---|---|---|
| Power-law fitting | Clauset A., Shalizi C.R., Newman M.E.J. | SIAM Review | 2009 |
| Scale-free critique | Broido A.D., Clauset A. | Nature Communications | 2019 |
| Zipf-polylog | Lee S. et al. | Statistics | 2024 |
Contemporary Reviews (2024–2026)
| Topic | Authors/Source | Journal | Year |
|---|---|---|---|
| Percolation in complex networks | — | Nature Reviews Physics | 2024 |
| Small-world in the brain | Han J. et al. | Brain-X | 2025 |
| Cognitive multiplex networks | — | Behavior Research Methods | 2025 |
| Rank-based assortativity | — | J. Complex Networks | 2025 |
| Localized spreading | Tey R., Cozzo E. | — | 2024 |
Signed Networks, Meme Synthesis, Negativity Bias
| Concept | Authors | Journal | Year |
|---|---|---|---|
| Structural balance (P-O-X) | Heider F. | The Journal of Psychology | 1946 |
| Signed graphs, Structure Theorem | Cartwright D., Harary F. | Psychological Review | 1956 |
| Weak balance | Davis J.A. | Human Relations | 1967 |
| CAN model (attitudes as signed networks) | Dalege J., Borsboom D. et al. | Psychological Methods | 2016 |
| Belief system as a signed network | Brandt M.J., Sleegers W.W.A. | J. Experimental Social Psychology | 2021 |
| Negative priming | Tipper S.P. | Quarterly J. Experimental Psychology | 1985 |
| Retrieval-induced forgetting | Anderson M.C., Bjork R.A., Bjork E.L. | J. Experimental Psychology: Learning, Memory, and Cognition | 1994 |
| Negativity bias | Baumeister R.F. et al. | Review of General Psychology | 2001 |
| Sleeper effect | Hovland C.I. et al. | Princeton University Press | 1949 |
| Sleeper effect (meta-analysis) | Kumkale G.T., Albarracin D. | Psychological Bulletin | 2004 |
| Cognitive dissonance | Festinger L. | Stanford University Press | 1957 |
| Conceptual blending | Fauconnier G., Turner M. | Basic Books | 2002 |
Consciousness, Binding, Levels of Consciousness
| Concept | Authors | Journal | Year |
|---|---|---|---|
| PCI (Perturbational Complexity Index) | Massimini M. et al. | Science | 2005 |
| Entropic brain hypothesis | Carhart-Harris R.L. et al. | Frontiers in Human Neuroscience | 2014 |
| Neuronal communication through coherence | Fries P. | Trends in Cognitive Sciences | 2005 |
| True neural stance on consciousness | Lamme V.A.F. | Trends in Cognitive Sciences | 2006 |
| Theta-gamma neural code | Lisman J., Jensen O. | Neuron | 2013 |
| COGITATE (testing theories of consciousness) | COGITATE Consortium | Nature | 2025 |
Diffusion, Embedding, Crystallization
| Concept | Authors | Journal | Year |
|---|---|---|---|
| Semantic priming | Meyer D.E., Schvaneveldt R.W. | J. Experimental Psychology | 1971 |
| Memory reconsolidation | Nader K., Schafe G.E., LeDoux J.E. | Nature | 2000 |
| Volume transmission | Agnati L.F. et al. | Progress in Neurobiology | 2010 |
| Creativity and psychedelics | Mason N.L. et al. | Translational Psychiatry | 2021 |
| Emotions and episodic encoding | Dolcos F., LaBar K.S., Cabeza R. | Neuron | 2004 |
Inter-Agent Exchange, Scarcity, Critical Periods
| Concept | Authors | Journal | Year |
|---|---|---|---|
| Meme as a unit of cultural replication | Dawkins R. | The Selfish Gene (book) | 1976 |
| Critical periods of neuroplasticity | Hensch T.K. | Nature Reviews Neuroscience | 2005 |
| Biological foundations of language | Lenneberg E.H. | Biological Foundations of Language (book) | 1967 |
Recommended Textbooks
- Newman M.E.J. (2018). Networks: An Introduction. 2nd ed. Oxford University Press.
- Barabasi A.-L. (2016). Network Science. Cambridge University Press. [Available online: networksciencebook.com]
- Menczer F., Fortunato S., Davis C.A. (2020). A First Course in Network Science. Cambridge University Press.