Drug Discovery

In one sentence: The same engine that models consciousness discovers drug candidates — molecular fragments become “memes,” cultural evolution replaces random search, and each generation builds on the accumulated chemical knowledge of the previous one.

Theory sources: BM (meme dynamics), EMT (cultural accumulation), SM (cumulative cultural evolution, ratchet effect), AGI_F (multi-agent stigmergy)

Implementation: bmc-engine/bmc-discovery/ (Rust core + Python chemistry bridge + RDKit)

The Core Idea: One Engine, Any Domain

BMC claims its architecture is domain-agnostic: change the environment, and the same engine solves a different problem. Drug discovery is the strongest test of this claim.

BMC component	In a mind	In drug discovery
Drives	Curiosity, care, fear	Explore new molecules, exploit good scaffolds
Memes	Beliefs, skills, memories	Molecular fragments
Belief network	Worldview, personality	Assembled molecule
Filter	“Does this make sense?”	“Is this molecule valid and useful?”
Hardware	Brain	Chemistry toolkit (RDKit)
Cultural memory	Shared knowledge repository	Fragment library (what worked before)

No domain-specific architecture changes. The same code. Just a different environment and scoring function.

How It Works

The Agent Loop

Each BMC agent works like a molecular chemist with access to a shared library:

graph LR S["Explore
Try new
fragment combos"] --> MUT["Mutate
Swap atoms,
change rings"] MUT --> ASS["Assemble
Build a
molecule"] ASS --> EVAL["Test
Score against
target"] EVAL --> I{"Passes
filter?"} I -->|"Yes"| LIB["Share
Donate to
shared library"] I -->|"No"| S LIB --> CARE["Consolidate
Reinforce
good fragments"] CARE --> S style S fill:#2a2a1e,stroke:#ffd700,color:#ffd700 style MUT fill:#1a1a2e,stroke:#6af,color:#6af style ASS fill:#0d2a1a,stroke:#34d399,color:#34d399 style EVAL fill:#2a1a0d,stroke:#f80,color:#f80 style I fill:#2a0d0d,stroke:#f66,color:#f66 style LIB fill:#1a1a2e,stroke:#6af,color:#6af style CARE fill:#0d2a1a,stroke:#34d399,color:#34d399

Four Ways to Modify Molecules

Strategy	What it does	BMC analog
Atom swap	Replace one atom in a fragment	Point mutation
Ring replacement	Substitute the ring system	Structural mutation
Group addition	Attach a functional group	Meme extension
Crossover	Combine parts of two molecules	BLEND recombination

The Ratchet: Each Generation Builds on the Last

The critical innovation: agents don’t start from scratch each cycle. A shared knowledge store accumulates everything that worked:

graph TD C1["Cycle 1
Random exploration
Find first good fragments"] --> KS["Knowledge Store
6,000–7,000 fragments
Co-success bindings
Top fragment library"] C1 --> KS KS --> C5["Cycle 5
Build on known scaffolds
Targeted improvement"] C5 --> KS KS --> C10["Cycle 10
Near-optimal molecules
Refined combinations"] style C1 fill:#2a0d0d,stroke:#f66,color:#f66 style KS fill:#2a2a1e,stroke:#ffd700,color:#ffd700 style C5 fill:#2a1a0d,stroke:#f80,color:#f80 style C10 fill:#0d2a1a,stroke:#34d399,color:#34d399

Fragment registry: Tracks which fragments have been tried and how they performed
Co-success binding: Fragments that appear together in high-scoring molecules get linked (like memes that reinforce each other)
Cultural library: Top-performing fragments are shared across all agents
Domain isolation: Different target types get separate libraries (kinase fragments for kinase targets, GPCR fragments for GPCR targets)

This is the Super-Ratchet Effect — each generation builds on accumulated knowledge, and no information is lost between cycles.

Results

Tested on 4 standard pharmaceutical benchmarks (Therapeutics Data Commons):

Target	What it is	Best score	Progress from start
QED	Drug-likeness (how “drug-like” is the molecule?)	0.947	99.6% of theoretical maximum
GSK3-beta	Alzheimer’s disease target (kinase)	0.960	+31.5% from baseline
JNK3	Neurodegeneration target (kinase)	0.740	+8.8% from baseline
DRD2	Dopamine receptor (Parkinson’s, schizophrenia)	0.997	+384% from baseline

The Ratchet in Action

Watch how scores improve as the cultural library grows:

Cycle	QED	GSK3-beta	DRD2
1 (fresh start)	0.880	0.730	0.206
5 (library growing)	0.920	0.890	0.950
10 (mature library)	0.947	0.960	0.997

The improvement accelerates as the library grows. DRD2 is the most dramatic: from 0.206 (barely above random) to 0.997 (near-perfect) in 10 cycles.

What These Numbers Mean

QED 0.947: The discovered molecule scores near the theoretical maximum for drug-likeness properties (Lipinski rules, solubility, molecular weight, etc.)
DRD2 0.997: 99.7% predicted binding to the dopamine D2 receptor — the agent learned to construct molecular scaffolds with near-perfect complementarity
GSK3-beta 0.960: High predicted activity against a validated Alzheimer’s target

BMC Mechanisms at Work

Curiosity = Exploration

The SEEKING drive controls the mutation rate:

Early cycles: High mutation (explore broadly across chemical space)
Later cycles: Lower mutation (exploit known good scaffolds)
Gap detection: Missing patterns in fragment co-occurrence trigger targeted exploration

Filter = Chemical Validity

Two-level quality control:

Can this molecule physically exist? (valence rules, ring stability, synthetic accessibility)
Does it score above the threshold? (oracle score check)

Fragments that fail repeatedly get downweighted — the system learns to avoid unproductive chemical motifs.

Care = Knowledge Consolidation

High-scoring fragments get reinforced:

Fragments that co-occur in good molecules become linked
Top assemblies are decomposed and their fragments shared with everyone
The knowledge store persists between cycles — nothing is lost

BLEND = Fragment Crossover

The crossover strategy is BLEND: take sub-structures from two different high-scoring fragments and combine them into something new. The molecular version of “What if we combine feature A from drug X with feature B from drug Y?”

How It Compares

BMC Discovery is competitive with state-of-the-art molecular optimization methods (REINVENT, GraphGA, GFlowNets) on standard TDC oracle benchmarks — without any pretraining or training data.

What matters	Neural methods (REINVENT, etc.)	BMC Discovery
Training data	Pretrained on ZINC (250K+ molecules)	None
Score quality	Competitive (Top-1 near oracle max)	Competitive (Top-1 near oracle max)
Knowledge	Implicit (hidden in network weights)	Explicit (inspectable fragment library)
Transfer	Retrain from scratch per target	Direct — transfer library across targets
Explainability	Black box	Full — every fragment traceable
Cumulative learning	No (each run independent)	Yes — each cycle builds on the last

The key insight: neural methods and BMC reach similar score ranges, but BMC does it through explicit cultural accumulation (Super-Ratchet) rather than gradient-based pretraining. The knowledge is inspectable, transferable, and compounds across projects.

What Makes This Different from Existing Approaches

Approach	Cultural memory?	Knowledge inspectable?
Genetic algorithms	None (restart each run)	N/A
REINVENT (neural)	Implicit (hidden in weights)	No
Graph-based GA	None	N/A
BMC Discovery	Explicit (fragment library)	Yes (can inspect and transfer)

The critical difference: BMC makes accumulated knowledge explicit and inspectable. You can look inside the library, see which fragments are working, and understand why a molecule was designed the way it was.

The Domain-Agnostic Test

The same codebase handles all 4 targets with only configuration changes:

Setting	QED	GSK3-beta	JNK3	DRD2
Agents	50	50	50	50
Architecture	Identical	Identical	Identical	Identical
Target	Drug-likeness	Kinase	Kinase	GPCR
Fragment seeds	General	Kinase scaffolds	Kinase scaffolds	GPCR scaffolds

One engine, four targets, competitive results on all of them. This is the BMC domain-agnostic principle in action.

Testable Predictions

#	Prediction	How to test
P-DD1	Cultural library produces monotonically improving results across cycles	Ablation: with vs. without library persistence
P-DD2	Separate libraries per target type outperform shared libraries on dissimilar targets	Cross-domain ablation
P-DD3	Fragment co-success binding predicts scaffold compatibility on novel targets	Train binding on one target, test on another
P-DD4	Curiosity-modulated mutation rate outperforms fixed-rate mutation	Rate schedule ablation
P-DD5	The same architecture achieves competitive results on any pharmaceutical target	Extend to novel targets without code changes

Formalization

For readers interested in the mathematical treatment:

Super-Ratchet Effect:

K_{t+1} = K_t + innovation + \sum_i merge\_bonus(K_i), \quad loss = 0

Each cycle adds knowledge; nothing is lost between generations.

Domain-agnostic mapping:

BMC	Consciousness	Drug discovery
$$G$	Panksepp drives	SEEKING (mutate), CARE (consolidate)
$$M$	Meme graph	Fragment registry
$$I$	Coherence filter	Oracle + validity check
$$S$	Neural substrate	RDKit + TDC oracle

Full formal treatment: SM Parts IV–V, BM Part IV, AGI_F Part IV.

Interested in drug discovery with BMC?

We're looking for pharma partners and wet lab validation collaborators.

Get in Touch

Back to: Solutions Overview | Related: Financial Markets (same engine, different domain) | Theory: Swarm Memetics (cultural ratchet)