Drug Discovery

In one sentence: The same engine that models consciousness discovers drug candidates — molecular fragments become “memes,” cultural evolution replaces random search, and each generation builds on the accumulated chemical knowledge of the previous one.

Theory sources: BM (meme dynamics), EMT (cultural accumulation), SM (cumulative cultural evolution, ratchet effect), AGI_F (multi-agent stigmergy)

Implementation: bmc-engine/bmc-discovery/ (Rust core + Python chemistry bridge + RDKit)


The Core Idea: One Engine, Any Domain

BMC claims its architecture is domain-agnostic: change the environment, and the same engine solves a different problem. Drug discovery is the strongest test of this claim.

BMC componentIn a mindIn drug discovery
DrivesCuriosity, care, fearExplore new molecules, exploit good scaffolds
MemesBeliefs, skills, memoriesMolecular fragments
Belief networkWorldview, personalityAssembled molecule
Filter“Does this make sense?”“Is this molecule valid and useful?”
HardwareBrainChemistry toolkit (RDKit)
Cultural memoryShared knowledge repositoryFragment library (what worked before)

No domain-specific architecture changes. The same code. Just a different environment and scoring function.


How It Works

The Agent Loop

Each BMC agent works like a molecular chemist with access to a shared library:

graph LR S["Explore
Try new
fragment combos"] --> MUT["Mutate
Swap atoms,
change rings"] MUT --> ASS["Assemble
Build a
molecule"] ASS --> EVAL["Test
Score against
target"] EVAL --> I{"Passes
filter?"} I -->|"Yes"| LIB["Share
Donate to
shared library"] I -->|"No"| S LIB --> CARE["Consolidate
Reinforce
good fragments"] CARE --> S style S fill:#2a2a1e,stroke:#ffd700,color:#ffd700 style MUT fill:#1a1a2e,stroke:#6af,color:#6af style ASS fill:#0d2a1a,stroke:#34d399,color:#34d399 style EVAL fill:#2a1a0d,stroke:#f80,color:#f80 style I fill:#2a0d0d,stroke:#f66,color:#f66 style LIB fill:#1a1a2e,stroke:#6af,color:#6af style CARE fill:#0d2a1a,stroke:#34d399,color:#34d399

Four Ways to Modify Molecules

StrategyWhat it doesBMC analog
Atom swapReplace one atom in a fragmentPoint mutation
Ring replacementSubstitute the ring systemStructural mutation
Group additionAttach a functional groupMeme extension
CrossoverCombine parts of two moleculesBLEND recombination

The Ratchet: Each Generation Builds on the Last

The critical innovation: agents don’t start from scratch each cycle. A shared knowledge store accumulates everything that worked:

graph TD C1["Cycle 1
Random exploration
Find first good fragments"] --> KS["Knowledge Store
6,000–7,000 fragments
Co-success bindings
Top fragment library"] C1 --> KS KS --> C5["Cycle 5
Build on known scaffolds
Targeted improvement"] C5 --> KS KS --> C10["Cycle 10
Near-optimal molecules
Refined combinations"] style C1 fill:#2a0d0d,stroke:#f66,color:#f66 style KS fill:#2a2a1e,stroke:#ffd700,color:#ffd700 style C5 fill:#2a1a0d,stroke:#f80,color:#f80 style C10 fill:#0d2a1a,stroke:#34d399,color:#34d399
  • Fragment registry: Tracks which fragments have been tried and how they performed
  • Co-success binding: Fragments that appear together in high-scoring molecules get linked (like memes that reinforce each other)
  • Cultural library: Top-performing fragments are shared across all agents
  • Domain isolation: Different target types get separate libraries (kinase fragments for kinase targets, GPCR fragments for GPCR targets)

This is the Super-Ratchet Effect — each generation builds on accumulated knowledge, and no information is lost between cycles.


Results

Tested on 4 standard pharmaceutical benchmarks (Therapeutics Data Commons):

TargetWhat it isBest scoreProgress from start
QEDDrug-likeness (how “drug-like” is the molecule?)0.94799.6% of theoretical maximum
GSK3-betaAlzheimer’s disease target (kinase)0.960+31.5% from baseline
JNK3Neurodegeneration target (kinase)0.740+8.8% from baseline
DRD2Dopamine receptor (Parkinson’s, schizophrenia)0.997+384% from baseline

The Ratchet in Action

Watch how scores improve as the cultural library grows:

CycleQEDGSK3-betaDRD2
1 (fresh start)0.8800.7300.206
5 (library growing)0.9200.8900.950
10 (mature library)0.9470.9600.997

The improvement accelerates as the library grows. DRD2 is the most dramatic: from 0.206 (barely above random) to 0.997 (near-perfect) in 10 cycles.

What These Numbers Mean

  • QED 0.947: The discovered molecule scores near the theoretical maximum for drug-likeness properties (Lipinski rules, solubility, molecular weight, etc.)
  • DRD2 0.997: 99.7% predicted binding to the dopamine D2 receptor — the agent learned to construct molecular scaffolds with near-perfect complementarity
  • GSK3-beta 0.960: High predicted activity against a validated Alzheimer’s target

BMC Mechanisms at Work

Curiosity = Exploration

The SEEKING drive controls the mutation rate:

  • Early cycles: High mutation (explore broadly across chemical space)
  • Later cycles: Lower mutation (exploit known good scaffolds)
  • Gap detection: Missing patterns in fragment co-occurrence trigger targeted exploration

Filter = Chemical Validity

Two-level quality control:

  1. Can this molecule physically exist? (valence rules, ring stability, synthetic accessibility)
  2. Does it score above the threshold? (oracle score check)

Fragments that fail repeatedly get downweighted — the system learns to avoid unproductive chemical motifs.

Care = Knowledge Consolidation

High-scoring fragments get reinforced:

  • Fragments that co-occur in good molecules become linked
  • Top assemblies are decomposed and their fragments shared with everyone
  • The knowledge store persists between cycles — nothing is lost

BLEND = Fragment Crossover

The crossover strategy is BLEND: take sub-structures from two different high-scoring fragments and combine them into something new. The molecular version of “What if we combine feature A from drug X with feature B from drug Y?”


How It Compares

BMC Discovery is competitive with state-of-the-art molecular optimization methods (REINVENT, GraphGA, GFlowNets) on standard TDC oracle benchmarks — without any pretraining or training data.

What mattersNeural methods (REINVENT, etc.)BMC Discovery
Training dataPretrained on ZINC (250K+ molecules)None
Score qualityCompetitive (Top-1 near oracle max)Competitive (Top-1 near oracle max)
KnowledgeImplicit (hidden in network weights)Explicit (inspectable fragment library)
TransferRetrain from scratch per targetDirect — transfer library across targets
ExplainabilityBlack boxFull — every fragment traceable
Cumulative learningNo (each run independent)Yes — each cycle builds on the last

The key insight: neural methods and BMC reach similar score ranges, but BMC does it through explicit cultural accumulation (Super-Ratchet) rather than gradient-based pretraining. The knowledge is inspectable, transferable, and compounds across projects.


What Makes This Different from Existing Approaches

ApproachCultural memory?Knowledge inspectable?
Genetic algorithmsNone (restart each run)N/A
REINVENT (neural)Implicit (hidden in weights)No
Graph-based GANoneN/A
BMC DiscoveryExplicit (fragment library)Yes (can inspect and transfer)

The critical difference: BMC makes accumulated knowledge explicit and inspectable. You can look inside the library, see which fragments are working, and understand why a molecule was designed the way it was.


The Domain-Agnostic Test

The same codebase handles all 4 targets with only configuration changes:

SettingQEDGSK3-betaJNK3DRD2
Agents50505050
ArchitectureIdenticalIdenticalIdenticalIdentical
TargetDrug-likenessKinaseKinaseGPCR
Fragment seedsGeneralKinase scaffoldsKinase scaffoldsGPCR scaffolds

One engine, four targets, competitive results on all of them. This is the BMC domain-agnostic principle in action.


Testable Predictions

#PredictionHow to test
P-DD1Cultural library produces monotonically improving results across cyclesAblation: with vs. without library persistence
P-DD2Separate libraries per target type outperform shared libraries on dissimilar targetsCross-domain ablation
P-DD3Fragment co-success binding predicts scaffold compatibility on novel targetsTrain binding on one target, test on another
P-DD4Curiosity-modulated mutation rate outperforms fixed-rate mutationRate schedule ablation
P-DD5The same architecture achieves competitive results on any pharmaceutical targetExtend to novel targets without code changes

Formalization

For readers interested in the mathematical treatment:

Super-Ratchet Effect:

$$K_{t+1} = K_t + innovation + \sum_i merge\_bonus(K_i), \quad loss = 0$$

Each cycle adds knowledge; nothing is lost between generations.

Domain-agnostic mapping:

BMCConsciousnessDrug discovery
$G$Panksepp drivesSEEKING (mutate), CARE (consolidate)
$M$Meme graphFragment registry
$I$Coherence filterOracle + validity check
$S$Neural substrateRDKit + TDC oracle

Full formal treatment: SM Parts IV–V, BM Part IV, AGI_F Part IV.


Interested in drug discovery with BMC?

We're looking for pharma partners and wet lab validation collaborators.

Get in Touch

Back to: Solutions Overview | Related: Financial Markets (same engine, different domain) | Theory: Swarm Memetics (cultural ratchet)