Drug Discovery
In one sentence: The same engine that models consciousness discovers drug candidates — molecular fragments become “memes,” cultural evolution replaces random search, and each generation builds on the accumulated chemical knowledge of the previous one.
Theory sources: BM (meme dynamics), EMT (cultural accumulation), SM (cumulative cultural evolution, ratchet effect), AGI_F (multi-agent stigmergy)
Implementation: bmc-engine/bmc-discovery/ (Rust core + Python chemistry bridge + RDKit)
The Core Idea: One Engine, Any Domain
BMC claims its architecture is domain-agnostic: change the environment, and the same engine solves a different problem. Drug discovery is the strongest test of this claim.
| BMC component | In a mind | In drug discovery |
|---|---|---|
| Drives | Curiosity, care, fear | Explore new molecules, exploit good scaffolds |
| Memes | Beliefs, skills, memories | Molecular fragments |
| Belief network | Worldview, personality | Assembled molecule |
| Filter | “Does this make sense?” | “Is this molecule valid and useful?” |
| Hardware | Brain | Chemistry toolkit (RDKit) |
| Cultural memory | Shared knowledge repository | Fragment library (what worked before) |
No domain-specific architecture changes. The same code. Just a different environment and scoring function.
How It Works
The Agent Loop
Each BMC agent works like a molecular chemist with access to a shared library:
Try new
fragment combos"] --> MUT["Mutate
Swap atoms,
change rings"] MUT --> ASS["Assemble
Build a
molecule"] ASS --> EVAL["Test
Score against
target"] EVAL --> I{"Passes
filter?"} I -->|"Yes"| LIB["Share
Donate to
shared library"] I -->|"No"| S LIB --> CARE["Consolidate
Reinforce
good fragments"] CARE --> S style S fill:#2a2a1e,stroke:#ffd700,color:#ffd700 style MUT fill:#1a1a2e,stroke:#6af,color:#6af style ASS fill:#0d2a1a,stroke:#34d399,color:#34d399 style EVAL fill:#2a1a0d,stroke:#f80,color:#f80 style I fill:#2a0d0d,stroke:#f66,color:#f66 style LIB fill:#1a1a2e,stroke:#6af,color:#6af style CARE fill:#0d2a1a,stroke:#34d399,color:#34d399
Four Ways to Modify Molecules
| Strategy | What it does | BMC analog |
|---|---|---|
| Atom swap | Replace one atom in a fragment | Point mutation |
| Ring replacement | Substitute the ring system | Structural mutation |
| Group addition | Attach a functional group | Meme extension |
| Crossover | Combine parts of two molecules | BLEND recombination |
The Ratchet: Each Generation Builds on the Last
The critical innovation: agents don’t start from scratch each cycle. A shared knowledge store accumulates everything that worked:
Random exploration
Find first good fragments"] --> KS["Knowledge Store
6,000–7,000 fragments
Co-success bindings
Top fragment library"] C1 --> KS KS --> C5["Cycle 5
Build on known scaffolds
Targeted improvement"] C5 --> KS KS --> C10["Cycle 10
Near-optimal molecules
Refined combinations"] style C1 fill:#2a0d0d,stroke:#f66,color:#f66 style KS fill:#2a2a1e,stroke:#ffd700,color:#ffd700 style C5 fill:#2a1a0d,stroke:#f80,color:#f80 style C10 fill:#0d2a1a,stroke:#34d399,color:#34d399
- Fragment registry: Tracks which fragments have been tried and how they performed
- Co-success binding: Fragments that appear together in high-scoring molecules get linked (like memes that reinforce each other)
- Cultural library: Top-performing fragments are shared across all agents
- Domain isolation: Different target types get separate libraries (kinase fragments for kinase targets, GPCR fragments for GPCR targets)
This is the Super-Ratchet Effect — each generation builds on accumulated knowledge, and no information is lost between cycles.
Results
Tested on 4 standard pharmaceutical benchmarks (Therapeutics Data Commons):
| Target | What it is | Best score | Progress from start |
|---|---|---|---|
| QED | Drug-likeness (how “drug-like” is the molecule?) | 0.947 | 99.6% of theoretical maximum |
| GSK3-beta | Alzheimer’s disease target (kinase) | 0.960 | +31.5% from baseline |
| JNK3 | Neurodegeneration target (kinase) | 0.740 | +8.8% from baseline |
| DRD2 | Dopamine receptor (Parkinson’s, schizophrenia) | 0.997 | +384% from baseline |
The Ratchet in Action
Watch how scores improve as the cultural library grows:
| Cycle | QED | GSK3-beta | DRD2 |
|---|---|---|---|
| 1 (fresh start) | 0.880 | 0.730 | 0.206 |
| 5 (library growing) | 0.920 | 0.890 | 0.950 |
| 10 (mature library) | 0.947 | 0.960 | 0.997 |
The improvement accelerates as the library grows. DRD2 is the most dramatic: from 0.206 (barely above random) to 0.997 (near-perfect) in 10 cycles.
What These Numbers Mean
- QED 0.947: The discovered molecule scores near the theoretical maximum for drug-likeness properties (Lipinski rules, solubility, molecular weight, etc.)
- DRD2 0.997: 99.7% predicted binding to the dopamine D2 receptor — the agent learned to construct molecular scaffolds with near-perfect complementarity
- GSK3-beta 0.960: High predicted activity against a validated Alzheimer’s target
BMC Mechanisms at Work
Curiosity = Exploration
The SEEKING drive controls the mutation rate:
- Early cycles: High mutation (explore broadly across chemical space)
- Later cycles: Lower mutation (exploit known good scaffolds)
- Gap detection: Missing patterns in fragment co-occurrence trigger targeted exploration
Filter = Chemical Validity
Two-level quality control:
- Can this molecule physically exist? (valence rules, ring stability, synthetic accessibility)
- Does it score above the threshold? (oracle score check)
Fragments that fail repeatedly get downweighted — the system learns to avoid unproductive chemical motifs.
Care = Knowledge Consolidation
High-scoring fragments get reinforced:
- Fragments that co-occur in good molecules become linked
- Top assemblies are decomposed and their fragments shared with everyone
- The knowledge store persists between cycles — nothing is lost
BLEND = Fragment Crossover
The crossover strategy is BLEND: take sub-structures from two different high-scoring fragments and combine them into something new. The molecular version of “What if we combine feature A from drug X with feature B from drug Y?”
How It Compares
BMC Discovery is competitive with state-of-the-art molecular optimization methods (REINVENT, GraphGA, GFlowNets) on standard TDC oracle benchmarks — without any pretraining or training data.
| What matters | Neural methods (REINVENT, etc.) | BMC Discovery |
|---|---|---|
| Training data | Pretrained on ZINC (250K+ molecules) | None |
| Score quality | Competitive (Top-1 near oracle max) | Competitive (Top-1 near oracle max) |
| Knowledge | Implicit (hidden in network weights) | Explicit (inspectable fragment library) |
| Transfer | Retrain from scratch per target | Direct — transfer library across targets |
| Explainability | Black box | Full — every fragment traceable |
| Cumulative learning | No (each run independent) | Yes — each cycle builds on the last |
The key insight: neural methods and BMC reach similar score ranges, but BMC does it through explicit cultural accumulation (Super-Ratchet) rather than gradient-based pretraining. The knowledge is inspectable, transferable, and compounds across projects.
What Makes This Different from Existing Approaches
| Approach | Cultural memory? | Knowledge inspectable? |
|---|---|---|
| Genetic algorithms | None (restart each run) | N/A |
| REINVENT (neural) | Implicit (hidden in weights) | No |
| Graph-based GA | None | N/A |
| BMC Discovery | Explicit (fragment library) | Yes (can inspect and transfer) |
The critical difference: BMC makes accumulated knowledge explicit and inspectable. You can look inside the library, see which fragments are working, and understand why a molecule was designed the way it was.
The Domain-Agnostic Test
The same codebase handles all 4 targets with only configuration changes:
| Setting | QED | GSK3-beta | JNK3 | DRD2 |
|---|---|---|---|---|
| Agents | 50 | 50 | 50 | 50 |
| Architecture | Identical | Identical | Identical | Identical |
| Target | Drug-likeness | Kinase | Kinase | GPCR |
| Fragment seeds | General | Kinase scaffolds | Kinase scaffolds | GPCR scaffolds |
One engine, four targets, competitive results on all of them. This is the BMC domain-agnostic principle in action.
Testable Predictions
| # | Prediction | How to test |
|---|---|---|
| P-DD1 | Cultural library produces monotonically improving results across cycles | Ablation: with vs. without library persistence |
| P-DD2 | Separate libraries per target type outperform shared libraries on dissimilar targets | Cross-domain ablation |
| P-DD3 | Fragment co-success binding predicts scaffold compatibility on novel targets | Train binding on one target, test on another |
| P-DD4 | Curiosity-modulated mutation rate outperforms fixed-rate mutation | Rate schedule ablation |
| P-DD5 | The same architecture achieves competitive results on any pharmaceutical target | Extend to novel targets without code changes |
Formalization
For readers interested in the mathematical treatment:
Super-Ratchet Effect:
$$K_{t+1} = K_t + innovation + \sum_i merge\_bonus(K_i), \quad loss = 0$$Each cycle adds knowledge; nothing is lost between generations.
Domain-agnostic mapping:
| BMC | Consciousness | Drug discovery |
|---|---|---|
| $G$ | Panksepp drives | SEEKING (mutate), CARE (consolidate) |
| $M$ | Meme graph | Fragment registry |
| $I$ | Coherence filter | Oracle + validity check |
| $S$ | Neural substrate | RDKit + TDC oracle |
Full formal treatment: SM Parts IV–V, BM Part IV, AGI_F Part IV.
Interested in drug discovery with BMC?
We're looking for pharma partners and wet lab validation collaborators.
Get in TouchBack to: Solutions Overview | Related: Financial Markets (same engine, different domain) | Theory: Swarm Memetics (cultural ratchet)