The Closure

The question that launched molecular biology's most ambitious program: what is the minimal self-replicating unit? Find the molecule that copies itself, and you find the origin of life. The question assumes its answer — that self-replication is a property a single molecule can have.

In 1986, Günter von Kiedrowski synthesized the first artificial self-replicating molecule: a hexanucleotide that served as a template for its own assembly from two trinucleotide fragments. It worked. It also failed. The kinetics were parabolic, not exponential. Each copy of the template bound to its own product, forming a double-stranded complex too stable to dissociate and serve as a new template. The molecule that replicated itself also inhibited itself. Product inhibition is not a technical limitation. It is a consequence of the same complementarity that enables copying in the first place. The property that makes a molecule a template is the property that makes it a dead end.

Von Kiedrowski spent twenty years trying to achieve exponential growth from a single self-replicating system. He never succeeded.


In 2009, Tracey Lincoln and Gerald Joyce at the Scripps Research Institute demonstrated exponential self-replication — but not from a single molecule. They used two RNA enzymes, each of which catalyzed the assembly of the other from smaller fragments. Neither enzyme replicated itself. Each replicated its partner. The cross-catalytic pair grew exponentially because no molecule ever had to dissociate from its own product. The bottleneck that killed von Kiedrowski's system — template-product inhibition — was dissolved by splitting replication across two molecules that never competed with themselves.

The solution to self-replication was to stop trying to make a single molecule do it. The replicating unit was the pair, not the molecule.


Stuart Kauffman had predicted this decades earlier, from a different direction entirely. In 1986, and more fully in The Origins of Order (1993), Kauffman showed that in a sufficiently diverse collection of catalytic polymers, autocatalytic closure — a set of molecules in which every member's formation is catalyzed by some other member of the set — becomes statistically inevitable. The argument is combinatorial. As molecular diversity increases, the number of possible reactions grows faster than the number of molecules. Past a critical threshold, the probability that every molecule in some subset has its formation catalyzed by another member of the subset approaches one.

This is a phase transition. Below the threshold: isolated reactions, no self-sustaining chemistry. Above: closure is almost certain. The origin of collective self-reproduction is not an improbable event that happened once. It is a statistical inevitability that happens whenever chemistry becomes diverse enough.

Kauffman called these reflexively autocatalytic food-generated sets. The unit of self-reproduction is the entire closed network. No individual molecule in the set replicates itself. The set replicates the set.


But closure alone is not enough. Manfred Eigen showed in 1971 that replication without error correction faces a threshold: above a certain genome length, the error rate per replication exceeds the rate at which selection can maintain information. The genome dissolves into random sequences. Eigen called this the error catastrophe.

The implication cuts deep. Complex replication requires error correction. Error correction requires complex molecular machinery. Complex machinery requires a complex genome. The system needs to be complex before it can maintain complexity. This is not a paradox — it is a constraint on the order of operations. Error correction infrastructure must exist before the information it protects can grow. The proofreading must precede the message.

Biology solved this with layered correction. DNA polymerase alone has an error rate of roughly one in ten thousand. Exonuclease proofreading reduces this to one in ten million. Mismatch repair brings it to one in ten billion. Three layers, each squaring the fidelity of the previous. John Hopfield showed in 1974 that kinetic proofreading — spending energy to improve discrimination — can in principle reduce error rates without limit, at the cost of time and energy per unit copied. The cell spends energy to earn accuracy. The accuracy enables the complexity. The complexity enables the error correction. The circle is closed, but it had to be entered all at once.


Tibor Gánti saw this in 1971, from the theoretical side. His chemoton — the minimal abstract unit of life — required not one cycle but three, coupled: a metabolic cycle (energy and building blocks), a template replication cycle (information), and a membrane cycle (boundary). Remove any one and the system dies. The minimal living system was already a network of three subsystems. The individual molecule was never a candidate.

In 2016, the J. Craig Venter Institute constructed JCVI-syn3.0, the smallest self-replicating cell ever built. It has 473 genes. One hundred and forty-nine of them — nearly a third — have no known function. They are necessary for the cell to live and reproduce, but no one can say what they do or why they are needed. The gap between Gánti's three theoretical cycles and 473 empirical genes is not a gap in our tools. It is a gap in the assumption that a self-reproducing system can be decomposed into individually understood parts.


The question was wrong from the start. "What is the minimal self-replicating unit?" presupposes that replication is a property an individual can possess. Every case says otherwise. The single template inhibits itself. The pair succeeds. The network arises inevitably. The network must include its own error correction. The theoretical minimum is already three coupled cycles. The empirical minimum contains 149 parts no one can explain.

Self-replication is not a molecular property. It is a network property — a closure, in the mathematical sense: a set of operations that never produces anything outside the set. The molecule that copies itself is a fiction. What copies is the relationship between molecules, distributed across all of them and located in none.

← Back to essays