Unfortunately, a comprehensive scenario describing the transition from inanimate matter into the simplest protocell is still lacking. However, several progressively more sophisticated projects have been initiated to probe protocellular life. Their ultimate goal is to create a ``minimal protocell'' [1], which can be defined as a structure bound by membrane-like material, capable of capturing energy and chemical substrates from the environment and using them to support the chemical reactions (metabolism) needed for self-maintenance and growth. A community of such model protocells should be capable of evolving towards higher complexity.
Currently, most projects aimed at understanding the emergence of protocells focus on self-replication. Nucleic acids [2,3], proteins [4,5], membranes [6] and non-biological organic molecules [7] have been used as self-replicating units. In some instances, the evolutionary potential of the self-replicating system was also examined. Even though none of these projects will yield a true protocell, they still involve highly complicated processes. To accomplish these processes, it is usually necessary to use molecular components that, though simple by biological standards, are too complex to have existed in a protobiological milieu. While this type of approximation is undesirable, the results of these studies are still of great value because they shed light on principles underlying the fundamental processes involved in the emergence of life.
In this brief overview, we report on progress in another project aimed
at understanding how protocells worked and evolved. This project is a collaborative
effort between scientists from NASA-Ames Research Center and the laboratories
of J. Szostak (Harvard Medical School), J. Lanyi (University of California,
Irvine) and D. Deamer (University of California, Santa Cruz). It is distinguished
from other projects by its focus on protocellular metabolism and the coupling
of metabolism to energy transduction, rather than on self-replication.
This research is founded on the assumption that the emergence of systems
endowed with genomes and capable of Darwinian evolution was preceded by
a pre-genomic phase, in which protocells functioned and evolved using mostly
proteins, without self-replicating nucleic acids (e.g., RNA). This
view is motivated by findings that, while the building blocks of proteins
and simple cell walls are relatively abundant in space and can be readily
generated in experiments simulating conditions on the pre-biotic earth,
the building blocks of nucleic acids are quite rare under these conditions.
Furthermore, RNA cannot readily perform such essential cellular functions
as energy capture and transport. Finally, since there is no relationship
between the function of a catalytic RNA and the function, if any, of the
protein for which it can code, there is no clear path from the ``RNA World''
to today's world of protein catalysis and nucleic acid information storage.
To begin the study of the coupling of energy transduction to metabolism in protocells, a simple, model transduction system was constructed. It consists of two proteins, bacteriorhodopsin (BR) and the F0F1 ATPase from the thermophile Bacillus PS3, embedded in liposomes. BR is a light-driven proton pump, which generates the transmembrane proton gradient needed for the ATP-making activity of the ATPase. Although this system is most likely not protobiological, it is known to have high turnover and stability [11].
To construct the system, purple membranes containing wild type BR were solubilized and incubated with preformed liposomes in the presence of detergents. When illuminated, the proteoliposomes demonstrated very active proton transport directed inward, implying that the preferential orientation of the pumping species was inside-out. Alkalinization of unbuffered medium as large as 1 pH unit was observed. Next, BR was co-reconstituted with a thermostable ATPase from Bacillus PS3 into liposomes using a similar procedure as for the BR-only liposomes. The ATPase incorporated into the membrane with its F1 unit pointing outward, making it possible to utilize the pH gradient created by BR upon illumination (see Fig. 1). The system was stable under illumination for several hours (J. Lanyi and L. Brown, unpublished results).
Figure 1: Schematic of the coupled BR-ATPase energy transduction system.
It was further demonstrated that the production of ATP in liposomes could be coupled to a model, metabolic chemical reaction. As the model reaction, the production of acetyl-coenzyme A (acetyl CoA) from acetate and coenzyme A was chosen. This reaction is catalyzed by a thermophilic acetyl-CoA synthetase from Pyrococcus furiosus, which utilizes ATP, but not ADP to make acetyl-CoA. After adding this enzyme and appropriate substrates to liposomes, net synthesis of acetyl-CoA upon illumination was observed (J. Lanyi and L. Brown, unpublished results).
The next step will be to use the same enzyme, which is known to have broad specificity, to form different long-chain acyl-CoAs using fatty acids as substrates. Then, acyl-CoA transferase will add the fatty acid chain to lyso-phosphatylcholine (a lipid with only one hydrocarbon tail), and produce a new lipid. In a similar experiment performed previously, oleyl-CoA was coupled to lyso-phosphatylcholine, and production and growth of liposomes was observed (D. Deamer, unpublished results). Since higher concentrations of the lyso-lipid act as a detergent, a dynamic balance between liposome formation and destruction will be achieved if a mixture of different hydrocarbon chains is included. This, in turn, will introduce ``natural selection'' for the best components that can self-assemble into liposomes.
The results, described so far, demonstrate how simple metabolic reactions can be coupled to external sources of energy. The next objective is to simplify the components of the system and direct them to match better our ultimate goal - the creation of a protein-based model of primitive metabolism.
Figure 2: Richardson diagram of intermediate states of two mechanisms for proton translocation through the M2 protein of human Influenza A. Histidines and water wire shown as dark and light CPK models, respectively. Solvating bilayer and waters omitted. Figure made with MOLSCRIPT [14]
The results also suggest how the channel can be re-engineered to become a simple proton pump. First, a proton source must be placed inside the channel. Among many possible sources of protons, probably the simplest are polycyclic aromatic hydrocarbons [17] - a class of molecules readily available on the early earth. Once the channel is coupled with a proton source, the last thing that remains to be done is to ensure that protons transferred through the channel to the aqueous medium are unlikely to be used to regenerate the proton source. Since the proposed proton pump is asymmetrical, this should be possible to accomplish by appropriate genetic modification of the channel.
Using this technique, we will generate models for several important proto-enzymes involved in the creation of new proteins. The formation of a peptide bond between amino acids requires the input of energy. Thus the first target of the new in vitro evolution procedure will be an enzyme that can use the ATP generated by the transduction system to phosphorylate amino acids, thereby preparing them to be used to form new proteins. The second target for in vitro evolution will be an enzyme that can catalyze the formation of a peptide bond in the absence of a nucleic acid template, a peptide ligase. In modern biology, there are very few examples of such enzymes [19], and all are quite large and complex.
Central to this concept is the emergence of weak ligases, that would produce a diverse pool of peptides. A few of the peptides so generated could have been better catalysts of peptide bond formation than the proto-enzymes that formed them, thereby generating even more peptides and increasing the chances of producing functional ones. Some of these functional peptides would be proteases, proto-enzymes that cut peptide bonds. Since proteases cleave unstructured peptides more rapidly than structured ones, and since functional peptides have some degree of ordered structure, proteases would preferentially destroy non-functional peptides. Occasionally, the newly produced peptides would be capable of performing novel functions. If these novel peptides integrated into the protocellular metabolism, they could increase the capabilities of the protocell. Eventually, this process could lead to the emergence (or utilization) of nucleic acids and their coupling with peptides into a genomic system.
To examine the evolutionary potential of a non-genomic system, we have developed a simple, computationally tractable model that is still capable of capturing the essential biochemical features of the real system. In this model, the specific identities of amino acids are not considered, and the catalytic efficiencies of all peptides of the same length are normally distributed. As peptides grow longer, their mean efficiency also grows, but non-monotonically: only when peptides reach sufficient lengths to adopt ordered structures do the average efficiencies increase markedly. Peptides act as both catalysts and reaction substrates. In the simplest implementation of the model, only two catalyzed reactions were considered: the formation (polymerization) and destruction (hydrolysis) of peptide bonds. The properties of the products of these reactions are related to the properties of the reactants, and so the model is called the Inherited Efficiencies Model.
Figure 3: Average catalytic efficiency of peptides in a test protocell simulated by the Inherited Efficiencies Model. Left panel shows the effect of decreasing ease of forming ligases (red line) relative to a reference model (black line). Right panel shows effect of decreasing preference for cutting bonds in unstructured peptides (red line) relative to the same reference model (black line).
Simulations of this model were carried out using a Monte Carlo method [20]. For many choices of model parameters, the overall catalytic efficiency of the protocell was observed to increase. Two properties strongly affected the ability of the protocell to evolve: the balance between the probabilities that a peptide is an efficient protease and an efficient ligase, and the strength of the preference toward the hydrolysis of unstructured peptides. It was found that when efficient ligases were not difficult to form (Fig. 3, left panel), or when there was a strong preference for the hydrolysis of unstructured peptides (Fig. 3, right panel), any long, highly efficient peptides formed would not be destroyed before they could accelerate the rate at which new, and possibly efficient, peptides were formed. Under these conditions, the ability of the protocell to improve its catalytic capabilities was high. These results demonstrate the possibility of a novel mechanism of early protocellular evolution that does not require the presence of a genome, nor any form of sequence complementarity or the exact replication of protein sequences. In fact, the sloppy replication of protein sequences is an advantage in the earliest phase of evolution because it allows for the rapid exploration of the space of proteins and the discovery of new functions. It is the preservation of these functions and their inter-relationships which must be maintained during this early stage of evolution, not the identity of the actors performing those functions.