Saturday, July 31, 2010

Watching the evolution knobs spin

Evolution really happens at the dials controlling genes, more than in the protein sequences those genes encode.

The shock of humans having only ~23,000 genes has yet to fully sink in. Fewer genes than soybeans? Than the potato? The depth to which some of these genes are conserved is just as astonishing, with a promoter of eye development working quite well when transplanted into fruit flies. What, then, makes us different? What has evolution been doing all this time?

A recent paper in Science adds evidence that far more variation goes on in the promoters of genes than in their coding sequences. The authors tracked the sites of action (i.e., DNA binding) of two liver-specific transcription regulatory proteins in chickens, opossums, mice, dogs, and humans, and found that few sites were recognizably conserved. Most sites disappeared, reappeared, altered, and mutated with considerable abandon.

The regulators themselves (CEBPA and HNF4A) were very well conserved, meaning that as proteins, they had virtually the same sequence in each organism. More critically, their preferred binding site on DNA stayed the same as well. That preference tends to be hard to change when a protein's binding to thousands of different sites (~20,000 is the estimate given for each protein) is important for an organism's liver and other organs. Putting it in technical terms, such binding specificities tend to be subject to strong purifying selection.

On the other hand, the individual sites are much less constrained by evolution, since changes affect only that individual target gene. Some genes that have been studied as targets of CEBPA include metabolic enzymes, detoxifying enzymes like cytochromes P450, EPHX1, and SULT2A1, several insulin-regulated genes, growth factors, the gene for albumin, coagulation factor VIII, and other transcriptional regulators in liver development and function.

The current authors use some high-tech wizardry to isolate all the DNA bound to these regulatory proteins from each species of interest, and sequence around each site to see where it maps in the respective species' genome. This gives them the dataset of sites that they then mine to ask whether the sites have stayed consistent over evolutionary time. The answer is no: "For these two liver-specific TFs, binding events appear to be shared 10 to 22% of the time between mammals from any two of the three placental lineages we profiled, separated by approximately 80 million years of evolution (figs. S6 and S7). This result reveals a rapid rate of evolution in transcriptional regulation among closely related vertebrates."
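The comparison boils down to asking what fraction of one species' bound sites are also bound at the corresponding position in another species. Here is a rough sketch in Python, assuming the sites have already been mapped to aligned regions that carry shared labels. The labels, the toy data, and the function name are all invented for illustration, and the paper's actual pipeline is surely more involved.

```python
def shared_fraction(sites_a, sites_b):
    """Fraction of the aligned regions bound in species A that are also bound in species B."""
    if not sites_a:
        return 0.0
    return len(sites_a & sites_b) / len(sites_a)


# Made-up toy data: each label stands for one aligned genomic region.
human_sites = {"r1", "r2", "r3", "r4", "r5"}
mouse_sites = {"r2", "r6", "r7"}

print(shared_fraction(human_sites, mouse_sites))
```

Note that this measure is asymmetric: the fraction of human sites shared with mouse need not equal the fraction of mouse sites shared with human.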

For example, they show the binding of CEBPA to one region around the PCK1 gene in liver. PCK1 encodes phosphoenolpyruvate carboxykinase, a metabolic enzyme that helps synthesize glucose.

The coding exons of the PCK1 gene are shown at the lower right. kb = kilobase pairs. Hsap = human, Mmus = mouse, Cfam = dog, Mdom = short-tailed opossum, and Ggal = chicken.

The pattern in chicken is quite simple. More sites appear in the mammals, with novel and significant sites appearing in dogs and humans. The scoring of these sites is somewhat unclear, in terms of how minor a site could be and still score, not to mention that they had no functional tests of which sites actually affected local gene transcription.

A key and well-occupied site right at the start of the PCK1 gene is well-conserved, however, and probably has a dominant regulatory role. What role the other sites might have is not clear, and might be minimal. So their conclusion needs to be taken with a grain of salt, as they indicate that most of the highly conserved DNA binding sites are at this kind of most-influential position near genes that rely heavily on regulation by the bound regulator.

Nevertheless, the reason for flexibility in regulator binding is not hard to find, since binding sites are often composed of only six or eight nucleotides, with sloppy allowances for binding to sites with some mutations as well. New sites can appear easily, and old sites can be destroyed just as easily. So these regulatory proteins bind all over the genome and these sites change frequently, allowing regulatory variation to happen easily by mutation. The authors conclude: "Taken together, the steady accumulation of small changes in the genetic sequence appears to rapidly remodel thousands of TF binding sites in mammals." [TF refers to transcription factor, another word for DNA-binding regulator.]
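To get a feel for how easily such short sites come and go, here is a minimal Python sketch. It assumes a uniformly random sequence, which real genomes are not. The motif `ACGTAC` and the function names are made up for illustration; this is not the real CEBPA or HNF4A motif, nor the paper's scoring method.

```python
import random


def count_sites(seq, motif, max_mismatches=0):
    """Count windows of seq that match motif with at most max_mismatches differences."""
    k = len(motif)
    hits = 0
    for i in range(len(seq) - k + 1):
        mismatches = sum(a != b for a, b in zip(seq[i:i + k], motif))
        if mismatches <= max_mismatches:
            hits += 1
    return hits


def expected_exact_sites(length, k):
    """Expected exact matches to one k-mer on one strand of a uniform random sequence."""
    return (length - k + 1) / 4 ** k


if __name__ == "__main__":
    random.seed(0)  # fixed seed so reruns use the same toy sequence
    motif = "ACGTAC"  # hypothetical 6-nucleotide motif, not a real TF site
    seq = "".join(random.choice("ACGT") for _ in range(200_000))
    print("expected exact sites:", expected_exact_sites(len(seq), len(motif)))
    print("observed exact sites:", count_sites(seq, motif))
    print("sites allowing one mismatch:", count_sites(seq, motif, max_mismatches=1))
```

Under these assumptions a given six-nucleotide motif turns up about once every 4,096 positions (4^6), so a genome of three billion bases would hold several hundred thousand chance matches per strand. Each window that is one mismatch away is a single point mutation from becoming an exact site, and each exact site is a single mutation from being lost.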

Given the complexity of biology, the network is the real locus of evolution, with the pieces (proteins encoded by genes) being shuffled around by regulatory experiments over time. Indeed, another recent paper compared the multicellular organism Volvox with its single-celled relative Chlamydomonas, and found that they had almost exactly the same number of genes, and few gene differences overall. They conclude: "This is consistent with previous observations indicating co-option of ancestral genes into new developmental processes without changes in copy number or function." And one of the most important mechanisms of such co-option is placing the given gene under novel regulation. This process is slightly reminiscent of the human economy, which is being driven increasingly as a "knowledge economy", shuffling around financing, software, and organization while the basic commodities of existence remain far more constant.

  1. Burk,

    You may want to re-read the article before you assume that even some aspects of free-will have been "explained." Here is an interesting quote:

    "In other words, we have no reason to assume that either predictability or lack of predictability has anything to say about free will. The fact that we do make this association has more to do with the model of the world that we subtly import into such thought experiments than with the experiments themselves."

    And here he speaks directly to you:

    "The model in question holds that the universe exists in space and time as a kind of ultimate code that can be deciphered. This image of the universe has a philosophical and religious provenance, and has made its way into secular beliefs and practices as well. In the case of human freedom, this presumption of a “code of codes” works by convincing us that a prediction somehow decodes or deciphers a future that already exists in a coded form."

    Here is another interesting one:

    "Let’s return to the example of the experiment predicting the monkeys’ decisions. What the experiment tells us is nothing other than that the monkeys’ decision making process moves through the brain, and that our technology allows us to get a reading of that activity faster than the monkeys’ brain can put it into action. From that relatively simple outcome, we can now see what an unjustified series of rather major conundrums we had drawn."

    And so your assertion that some aspect of free-will had been explained was unjustified as well.


  2. Burk,

    Excuse me, I should have noted that I'm referencing the NY Times article you link to at the end of your post and not your main post or any link or reference in your main post.

  3. Hi, Darrell-

    Thanks for pointing all this out. I found this article interesting, but as you indicate, it hardly hews entirely to my viewpoint, or ends up quite coherent, either. I was just skimming as I got to the end, apparently. Indeed, I might be better off taking a critical stance.

    But first, let me try to reconstruct his argument as well as I can. There is basic unpredictability to the future, especially in human choices. Thus we have free will, because however physically determined, our choices are not known to us or even potentially known, so our wills are de facto free.

    This is not false as far as it goes, but I don't think this really works for either the theist or the naturalist view of free will. My understanding is that theist free will depends on consciousness being sovereign: the experience that we have of making choices is precisely the act of making those choices. Additionally, such free will is unconstrained by prior influences and ulterior issues. It is really free, so that we can be condemned (even eternally) if we make the wrong choices. The article doesn't agree with that form of free will at all.

    The naturalist conception of free will is that our choices are determined by our histories, interpreted at physical, psychological, developmental, evolutionary, etc. levels. The naturalist doesn't particularly care whether these choices are actually predictable, but only about the principle that there is no free "extra" element in the universe that frees human choices from this embeddedness in our nature and history.

    In that sense, the article basically agrees with naturalist free will, even though it takes the basic unpredictability (if so) of our choices as making that lack of free will unthreatening. That is a common approach, but as mentioned, doesn't really resolve the issue for those who are paying attention. His "freedom" is a false freedom to the theist, since it has no sources other than the chaotic details of naturalism.

    I think my approach on moral freedom is more productive (in combination with a complete lack of physics-level free will), positing that our sense of freedom is actually an algorithmically encoded openness to experience and sensitivity to others, such that we can change our minds or be induced to change our minds. This creates the moral responsibility by which rules are expected to be followed and choices made in accordance with the moral norms of the society (whether taking the form of lofty ideals or the letter of a law). Anyone incapable of such moral learning is deemed morally incompetent and institutionalized. Thus what we call moral free will is really an inborn aspect of our social nature, as determined (yet variable) as any other aspect of our natures.