Biophilia: cell biology

Showing posts with label cell biology. Show all posts

Saturday, March 29, 2025

What Causes Cancer? What is Cancer?

There is some frustration in the literature.

Fifty years into the war on cancer, what have we learned and gained? We do not have a general cure, though we have a few cures and a lot of treatments. We have a lot of understanding, but no comprehensive theory or guide to practice. While some treatments are pin-point specific to certain proteins and even certain mutated forms of those proteins, most treatments remain empirical, even crude, and few provide more than a temporary respite. Cancer remains an enormous challenge, clinically and intellectually.

Recently, a prominent journal ran a provocative commentary about the origins of cancer, trashing the reigning model of "Somatic Mutation Theory", or SMT. Which is the proposition that cancer is caused by mutations that "drive" cell proliferation, and thus tumor growth. I was surprised at the cavalier insinuations being thrown around by these authors, their level of trash talk, and the lack of either compelling evidence or coherent alternative model. Some of their critiques have a fair basis, as discussed below, but to say, as the title does, that this is "The End of the Genetic Paradigm of Cancer" is simply wrong.

"It is said that the wise only believe in what they can see, and the fools only see what they can believe in. The latter attitude cements paradigms, and paradigms are amplified by any new-looking glass that puts one’s way of seeing the world on steroids. In cancer research, such a self-fulfilling prophecy has been fueled by next-generation DNA sequencing."

"However, in the quest for predictive biomarkers and molecular targets, the cancer research community has abandoned deep thinking for deep sequencing, interpreting data through the lens of clinical translation detached from fundamental biology."

Whew!

The main critique, once the gratuitous insults and obligatory references to Kuhn and Feynman are cleared away, is that cancer does not resemble other truly clonal disease / population processes, like viral or bacterial infections. In such processes, (which have become widely familiar after the COVID and HIV pandemics), a founder genotype can be identified, and its descendants clearly derive from that founder, while accumulating additional mutations that may respond to the Darwinian pressures, such as the immune system and other host defenses. While many cancers are clearly driven by some founding mutation, when treatments against that particular "driver" protein are given, resistance emerges, indicating that the cancer is a more diverse population with a very active mutation and adaptation process.

Additionally, tumors are not just clones fo the driving cell, but have complex structure and genetic variety. Part of this is due to the mutator phenotypes that arise during carcinogenesis, that blow up the genome and cause large numbers of additional mutations- many deleterious, but some carrying advantages. More significantly, tumors arise from and continue to exist in the context of organs and tissues. They can not just grow wildly as though they were on a petri plate, but must generate, for example, vascular structures and a "microenvironment" including other cells that facilitate their life. Similarly, metastasis is highly context-dependent and selective- only very few of the cells released by a tumor land in a place they find conducive to new growth. This indicates, again, that the organ setting of cancer cells is critically important, and accounts in large part for this overall difference between cancers and more straightforward clonal processes.

Schematic of cancer development, from a much more conventional and thorough review of the field.

Cancer cells need to work with the developmental paradigms of the organism. For instance, the notorious "EMT", or epithelial-mesenchymal transition is a hallmark of de-differentiation of many cancer cells. They frequently regress in developmental terms to recover some of the proliferative and self-repair potential of stem cells. What developmental program is available or allowed in a particular tissue will vary tremendously. Thus cancer is not caused by each and every oncogenic mutation, and each organ has particular and distinct mutations that tend to cause cancers within it. Indeed, some organs hardly foster any cancers at all, while other organs with more active (and perhaps evolutionarily recent) patterns of proliferation (such as breast tissue, or prostate tissue) show high rates of cancer. Given the organ setting, cancer "driver" mutations need not only unleash the cell's own proliferation, but re-engineer its relations with other cells to remove their inhibition of its over-growth, and pursuade them to provide the environment it needs- nutritionally, by direct contact, by growth factors, vascular formation, immune interactions, etc., in a sort of para-organ formation process. It is a complicated job, and one mutation is, empirically, rarely enough.

"Instead, cancer can be broadly understood as “development gone awry”. Within this perspective, the tissue organization field theory is based on two principles that unite phylogenesis and ontogenesis."
"The organicist perspective is based on the interdependency of the organism and its organs. It recognizes a circular causal regimen by closure of constraints that makes parts interdependent, wherein these constraints are not only molecules, but also biophysical force."

As an argument or alternative theory, this leaves quite a bit to be desired, and does not obviate the role of initiating mutations in the process.

It remains, however, that oncogenic mutations cause cancer, and treatments that address those root causes have time and again shown themselves to be effective cancer treatments, if tragically incomplete. The rise of shockingly effective immunotherapies for cancer have shown, however, that the immune system takes a more holistic approach to attacking disease than such "precision" single-target therapies, and can make up for the vagaries of the tissue environment and the inflammatory, developmental, and mutational derangements that happen later in cancer development.

In one egregious citation, the authors hail an observation that certain cancers need both a mutation and a chemical treatment to get started, and that the order of these events is not set in stone. Traditionally, the mutation is induced first, and then the chemical treatment, which causes inflammation, comes second. They state:

"The qualitative dichotomy between a mutagenic initiator that creates ’cancer cells’ and the non-genetic, tissue-perturbing promoter that expands them may not be as clear-cut. Indeed, the reverse experiment (first treatment with the promoter followed by the initiator) equally produces tumors. This result refutes the classical model that requires that the mutagenic (alleged) initiator must act first."

The citation is to a paper entitled "The reverse experiment in two-stage skin carcinogenesis. It cannot be genuinely performed, but when approximated, it is not innocuous". This paper dates from 1993, long before sequencing was capable of evaluating the mutation profiles of cancer cells. Additionally, the authors of this paper themselves point out (in the quote below) a significant assymetry in the treatments. Their results are not "equal":

"The two substances showed a reciprocal enhancing effect, which was sometimes weak, sometimes additive, and sometimes even synergistic, and was statistically most significant when the results were assessed from the time of DMBA application. Although the reverse experiment was not in any way innocuous it always resulted in a lower tumor crop than the classical sequence of DMBA followed by a course of TPA treatment.
However, the lower tumor crop in the reverse experiment cannot be used to prove a qualitative difference between initiators and promoters."

(DMBA is the mutagen, while TPA is the inflammatory accelerant.)

So chemical treatment can prepare the ground for subsequent mutant generation in forming cancers in this system, while being much less efficient than the traditional order of events. This is not a surprise, given that this chemical (TPA) treatment causes relatively long-term inflammation and cell proliferation on its own.

"An epistemic shift towards a biological theory of cancer may still be an uphill battle in the current climate of thought created by the ease of data collection and a culture of research that discourages ’disruptive science’. Here, we have made an argument for dropping the SMT and its epicycles. We presented new and old but sidelined theoretical alternatives to the SMT that embrace theory and organismal biology and can guide experiments and data interpretation. We expect that the diminishing returns from the ceaselessly growing databases of somatic mutations, the equivalent to Darwin’s gravel pit, may soon reach a pivot point."

One rarely reads such grandiloquent summaries (or mixed metaphors) in scientific papers! But here they are truly beating up on straw men. In the end, it is true that cancer is quite unlike clonal infectious diseases, and for this, as for many other reasons, has had scientists scratching their heads for decades, if not centuries. But rest assured that this chest-thumping condescension is quite unnecessary, since those in the field are quite aware of these difficulties. The various nebulous alternatives these authors offer, whether the "epigenetic landscape", the "tissue organization field theory", or the "biological theory of cancer" all have kernels of logic, but the SMT remains the foundation-stone of cancer study and treatment, while being, for all the reasons enumerated above and by these authors, only part of the edifice, not the whole truth.

Some more thoughts about crypto.
Resistance is up to congress, ultimately.
China takes purges quite a bit more seriously than we do.
But we are getting there.

Saturday, February 15, 2025

Cloudy, With a Chance of RNA

Long RNAs play structural and functional roles in regulation of chromosome replication and expression.

One of the wonderful properties of the fruit fly as a model system of genetics and molecular biology has been its polytene chromosomes. These are hugely expanded bundles of chromosomes, replicated thousands of times, which have been observed microscopically since the late 1800's. They exist in the larval salivary gland, where huge amounts of gene expression are needed, thus the curious evolutionary solution of expanding the number of templates, not only of the gene needed, but of the entire genome.

These chromosomes where closely mapped and investigated, almost like runic keys to the biology of the fly, especially in the day before molecular biology. Genetic translocations, loops, and other structural variations could be directly observed. The banding patterns of light, dark, expanded, and compressed regions were mapped in excruciating detail, and mapped to genetic correlates and later to gene expression patterns. These chromosomes provided some of the first suggestions of heterochromatin- areas of the genome whose expression is shut down (repressed). They may have genes that are shut off, but they may also be structural components, such as centromeres and telomeres. These latter areas tend to have very repetitive DNA sequences, inherited from old transposons and other junk.

A diagram of polytene chromosomes, bunched up by binding at the centromeres. The banding pattern is reproducible and represents differences in proteins bound to various areas of the genome, and gene activity.

It has become apparent that RNA plays a big role in managing these areas of our chromosomes. The classic case is the XIST RNA, which is a long (17,000 bases) non-coding RNA that forms a scaffold by binding to lots of "heterogeneous" RNA-binding proteins, and most importantly, stays bound near the site of its creation, on the X chromosome. Through a regulatory cascade that is only partly understood, the XIST RNA is turned off on one of the X chromosomes, and turned on the other one (in females), leading the XIST molecule to glue itself to its chromosome of origin, and then progressively coat the rest of that chromosome and turn it off. That is, one entire X is turned into heterochromatin by a process that requires XIST scaffolding all along its length. That results in "dosage compensation" in females, where one X is turned off in all their cells, allowing dosage (that is, the gene expression) of its expressed genes to approximate those of males, despite the presence of the extra X chromosome. Dosage is very important, as shown by Down Syndrome, which originates from a duplication of one of the smallest human chromosomes, creating imbalanced gene dosage.

A recent paper described work on "ASAR" RNAs, which similarly arise from highly repetitive areas of human chromosomes, are extremely long (180,000 bases), and control expression and chromosome replication in an allele-specific way on (at least) several non-X chromosomes. These RNAs, again, like XIST, specifically bind a bunch of heternuclear binding proteins, which is presumably central to their function. Indeed, these researchers dissected out the 7,000 base segment of ASAR6 that is densest in protein binding sites, and find that, when transplanted into a new location, this segment has dramatic effects on chromosome condensation and replication, as shown below.

The intact 7,000 base core of ASAR6 was transplanted into chromosome 5, and mitotic chromosomes were spread and stained. The blue is a general DNA stain. The green is a stain for newly synthesized DNA, and the red is a specific probe for the ASAR6 sequence. One can see on the left that this chromosome 5 is replicating more than any other chromosome, and shows delayed condensation. In contrast, the right frame shows a control experiment where an anti-sense version of the ASAR6 7,000 base core was transplanted to chromosome 5. The antisense sequence not only does not have the wild-type function, but also inhibits any molecule that does by tightly binding to it. Here, the chromosome it resides on (arrows) is splendidly condensed, and hardly replicating at all (no green color).

Why RNA? It has become clear over the last two decades that our cells, and particularly our nuclei, are swimming with RNAs. Most of the genome is transcribed in some way or other, despite a tiny proportion of it coding for anything. 95% of the RNAs that are transcribed never get out of the nucleus. There has been a growing zoo of different kinds of non-coding RNAs functioning in translational control, ribosomal maturation, enhancer function, and here, in chromosome management. While proteins tend to be compact bundles, RNAs can be (as these ASARs are) huge, especially in one dimension, and thus capable of physically scaffolding the kinds of structures that can control large regions of chromosomes.

Chromosomes are sort of cloudy regions in our cells, long a focus of observation and clearly also a focus of countless proteins and now RNAs that bind, wind, disentangle, transcribe, replicate, and congregate around them. What all these RNAs and especially the various heteronuclear proteins actually do remains pretty unclear. But they form a sort of organelle that, while it protects and manages our DNA, remarkably also allows access to it for sequence-specific binding proteins and the many processes that plow through it.

"In addition, recent studies have proposed that abundant nuclear proteins such as HNRNPU nonspecifically interact with ‘RNA debris’ that creates a dynamic nuclear mesh that regulates interphase chromatin structure."

What it takes to be a real Christian.
Courts, schmorts.
Assholes.
I wonder what Christians think about the destruction of US AID? Or of consumer protections?
A disconnect down at Silicon Valley.
Retaliation is real.

Saturday, February 8, 2025

Sugar is the Enemy

Diabetes, cardiovascular health, and blood glucose monitoring.

Christmas brought a book titled "Outlive: The Science and Art of Longevity". Great, I thought- something light and quick, in the mode Gweneth Paltrow or Deepak Chopra. I have never been into self-help or health fad and diet books. Much to my surprise, however, it turned out to be a rather rigorous program of preventative medicine, with a side of critical commentary on our current medical system. A system that puts various thresholds, such as blood sugar and blood pressure, at levels that represent serious disease, and cares little about what led up to them. Among the many recommendations and areas of focus, blood glucose levels stand out, both for their pervasive impact on health and aging, and also because there are new technologies and science that can bring its dangers out of the shadows.

Reading:

Outlive
The many facets and forms of damage by high blood sugar, at the level of molecular biology.
Recent analyses of blood sugar variability, from researchers at Stanford
... with tool for anyone with a CGM to upload their own data

Where do cardiovascular problems, the biggest source of mortality, come from? Largely from metabolic problems in the control of blood sugar. Diabetics know that uncontrolled blood sugar is lethal, on both the acute and long-terms. But the rest of us need to realize that the damage done by swings in blood sugar are more insidious and pervasive than commonly appreciated. Both microvascular (what is commonly associated with diabetes, in the form of problems with the small vessels of the kidney, legs, and eyes) and macrovascular (atherosclerosis) are due to high and variable blood sugar. The molecular biology of this was impressively unified in 2005 in the paper above, which argues that excess glucose clogs the mitochondrial respiration mechanisms. Their membrane voltage maxes out, reactive forms of oxygen accumulate, and glucose intermediates pile up in the cell. This leads to at least four different and very damaging consequences for the cell, including glucose modification (glycation) of miscellaneous proteins, a reduction of redox damage repair capacity, inflammation, and increased fatty acid export from adipocytes to endothelial (blood vessel) cells. Not good!

Continuous glucose monitored concentrations from three representative subjects, over one day. These exemplify the low, moderate, and severe variability classes, as defined by the Stanford group. Line segments are individually classed as to whether they fall into those same categories. There were 57 subject in the study, of all ages, none with an existing diagnosis of diabetes. Yet five of them had diabetes by traditional criteria, and fourteen had pre-diabetes by those criteria. By this scheme, 25 had severe variability as their "glucotype", 25 had moderate variability, and only 7 had low variability. As these were otherwise random subjects selected to not have diabetes, this is not great news about our general public health, or the health system.

Additionally, a revolution has occurred in blood glucose monitoring, where anyone can now buy a relatively simple device (called a CGM) that gives continuous blood glucose monitoring to a cell phone, and associated analytical software. This means that the fasting blood glucose level that is the traditional test is obsolete. The recent paper from Stanford (and the literature it cites) suggests, indeed, that it is variability in blood glucose that is damaging to our tissues, more so than sustained high levels.

One might ask why, if blood glucose is such a damaging and important mechanism of aging, hasn't evolution developed tighter control over it. Other ions and metabolites are kept under much tighter ranges. Sodium ranges between 135 to 145 mM, and calcium from 8.8 to 10.7 mM. Well, glucose is our food, and our need for glucose internally is highly variable. Our livers are tiny brains that try very hard to predict what we need, based on our circadian rhythms, our stress levels, our activity both current and expected. It is a difficult job, especially now that stress rarely means physical activity, and nor does travel, in our automobiles. But mainly, this is a problem of old age, so evolution cares little about it. Getting a bigger spurt of energy for a stressful event when we, in our youth, are in crisis may, in the larger scheme of things, outweigh the slow decay of the cardiovascular system in old age. Not to mention that traditional diets were not very generous at all, certainly not in sugar and refined carbohydrates.

First, Twitter was turned into a dumpster fire, next the US.
US government to be rebranded as X.
Truth must die.
"Not strictly constitutional"
Let's go there.

Saturday, December 7, 2024

Cranking Up DNA, One Gyration at a Time

The mechanism of DNA gyrase, which supercoils bacterial DNA.

Imagine that you have a garden hose that is thirty miles long. How would you keep it from getting tangled? That is unlikely to be easy. Now add randomly placed heavy machinery that actively twists that hose as it travels / pulls along, causing it to wind up ahead, and unwind behind. And that machinery can be placed in either direction, often getting into head-on conflicts, not to mention going at quite different speeds. That is the problem our cells have, managing their DNA.

They use a set of topoisomerases to manage the topology of DNA- that is, its twist-i-ness. One easy method is to nick the DNA on one of its two strands, allowing it to relax by spinning around the remaining phosphate bond, before resealing it back to a double strand and sending it on its way. But what if you encounter coils or knots that can't be resolved that way? The next level is to cut one entire DNA molecule, not just one side/strand of it, and pass the conflicting one though it. All organisms contain topoisomerases of both kinds, and they are essential.

How DNA gets twisted. While most topoisomerases relax DNA (top) to resolve the many twisty problems posed by transcription and replication, gyrase increases twist by grabbing and holding a quasi-positive twist, then cutting and resolving it, as shown at bottom.

Bacteria have an additional enzyme that we do not have, called gyrase, to crank up the supercoiling of their DNA, to make it easier to open for transcription. Gyrase works just like a type II topoisomerase that cuts a double-stranded DNA and lets another DNA through, but it does so in a special way that puts a twist on the DNA first, so instead of relaxing the DNA, it increases the stress. How exactly that works has been a bit mysterious, though gyrases and the general principles they operate under have been clear for decades. Gyrase uses ATP, and grabs onto two parts of a DNA molecule, one of which is pre-twisted into coil, after which one is cut and the other passed through to create a change (-2) in the twisting number of that DNA.

A general model of gyrase action. The G segment of DNA is firmly held by the gyrase dimer in the center. The same DNA is forcibly twisted about, around the pinwheel structures, and bent back around to enter through the N-gate (as the T segment). Then, the N gate closes, paving the way for the G-segment to be cut and separated (step 3). ATP is the energy source behind all this structural drama. The T-segment then passes through the cut, enters the C-gate, and the cycle is complete.

A recent paper determined the structure of active gyrase complexes, and was able to trace the pre-twisted conformation. This, combined with a lot of past work on the ATPase and cleavage functions of gyrase, allows a reasonably full picture of how this enzyme works. It is a symetric dimer of a two-subunit protein, so there are four protein chains in all. There are three major regions of the full structure. The N-gate at top where one segment (the T-segment) of DNA binds, then the central DNA gate, where the other (G-segment) DNA binds and is later cut to let the T-segment through, and the C-gate, where the T segment ends up and is released at the end of the cycle.

Focus on the pinwheel structure that dramatically pre-twists the DNA around between the G and T segments, pre-positioning the complex for strand passage and increased supercoiling.

The magic is that the T-segment and the G-segment of DNA are parts of the same DNA molecule, by being wrapped around the ears of the protein, which are also called pinwheels. That is what the newest structure solves in greatest detail. These pinwheels essentially allow the enzyme to yank an otherwise normal DNA strand into a pre-knotted (positive supercoil) form that, when cut and resolved as shown, results in a negative increment of supercoiling or twist. If they mutated the pinwheels away, the enzyme could still hold, cut, and relax DNA, but it could not increase its supercoiling. It is the ability of the pinwheel structures to set up a pre-twisted structure onto the DNA that makes this enzyme a machine to increase negative supercoiling, and thus ease other DNA transactions.

Topoisomerase enzymes through evolution, from gyrase (left) to human topoII on the right. Note how the details of the protein structure are virtually unrecognizable, while the overall shape and DNA-binding stays the same.

Bacteria also have more normal type II topoisomerases that cut DNA merely to relax it, so one might wonder how these two enzymes get along. Well, gyrase is responsible for the overall negative supercoiling of the bacterial genome, while the other topoisomerases have more localized roles to relieve transient knots and over-twisting. Indeed, if you negatively twist DNA enough, you can separate its strands entirely, which is not usually desirable. Further research shows that too much of either topoisomerase is lethal, and that they are kept in balance by transcriptional controls over the amount of each topoisomerase. This suggests a futile cycle of DNA winding and unwinding, as the optimal condition in bacterial cells when both are present in just the right amounts.

What happened to independent grocery stores?
A family that reads like Chinatown.
De-growth will require a radical change in the governance of capitalism.
Business as usual.

Saturday, November 9, 2024

Rings of Death

We make pore-forming proteins that poke holes in cells and kill them. Why?

Gasdermin proteins are parts of the immune system, and exist in bacteria as well. It was only in 2016 that their mechanism of action was discovered, as forming unusual pores. The function of these pores was originally assumed to be offensive, killing enemy cells. But it quickly became apparent that they more often kill the cells that make them, as the culmination of a process called pyroptosis, a form of (inflammatory) cell suicide. Further work has only deepened the complexity of this system, showing that gasdermin pores are more dynamic and tunable in their action than originally suspected.

The structure is quite striking. The protein starts as an auto-inhibited storage form, sitting around in the cell. When the cell comes under attack, a cascade of detection and signaling occurs that winds up expressing a family of proteases called caspases. Some of these caspases can cut the gasdermin proteins, removing their inhibitory domain and freeing them to assemble into multimers. About 26 to 32 of these activated proteins can form a ring on top of a membrane (let's say the plasma membrane), which then cooperatively jut down their tails into the membrane and make a massive hole in it.

Overall structure of assembled gasdermin protein pores.

Simulations of pore assembly, showing how the trapped membrane lipids would pop out of the center, once pore assembly is complete.

These holes, or pores, are big enough to allow small proteins through, and certainly all sorts of chemicals. So one can understand that researchers thought that these were lethal events. And gasdermins are known to directly attack bacterial cells, being responsible in part for defense against Shigella bacteria, among others. But then it was found that gasdermins are the main way that important cytokines like the highly pro-inflammatory IL-1β get out of the cell. This was certainly an unusual mode of secretion, and the gasdermin D pore seems specifically tailored, in terms of shape and charge, to conduct the mature form of IL-1β out of the cell.

It also turned out that gasdermins don't always kill their host cells. Indeed, they are far more widely used for temporary secretion purposes than for cell killing. And this secretion can apparently be regulated, though the details of that remain unclear. In structural terms, gasdermins can apparently form partial and mini-pores that are far less lethal to their hosts, allowing, by way of their own expression levels, a sensitive titration of the level of response to whatever danger the cell is facing.

Schematic of how lower concentrations of gasdermin D (lower path, blue) allow smaller pores to form with less lethality.

Equally interesting, the bacterial forms of gasdermin have just begun to be studied. While they may have other functions, they certainly can kill their host cell in a suicide event, and researchers have shown that they can shut down phage infection of a colony or lawn of bacterial cells. That is, if a phage-infected cell can signal and activate its gasdermin proteins fast enough, it can commit suicide before the phage has time to fully replicate, beating the phage at its own race of infection and propagation.

Bacteria committing suicide for the good of the colony or larger group? That introduces the theme of group selection, since committing suicide certainly doesn't do the individual bacterium any good. It is only in a family group, clonal colony, or similar community that suicide for the sake of the (genetically related) group makes sense. We, as multicellular organisms, are way past that point. Our cells are fully devoted to the good of the organism, not themselves. But to see this kind of heroism among bacteria is, frankly, remarkable.

Bacteria have even turned around to attack the attacker. The Shigella bacteria mentioned above, which are directly killed by gasdermins, have evolved an enzymatic activity that tags gasdermin with ubiquitin, sending it to the cellular garbage disposal and saving themselves from destruction. It is an interesting validation of the importance of gasdermins and the arms race that is afoot, within our bodies.

A tortured ballot.
Great again? Corruption and degradation is our lot.
We may be in a (lesser) Jacksonian age. Populism, bad taste, big hair, and mass deportation.
Beautiful Jupiter.
Bill Mitchell on our Depression job guarantee: "So for every $1 outlaid the total societal benefits were around $6 over the lifetime of the participant."
US sanctions are scrambling our alliances and the financial system.
Solar works for everyone.

Saturday, October 26, 2024

A Hunt for Causes of Atherosclerosis

Using the most advanced tools of molecular biology to sift through the sands of the genome for a little gold.

Blood vessels have a hard life. Every time you put on shoes, the vessels in your feet get smashed and smooshed, for hours on end. And do they complain? Generally, not much. They bounce back and make do with the room you give them. All through the body, vessels are subject to the pumping of the heart, and variations in blood volume brought on by our salt balance. They have to move when we do, and deal with it whenever we sit or lie on them. Curiously, it is the veins in our legs and calves, that are least likely to be crushed in daily life, that accumulate valve problems and go varicose. Atherosclerosis is another, much more serious problem in larger vessels, also brought on by age and injury, where injury and inflammation of the lining endothelial cells can lead to thickening, lipid/cholesterol accumulation, necrosis, calcification, and then flow restriction and fragmentation risk.

Cross-section of a sclerotic blood vessel. LP stands for lipid pool, while the box shows necrotic and calcified bits of tissue.

The best-known risk factors for atherosclerosis are lipid-related, such as lack of liver re-capture of blood lipids, or lack of uptake around the body, keeping cholesterol and other lipid levels high in the blood. But genetic studies have found hundreds of areas of the genome with risk-conferring (or risk-reducing) variants, most of which are not related to lipid management. These genome-wide association studies (or GWAS) look for correlations between genetic markers and disease in large populations. So they pick up a lot of low-impact genetic variations that are difficult to study, due to their large number and low impact, which can often imply peripheral / indirect function. High-impact variations (mutations) tend to not survive in the population very long, but when found tend to be far more directly involved and informative.

A recent paper harnessed a variety of modern tools and methods to extract more from the poor information provided by GWAS. They come up with a fascinating tradeoff / link between atherosclerosis and cerebral cavernous malformation (CCM), which is distinct blood vessel syndrome that can also lead to rupture and death. The authors set up a program of analysis that was prodigious, and only possible with the latest tools.

The first step was to select a cell line that could model the endothelial cells at issue. Then they loaded these cells with custom expression-reducing RNA regulators against each one of the ~1600 genes found in the neighborhood of the mutations uncovered by the GWAS analyses above, plus 600 control genes. Then they sequenced all the RNA messages from these single cells, each of which had received one of these "knock-down" RNA regulators. This involved a couple hundred thousand cells and billions of sequencing reads- no simple task! The point was to gather comprehensive data on what other genes were being affected by the genetic lesion found in the GWAS population, and then to (algorithmically) assemble them into coherent functional groups and pathways which could both identify which genes were actually being affected by the original mutations, and also connect them to the problems resulting in atherosclerosis.

Not to be outdone, they went on to harness the AlphaFold program to hunt for interactions among the proteins participating in some of the pathways they resolved through this vast pipeline, to confirm that the connections they found make sense.

They came up with about fifty different regulated molecular programs (or pathways), of which thirteen were endothelial cell specific. Things like angiogenesis, wound healing, flow response, cell migration, and osmoregulation came up, and are naturally of great relevance. Five of these latter programs were particularly strongly connected to coronary artery disease risk, and mostly concerned endothelial-specific programs of cell adhesion. Which makes sense, as the lack of strong adhesion contributes to injury and invasion by macrophages and other detritus from the blood, and adhesion among the endothelial cells plays a central role in their ability / desire to recover from injury, adjust to outside circumstances, reshape the vessel they are in, etc.

Genes near GWAS variations and found as regulators of other endothelial-related genes are mapped into a known pathway (a) of molecular signaling. The color code of changed expression refers to the effect that the marked gene had on other genes within the five most heavily disease-linked programs/pathways. The numbers refer to those programs, (8=angiogenesis and osmoregulation, 48=cell adhesion, 35=focal adhesion, related to cell adhesion, 39=basement membrane, related to cell polarity and adhesion, 47=angiogenesis, or growth of blood vessels). At bottom (c) is a layout of 41 regulated genes within the five disease-related programs, and how they are regulated by knockdown of the indicated genes on the X axis. Lastly, in d, some of these target genes have known effects on atherosclerosis or vascular barrier syndromes when mutated. And this appears to generally correlate with the regulatory effects of the highlighted pathway genes.

"Two regulators of this (CCM) pathway, CCM2 and TLNRD1, are each linked to a CAD (coronary artery disease) risk variant, regulate other CAD risk genes and affect atheroprotective processes in endothelial cells. ... Specifically, we show that knockdown of TLNRD1 or CCM2 mimics the effects of atheroprotective laminar blood flow, and that the poorly characterized gene TLNRD1 is a newly identified regulator in the CCM pathway."

On the other hand, excessive adhesiveness and angiogenesis can be a problem as well, as revealed by the reverse correlation they found with CCM syndrome. The interesting thing was that the gene CCM2 came up as one of strongest regulators of the five core programs associated with atherosclerosis risk mutations. As can be guessed from its name, it can harbor mutations that lead to CCM. CCM is a relatively rare syndrome (at least compared with coronary artery disease) of localized patches of malformed vessels in the brain, which are prone to rupture, which can be lethal. CCM2 is part of a protein complex, with KRIT1 and PDCD10, and part of a known pathway from fluid flow sensing receptors to transcription regulators (TFs) that turn on genes relevant to the endothelial cells. As shown in the diagram above, this pathway is full of genes that came up in this pathway analysis, from the atherosclerosis GWAS mutations. Note that there is a repression effect in the diagram above (a) between the CCM complex and the MAP kinase cascade that sends signals downstream, accounting for the color reversal at this stage of the diagram.

Not only did they find that this known set of three CCM gene are implicated in the atherosclerosis mutation results, but one of the genes they dug up through their pipeline, TLNRD1, turned out to be a fourth, hitherto unknown, member of the CCM complex, shown via the AlphaFold program to dock very neatly with the others. It is loss of function mutations of genes encoding this complex, which inhibits the expression of endothelial cell pro-cell adhesion and pro-angiogenesis sets of genes, that cause CCM, unleashing these angiogenesis genes to do too much.

The logic of this pathway overall is that proper fluid flow at the cell surface, as expected in well-formed blood vessels, activates the pathway to the CCM complex, which then represses programs of new or corrective angiogenesis and cell adhesion- the tissue is OK as it is. Conversely, when turbulent flow is sensed, the CCM complex is turned down, and its target genes are turned up, activating repair, revision, and angiogenesis pathways that can presumably adjust the vessel shape to reduce turbulence, or simply strengthen it.

Under this model, malformations may occur during brain development when/where turbulent flow occurs, reducing CCM activation, which is abetted by mutations that help the CCM complex to fall apart, resulting (rarely) in run-away angiogenesis. The common variants dealt with in this paper, that decrease risk of cardiovascular disease / atherosclerosis, appear to have similar, but much weaker effects, promoting angiogenesis, including recovery from injury and adhesion between endothelial cells. In this way, they keep the endothelium tighter and more resistant to injury, invasion by macrophages, and all the downstream sequelae that result in atherosclerosis. Thus strong reduction of CCM gene function is dangerous in CCM syndrome, but more modest reductions are protective in atherosclerosis, setting up a sensitive evolutionary tradeoff that we are clearly still on the knife's edge of. I won't get into the nature of the causal mutations themselves, but they are likely to be diffuse and regulatory in the latter case.

Image of the CCM complex, which regulates response to blood flow, and whose mutations are relevant both to CCM and to atherosclerosis. The structures of TLNRD1 and the docking complex are provided by AlphaFold.

This method is particularly powerful by being unbiased in its downstream gene and pattern finding, because it samples every expressed gene in the cell and automatically creates related pathways from this expression data, given the perturbations (knockdown of expression) of single target genes. It does not depend on using existing curated pathways and literature that would make it difficult to find new components of pathways. (Though in this case the "programs" it found align pretty closely with known pathways.) On the other hand, while these authors claim that this method is widely applicable, it is extremely arduous and costly, as evidenced by the contribution of 27 authors at top-flight institutions, an unusually large number in this field. So, for diseases and GWAS data sets that are highly significant, with plenty of funding, this may be a viable method of deeper analysis. Otherwise, it is beyond the means of a regular lab.

A backgrounder on sedition, treason, and insurrection.
And why it matters.
Jan 6 was an attempted putsch.
Trumpies for Putin.
Solar is a no-brainer.
NDAs are blatantly illegal and immoral. One would think we would value truth over lies.

Saturday, September 28, 2024

Dangerous Memories

Some memory formation involves extracellular structures, DNA damage, and immune component activation / inflammation.

The physical nature of memories in the brain is under intensive scrutiny. The leading general theory is that of positive reinforcement, where neurons that are co-activated strengthen their connections, enhancing their ability to co-fire and thus to express the same pattern again in the future. The nature of these connections has been somewhat nebulous, assumed to just be the size and stability of their synaptic touch-points. But it turns out that there is a great deal more going on.

A recent paper started with a fishing expedition, looking at changes in gene expression in neurons at various time points after the mice were subjected to a fear learning regimen. They took this out to much longer time points (up to a month) than had been contemplated previously. At short times, a bunch of well-known signals and growth-oriented gene expression happened. At the longest time points, organization of a structure called the perineural net (PNN) was read out of the gene expression signals. This is a extracellular matrix sheath that appears to stabilize neuronal connections and play a role in long-term memory and learning.

But the real shocker came at the intermediate time point of about four days. Here, there was overexpression of TLR9, which is an immune system detector of broken / bacterial DNA, and inducer in turn of inflammatory responses. This led the authors down a long rabbit hole of investigating what kind of DNA fragmentation is activating this signal, how common this is, how influential it is for learning, and what the downstream pathways are. Apparently, neuronal excitation, particularly over-excitation that might be experienced under intense fear conditions, isn't just stressful in a semiotic sense, but is highly stressful to the participating neurons. There are signs of mitochondrial over-activity and oxidative stress, which lead to DNA breakage in the nucleus, and even nuclear perforation. It is a shocking situation for cells that need to survive for the lifetime of the animal. Granted, these are not germ cells that prioritize genomic stability above all else, but getting your DNA broken just for the purpose of signaling a stress response that feeds into memory formation? That is weird.

Some neuronal cell bodies after fear learning. The red dye is against a marker of DNA repair proteins, which form tight dots around broken DNA. The blue is a general DNA stain, and the green is against a component of the nuclear envelope, showing here that nuclear envelopes have broken in many of these cells.

The researchers found that there are classic signs of DNA breakage, which are what is turning on the TLR9 protein, such as seeing concentrated double-strand DNA repair complexes. All this stress also turned on proteases called caspases, though not the cell suicide program that these caspases typically initiate. Many of the DNA break and repair complexes were, thanks to nuclear perforation, located diffusely at the centrosome, not in the nucleus. TLR9 turns on an inflammatory response via NFKB / RELA. This is clearly a huge event for these cells, not sending them into suicide, but all the alarms short of that are going off.

The interesting part was when the researchers asked whether, by deleting the TLR9 or related genes in the pathway, they could affect learning. Yes, indeed- the fear memory was dependent on the expression of this gene in neurons, and on this cell stress pathway, which appears to be the precondition of setting up the perineural net structures and overall stabilization. Additionally, the DNA damage still happened, but was not properly recognized and repaired in the absence of TLR9, creating an even more dangerous situation for the affected neurons- of genomic instability amidst unrepaired DNA.

When TRL9 is knocked out, DNA repair is cancelled. At bottom are wild-type cells, and at top are mouse neurons after fear learning that have had the gene TLR9 deleted. The red dye is against DNA repair proteins, as is the blue dye in the right-most frames. The top row is devoid of these repair activities.

This paper and its antecedent literature are making the case that memory formation (at least under these somewhat traumatic conditions- whether this is true for all kinds of memory formation remains to be seen) has commandeered ancient, diverse, and quite dangerous forms of cell stress response. It is no picnic in the park with madeleines. It is an all-hands-on-deck disaster scene that puts the cell into a permanently altered trajectory, and carries a variety of long-term risks, such as cancer formation from all the DNA breakage and end-joining repair, which is not very accurate. They mention in passing that some drugs have been recently developed against TLR9, which are being used to dampen inflammatory activities in the brain. But this new work indicates that such drugs are likely double-edged swords, that could impair both learning and the long-term health of treated neurons and brains.

Cutting the line to the American Dream.

Saturday, August 24, 2024

Aging and Death

Our fate was sealed a very long time ago.

Why do we die? It seems like a cruel and wasteful way to run a biosphere, not to mention a human life. After we have accumulated a lifetime of experience and knowledge, we age, decline, and sign off, whether to go to our just reward, or into oblivion. What is the biological rationale and defense for all this, which the biblical writers assigned to the fairy tale of the snake and the apple?

A recent paper ("A unified framework for evolutionary genetic and physiological theories of aging") discusses evolutionary theories of aging, but in typical French fashion, is both turgid and uninteresting. Aging is widely recognized as the consequence of natural selection, or more precisely, the lack thereof after organisms have finished reproducing. Thus we are at our prime in early adulthood, when we seek mates and raise young. Evolutionarily, it is all downhill from there. In professional sports, athletes are generally over the hill at 30, retiring around 35. Natural selection is increasingly irrelevant after we have done the essential tasks of life- surviving to mate and reproduce. We may participate in our communities, and do useful things, but from an evolutionary perspective, genetic problems at this phase of life have much less impact on reproductive success than those that hit earlier.

All this is embodied in the "disposable soma" theory of aging, which is that our germ cells are the protected jewels of reproduction, while the rest of our bodies are, well, disposable, and thus experience all the indignities of age once their job of passing on the germ cells is done. The current authors try to push another "developmental" theory of aging, which posits that the tradeoffs between youth and age are not so much the resources or selective constraints focused on germ cell propagation vs the soma, but that developmental pathways are, by selection, optimized for the reproductive phase of life, and thus may be out of tune for later phases. Some pathways are over-functional, some under-functional for the aged body, and that imbalance is sadly uncorrected by evolution. Maybe I am not doing justice to these ideas, which maybe feed into therapeutic options against aging, but I find this distinction uncompelling, and won't discuss it further.

A series of unimpressive distinctions in the academic field studying aging from an evolutionary perspective.

Where did the soma arise? Single cell organisms are naturally unitary- the same cell that survives also mates and is the germ cell for the next generation. There are signs of aging in single cell organisms as well, however. In yeast, "mother" cells have a limited lifespan and ability to put out daughter buds. Even bacteria have "new" and "old" poles, the latter of which accumulate inclusion bodies of proteinaceous junk, which apparently doom the older cell to senescence and death. So all cells are faced with processes that fail over time, and the only sure bet is to start as a "fresh" cell, in some sense. Plants have taken a distinct path from animals, by having bodies and death, yes, but being able to generate germ cells from mature tissues instead of segregating them very early in development into stable and distinct gonads.

Multicellularity began innocently enough. Take slime molds, for example. They live as independent amoebae most of the time, but come together to put out spores, when they have used up the local food. They form a small slug-like body, which then grows a spore-bearing head. Some cells form the spores and get to reproduce, but most don't, being part of the body. The same thing happens with mushrooms, which leave a decaying mushroom body behind after releasing their spores.

We don't shed alot of tears for the mushrooms of the world, which represent the death-throes of their once-youthful mycelia. But that was the pattern set at the beginning- that bodies are cells differentiated from the germ cells, that provide some useful, competitive function, at the cost of being terminal, and not reproducing. Bodies are forms of both lost energy and material, and lost reproductive potential from all those extra cells. Who could have imagined that they would become so ornate as to totally overwhelm, in mass and complexity, the germ cells that are the point of the whole exercise? Who could have imagined that they would gain feelings, purposes, and memories, and rage against the fate that evolution had in store for them?

On a more mechanistic level, aging appears to arise from many defects. One is the accumulation of mutations, which in soma cells lead to defective proteins being made and defective regulation of cell processes. An extreme form is cancer, as is progeria. Bad proteins and other junk like odd chemicals and chemically modified cell components can accumulate, which is another cause of aging. Cataracts are one example, where the proteins in our lenses wear out from UV exposure. We have quite intricate trash disposal processes, but they can't keep with everything, as we have learned from the advent of modern chemistry and its many toxins. Another cause is more programmatic: senescent cells, which are aged-out and have the virtue that they are blocked from dividing, but have the defect that they put out harmful signals to the immune system that promote inflammation, another general cause of aging.

Aging research has not found a single magic bullet, which makes sense from the evolutionary theory behind it. A few things may be fixable, but mostly the breakdowns were never meant to be remedied or fixed, nor can they be. In fact, our germ cells are not completely immune from aging either, as we learn from older fathers whose children have higher rates of autism. We as somatic bodies are as disposable as any form of packaging, getting those germ cells through a complicated, competitive world, and on to their destination.

Climate policy as foreign policy.
The future of home and grid energy.
Sometimes AI is not so impressive.
Are Christians better than other Americans?

Sunday, August 11, 2024

Modeling Cell Division

Is molecular biology ready to use modeling to inform experimental work?

The cell cycle is a holy grail of biology. The first mutants that dissected some of its regulatory apparatus, the CDC mutants of Saccharomyces cerevisiae (yeast), electrified the field and led to a Nobel prize. These were temperature sensitive mutants, making only small changes to the protein sequence that rendered that protein inactive at high temperature (thus inducing a cell cycle arrest phenotype), while allowing wild-type growth at normal temperatures. In the fifty years since, a great deal of the circuitry has been worked out, with the result that it is now possible, as a recent paper describes, to make a detailed mathematical model of the process that claims to be useful in the sense of explaining existing findings in a unified model and making predictions of places to look for additional actors.

At the center of this regulatory scheme are transcription activators, SBF/MBF, that are partly controlled by, and in turn control the synthesis of, a series of cyclins. Cyclins are proteins that were observed (another Nobel prize) to have striking variations in abundance during the cell cycle. There are characteristic cyclins for each phase of the cell cycle, which goes from G1, a resting phase, to S, which is DNA replication, to G2, a second resting phase, and then M, which is mitosis, which brings us back to G1. Cyclins work by binding to a central protein kinase, Cdc28, which, as regulated by each distinct cyclin, phosphorylates and thus regulates distinct sets of target proteins. The key decision a cell has to make is whether to commit to DNA replication, i.e. S phase. No cell wants to run out of energy during this process, so its size and metabolic state needs to be carefully monitored. That is done by Cyclin 3 (Cln3), Whi5, and Bck2, which each influence whether the SBF/MBF regulators are active.

Some highly simplified elements of the yeast cell cycle. Cyclins (Cln and Clb) are regulators of a central protein kinase, Cdc28, that direct it to regulate appropriate targets at each stage of the cell cycle. Cyclins themselves are regulated by transcriptional control (here, the activators SBF and MBF), and then destroyed at appropriate times by proteolysis, rendering them abundant only at specific times during the cell cycle. Focusing on the "START" process that starts the process from rest (G1 phase) to new bud formation and DNA replication (S phase), Cln3 and Bck2 respond to upstream nutritional and size cues, and each activate the SBF/MBF transcription activator.

As outlined in the figure above, Cyclin 3 is the G1 cyclin, which, in complex with Cdc28 phosphorylates Whi5, turning it off. Whi5 is an inhibitor that binds to SBF/MBF, so the Cyclin 3 activation turns these regulators on, and thus starts off the cell cycle under the proper conditions. Incidentally, the mammalian version of Whi5, Rb (for retinoblastoma), is a notorious oncogene, that, when mutated, releases cells from regulatory control over cell division. SBF and MBF bind to genes for the next series of cyclins, Cln1, Cln2, Clb5, Clb6. The first two are further G1 cyclins that orchestrate the end of G1. They induce phosphorylation and inactivation of Sic1 and Cdc6, which are inhibitors of Clb5 and Clb6. These latter two are then the initiators of S phase and DNA replication. Meanwhile, Cln3 stays around till M phase, but is then degraded in definitive fashion by the proteases that end M phase. Starvation conditions lead to rapid degradation of Cln3 at all times, and thus to no chance of starting a new cell cycle.

Charts of the abundance of some cyclins through the cell cycle. Each one has its time to shine, after which it is ubiquitinated and sent off to the recycling center / proteasome.

Bck2 is another activator of SBF/MBF that is unrelated to the Cln3/Whi5 system, but also integrates cell size and metabolic status information. Null mutants of Cln3 (or Bck2) are viable, if altered in cell cycle, while double null mutants of Cln3 and Bck2 are dead, indicating that these regulators are each important, in a complementary way, in cell cycle control. Given that little is known about Bck2, the modelers in this paper assume various properties and hope for the best down the line, predicting that cell size (at the key transition to S phase) is more affected in the Cln3 null mutant than in the Bck2 null mutant, since in the former, excess active Whi5 soaks up most of the available SBF/MBF, and requiring extra-high and active levels of Bck2 to overcome this barrier and activate the G1 cyclins and other genes.

The modelers are working from the accumulated, mostly genetic data, and in turn validate their models against the same genetic data, plus a few extra mutants they or others have made. The models are mathematical representations of how each node (i.e protein, or gene) in the system responds to the others, but since there are a multitude of unknowns, (such as what really regulates Bck2 from upstream, to cite just one example), the system is not really able to make predictions, but rather fine-tunes/reconciles what knowledge there is, and, at best, points to gaps in knowledge. It is a bit like AI, which magically recombines and regurgitates material from a vast corpus based on piece-wise cues, but is not going to find new data, other than through its notorious hallucinations.

For example, a new paper came out after this modeling, which finds that Cln3 affects Cln2 abundance by mechanisms quite apart from its SBF/MBF transcriptional control, and that it regulates cell size in large part at M phase, not through its G1/S gating. All this comes from new experimental work, unanticipated by the modeling. So, in the end, experimental work always trumps modeling, which is a bit different than how things are in, say, physics, where sometimes the modeling can be so strong that it predicts new particles, forces, and other phenomena, to be validated later experimentally. Biology may have its master predictive model in the theory of evolution, but genetics and molecular biology remain much more of an empirical slog through the resulting glorious mess.

Bitcoin isn't a currency, but rather just another asset class, one without any fundamental or socially positive value. A little like gold, actually, except without gold's resilience against social / technological disruption.
The disastrous post-Soviet economic transition, on our advice.
The enormous labor drain, and resource drain, from global South to North.

Saturday, June 15, 2024

The Quest for the Perfect Message, in E. coli

Translation efficiency has some weird rules, and a tortured history.

One would think we know everything there is to know about the workhorse of bacterial molecular biology, Escherichia coli. And that would be especially true for its technological applications, like the expression of engineered genes, which is at the very heart of molecular biology and much of biotechnology. Getting genes you put into E. coli expressed at high levels is critical for making drugs, and for making enough for structural and biochemical studies. For decades, the wisdom of the field was to design introduced genes using the codon adaptation index (CAI). This is a measurement of the three-letter codes (codons of the genetic code) that are used in highly expressed genes. They tend to correspond to tRNAs that are more abundant in the cell. So, for example, the amino acid leucine is encoded by six different codons, any of which can be chosen at intended leucine positions in the intended protein. In E. coli, CTG is over ten times more frequently used than CTA, however. Thus, even though they code for the same amino acid, one is more common, perhaps because its cognate tRNA is more common and more easily used during translation. This is basically a diffusion-based argument, that translation will be easier if the tRNA that carries the next amino acid is easier to find.

A recent paper provides a remarkable review of this field. For one thing, it turns out that use of the CAI has virtually no effect on translation efficiency. Whether using rare or common codons, translation is equally efficient for introduced genes. Needless to say, this is quite surprising. It seems as though the role of common vs uncommon tRNAs/codons is more to manage the health of the cell by relieving bottlenecks to translation in a global sense and managing the free pool of ribosomes, rather than regulating the efficiency of translation of any particular mRNA message. tRNAs are highly abundant generally, so there are significant savings possible by managing their levels judiciously, and reducing investment in some versus others.

So what does affect the efficiency of translation? Some messages are better translated than others, after all. The authors point to a completely different mechanism, which is the melting stability of the first ten codons of the mRNA message. RNA can form hairpin and other secondary structures / shapes, and this can apparently strongly affect the ability of ribosomes to find initiation sites. While eukaryotic ribosomes scan in from the 5 prime cap of the mRNA, bacterial ribosomes bind directly to a sequence slightly upstream of the initiating AUG codon. And this can be inhibited by mRNAs that are not neatly ironed out, but knotted up in hairpins and loops.

Ratio of occurrence of nucleosides in the third codon position of the first ten codons of high versus low expressing genes in E. coli. This was not run on native E. coli genes, but on a large panel of transgenes engineered from outside. The strong bias towards A at this position in high expressing genes shows a preference for initiating sequences to have weak secondary structure, allowing better ribosome access.

Use of A-rich sequences around the ribosomal initiation sites and the first ten codons, then, dramatically increases the translation efficiency, (via the initiation efficiency) of introduced genes, and provide a much more robust method to control their expression. But then the authors make another observation, which is that the bacteria themselves do not seem to use this mechanism for their own genes. In a massive analysis of data from other labs, (below), there is actually a negative correlation between the quality of the initiation region (X- axis) and the abundance of the respective protein (Y- axis). Again, quite a surprising result, which the authors can only speculate about.

There is negative correlation between the initiation codon quality (X- axis), as shown above, and the native E. coli gene expression level (Y- axis). So these cells are not optimizing their translation at all in accordance with the findings above.

The picture that they paint is that highly expressed genes in E. coli benefit from consistent, smooth translation. This depends less on maximal initiation speed than on the holistic picture of translation. The CAI optimal codons (called translationally optimal in this paper, or TO) tend to be poor at initiation, but have good codon-anticodon pairing and thus low A content. So there are conflicting pressures at work, in basic chemical terms, where different codons are intrinsically good for initiation, and complementary ones for elongation. The obvious solution is to use the initiation-optimal codons for the first ten codons, and translationally optimal codons the rest of the way. But that is not what is found either. The authors claim that, for native proteins, lower levels of initiation are actually beneficial for smoother protein production with less noise from time to time and cell to cell.

Additionally, lower initiation rates preserve free ribosome levels globally, another important goal for the cell, via evolutionary selection. The authors find, for instance, a correlation between low variability of initiation (low noise) and low initiation rate. This is a bit mystifying, since ribosomes should always be present in excess, and it is not immediately apparent why holdups to translation initiation would lend themselves to more even initiation. Perhaps the search process by which ribosomes find free mRNAs is inefficient, so that those with slower initiation sequences have a constant backlog of incoming, bound and poised ribosomes, while after they get past the initiation region, those ribosomes progress rapidly and rejoin the free pool. That would be one way of setting up a smooth production process, suitable for essential protein products, that is relatively insensitive to the free ribosome concentration and other variations in the cell.

Technologists trying to express some drug-associated protein in bacteria don't care about smoothness and noise, but just want to maximize production while not killing the cell (or before killing the cell). So all these subtle considerations that go into the evolution of the native gene complement of E. coli and its high or low expression levels don't apply. But for researchers trying to predict the expression level of a given natural gene, it is maddening, since it seems currently impossible to predict the expression level (via translation) of a gene from its sequence. It is one more case where modeling of what is going on inside cells is surprisingly difficult, even for a system we had thought we understood, in one of the simplest and most well-studied bacteria. As researchers never tire of saying ... more research is needed.

War suits Russia very well.
The same old Microsoft.

Saturday, June 8, 2024

A Membrane Transistor

Voltage sensitive domains can make switches out of ion channels, antiporters, and other enzymes.

The heart of modern electronics is the transistor. It is a valve or switch, using a small electrical signal to control the flow of other electrical signals. We have learned that the simple logic this mechanism enables can be elaborated into hugely complex, even putatively intelligent, computers, databases, applications, and other paraphernalia of modernity. The same mechanism has a very long history in biology, quite apart from its use in neurons and brains, since membranes are typically charged, well-poised to be sensitive to changes in charge for all sorts of signaling.

The voltage sensitive domain (VSD) in proteins is an ancient (going back to archaea) bundle of four alpha helices that were first found attached to voltage-sensitive ion channels, including sodium, potassium, and calcium channels. But later it became fascinatingly apparent that it can control other protein activities as well. A recent paper discussed the mechanism and structure of a sodium/hydrogen antiporter with a role in sperm navigation, which uses a VSD to control its signaling. But there are also voltage-sensitive phosphatases, and other kinds of effectors hooked up to VSD domains.

Schematic of a basic VSD, with helix 4 in pink, moving against the other three helices colored teal. Imagine a membrane going horizontally over these embedded proteins. When voltage across the local membrane changes, (hyperpolarized or de-polarized), helix 4 can plunge by one helical repeat unit in either direction, up or down.

One of the helixes (#4) in the VSD bundle has positive charges, while the others have specifically positioned negative charges. This creates a structure where changes in the ambient voltage across the membrane it sits in can cause helix #4 to plunge down by one or two steps (that is, turns of the alpha helix) versus its partners. This movement can then be propagated out along extensions of helix #4 to other domains of the protein in order to switch on or off their activities.

The helices of numerous proteins that have a VSD domain (in red) are drawn out, showing the diversity of how this domain is used.

While the studied protein, SLC9C1, is essential in mammalian sperm for motility, the paper studied its workings in sea urchin sperm, a common model system. The logic (as illustrated below) is that (female) chemoattractants bind to receptors on the sperm surface. These receptors generate cyclic GMP, which turns on potassium channels that increase the voltage across the membrane. This broadcasts the signal locally, and is received by the SLC9C1 transporter, which does two things. It activates a linked soluble adenylate cyclase enzyme, making the further signaling molecule cAMP. And it also activates the transporter itself, pumping protons out (in return 1:1 for sodium ions in) and causing cytoplasmic alkalinization. The cAMP activates sodium ion channels to cancel the high membrane voltage (a fast process), and the alkalinization activates calcium channels that direct the sperm directional swimming responses- the ultimate response. The latter is relatively slow, so the whole cascade has timing characteristics that allow the signal to be dampened, but the response to persist a bit longer as the sperm moves through a variable and stochastic gradient.

A schematic of the logic of this pathway, and of the SLC9C1 anti-porter. At top, the transport mechanism is crudely illustrated as a rocking motion that ensures that only one H+ is exchanged for one Na+ for each cycle of transport. The transport is driven thermodynamically by the higher concentration of Na+ outside.

But these researchers weren't interested in what the sperm were thinking, but rather how this widely used protein domain became hitched to this unusual protein and how it works there, turning on a sodium/hydrogen antiporter rather than the usual ion channel. They estimate that the #4 helix of the VSD moves by 10 angstroms, or 1 nm, upon voltage activation, which is a substantial movement, roughly equivalent to the width of these helices. In their final model, this movement significantly reshapes the intracellular domain of the transporter, which in turn releases its hold on the transporter's throat, allowing it to move cyclically as it needs to exchange hydrogen ions for sodium ions. This protein is known to bind and activate an adenylyl cyclase, which produces cAMP, which is one key next actor in the signaling cascade. This activation may be physically direct, or it may be through the local change in pH- that part is as yet unknown. cAMP also, incidentally, binds to and turns up the activity of this transporter, providing a bit of positive feedback.

Model of the SLC9C1 protein, with the VSD in teal and a predicted activation mechanism illustrated (only the third panel is activated/open). Upon voltage activation, the very long helix 4 dips down and changes orientation, dramatically opening the intracellular portion of the transporter (purple and orange portion). This in turn lets go of the bottom of the actual transporter portion of the protein (gray), allowing alkalinization of the cytoplasm to go forth. At the bottom sides, in brown, is the cAMP binding domain, which lowers the voltage threshold for activation.

There are a variety of interesting lessons from this work. One is that useful protein domains like VSD are often duplicated and propagated to unexpected places to regulate new processes. Another is that the new cryo-electron microscopy methods have made structural biology like this far easier and more common than it used to be, especially for membrane proteins, which are exceedingly difficult to crystalize. A third is that signaling systems in biology are shockingly complex. One would think that getting sperm cells to where they are going would take a bare minimum of complexity, yet we are studying a five or more part cascade involving two cyclic nucleotides, four ions, intricate proteins to manage them all, and who knows what else into the mix. It is difficult to account for all this, other than to say that when you have a few billion years to tinker with things, and have eons of desperate races to the egg for selective pressure, they tend to get more ornate. And a fourth is that it is regulatory switches all the way down.

A tale of two empires.
Epicureanism
UCP1 and the evolution of warm-bloodedness.
Can your amoeba do this?
Bonkers discussion of consciousness and panpsychism.