May 8, 2025

Answering this question may seem straightforward, but actually requires an odyssey through information theory and molecular biology.

9 Comments

Chris Adami

May 8, 2025

If only someone had written a book about that subject. We wouldn't have to guess so much. A book like this one perhaps. https://press.princeton.edu/books/paperback/9780691241142/the-evolution-of-biological-information

saar

May 9, 2025

I wonder if you could get an estimate of your "phenotypic Kolmogorov complexity" from gene essentiality or better yet one of those reductionist synthetic biology attempts to make a minimal cell.

Though you run into the problem of defining phenotypically identical. Proving that a genetic element is *never* useful seems impossible : maybe you simply haven't tested the condition where it is used. e.g. lab strains of S. Cerevicae commonly have genes required for sporulation knocked out, but you'd never conclude they were necessary (or "informative") unless you deprived them of nutrients : then you'd see that the wild-type made spores, and the mutants did not. As the old saw goes : "The knockout mouse has no phenotype", "Well, did you take it to the opera?" (implication is that the phenotype may only be observable under opera conditions). So maybe the information figure would be contingent on a "reasonable" phenotyping panel?

Also a minor nitpick: there is X-Y crossing-over on the PARs, I believe.

Patrick Martin

May 8, 2025

Pretty informative! This reminds me of a paper entitled : "The Genomic Code: The genome instantiates a generative model of the organism".

They argue that the genome is a latent representation of an organisms. Conceptually similar to this compressed view of DNA. Interestingly, some stretch of "useless" DNA could be there for structural regulation (TADs, LADs, etc).

Tom

May 8, 2025

I think some simple organisms have much more DNA than humans. Any idea why that is and how that fits with your information estimates?

Reply (1)

dynomight

May 8, 2025

If you take (say) lungfish DNA, they have much more repetitive elements / jumping genes than humans. The exact cause of this seems to be unclear. But the impact in terms of information is that while they have more "storage space", I doubt they actually have more "information". That is, I speculate that you could theoretically engineer DNA to create a lungfish-like organism with vastly smaller DNA.

Jonathan McMenamin-Balano

May 8, 2025

Nice, tightly compressed bit of information here.

Comment deleted

May 8, 2025

Comment deleted

dynomight

May 8, 2025

I believe it's substantial!

"DNA is a “blueprint” for a cell. But information is needed to interpret that blueprint. Imagine a machine that could take in a DNA sequence and build a human cell. How many bits would be needed to describe that machine? A lot, right?

Of course, there’s a recursive “chicken and the egg” issue here: The machines that actually make human cells from DNA are… other human cells. But you need some information to get the loop started!"

(Although I have no idea how to quantify it.)

https://dynomight.net/data-wall/#its-not-just-dna

Comment deleted

Comment deleted

I certainly agree that if you're thinking about an adult human, information comes from everywhere. But I think it's reasonable to think about how much of that comes from (A) DNA in the gametes, (B) the physical structure of the gametes, and (C) everything else. So I guess the "phenotypic Kolmogorov complexity" would only be looking at category A, category B seems hard, and category C seems REALLY hard?

Thomas Marks

May 9, 2025

I'm not sure the zygote even has everything it needs. There's a lot of interaction between the developing embryo and the mother that can influence the final phenotype. Another large well of information is the microbiome, much of which is transferred from the mother but has many different species with their own DNA. Given how the microbiome can influence overall health and cognition, I think it's necessary to at least consider it. It does seem that it is somewhat fungible as to the specific composition of the microbiome, so that info may be quite compressible.

Asimov Press

How Much Information is in DNA?