What is it about?
Decoding Life's Secret Language: Why Do Species Use DNA 'Words' Differently? Every living thing uses the same genetic code to translate DNA into proteins; the genetic code is the universal dictionary of life. This dictionary uses three-letter "words" called codons. Even though the dictionary is the same, different species have favorite ways of spelling these three-letter words. For example, a single amino acid might be spelled two or three different ways, and one species might strongly prefer one spelling over the others. Our paper investigates why these preferences exist across the Tree of Life (from Archaea and Bacteria to Eukaryotes). We found that the first two letters of the codon carry almost all the crucial information needed to decide which amino acid is made. The third letter is mostly just for flexibility and fine-tuning. To prove this, we used Information Theory (a mathematical tool to measure data and uncertainty) on the genetic code of over 1,400 species. This allowed us to quantify exactly how much information is stored in each position of the codon. We confirmed that the third position has the highest informational variability (it changes the most without changing the resulting amino acid). When we looked at different domains of life, we saw highly distinct coding strategies: Bacteria and Eukaryotes tend to stick to more constrained, standard spelling rules. Archaea (a domain of single-celled organisms) showed unique and surprising levels of variability. This suggests their extreme environments and life histories have profoundly changed how they process genetic information. Crucially, we showed that these informational measures reveal relationships and patterns among species that traditional measures (like GC-content) completely miss. Our work provides a brand new mathematical framework for understanding the "grammar" of genetic information, offering a complementary way to study evolution alongside traditional phylogenetic analysis.
Featured Image
Photo by Logan Voss on Unsplash
Why is it important?
1. New Tools for Understanding Evolution For decades, scientists have mainly relied on tracking the overall chemistry of DNA (like GC-content) to study how genes evolve. Our work provides a powerful new mathematical lens (using Information Theory) that measures the meaning and structure of genetic messages, not just the raw chemical composition. This new framework gives researchers better tools to: * Spot unique evolutionary adaptations in species that are easily missed by traditional methods. * Track genetic change with more precision across all domains of life. 2. Revealing the "Grammar" of Life Our findings directly clarify a fundamental mystery of the genetic code: the hierarchical roles of the three codon bases. We confirm that genetic information is hierarchically organized, with the first and second bases serving as the core "subject" and "verb," and the third base acting as a flexible "modifier." Understanding this intrinsic structure is crucial for: * Engineering better synthetic biology —designing stable, functional genes from scratch. * Interpreting genetic diseases —since subtle changes in codon usage can affect protein production and folding. 3. Understanding Life's Diversity By showing that different domains of life (especially Archaea) use significantly varied coding strategies, our paper highlights how extreme life histories and unique ecological niches actively shape the way organisms encode information. This demonstrates that the seemingly universal genetic code is actually flexible and continues to be fine-tuned by evolutionary pressure, deepening our understanding of the vast diversity and adaptability of life on Earth.
Perspectives
We invite researchers globally to adopt this informational framework to study codon usage. While our study sampled 1,434 species, the vast diversity of genetic strategies remains largely unexplored. Applying these entropy measures to new data sets will be crucial for fully unraveling the intricate grammars of genetic information across the entire tree of life.
Octavio Martínez
Cinvestav
Read the Original
This page is a summary of: Sampling informational properties of codon usage through the tree of life, PLOS One, November 2025, PLOS,
DOI: 10.1371/journal.pone.0335824.
You can read the full text:
Resources
Contributors
The following have contributed to this page







