Discussion about this post

User's avatar
Michael Frank Martin's avatar

The simulation of parallel information acquisition maps perfectly onto the mathematics of transformer attention. An Iranian mathematical biologist, Shahshahani, demonstrated in 1979 that evolutionary models are actually gradient flows optimizing the Fisher information metric.

I explored this isomorphism in a short essay here https://www.symmetrybroken.com/asymmetric-evolution/. There is a connection between these mathematics and the mathematics of transformers too.

In this framework, asexual "clonal interference" is a triviality failure mode, which means the system’s unique attractor collapsing into a single uniform state. Sexual reproduction avoids this by maintaining metastable multi-cluster states, preserving the structural divergence necessary to accumulate "bits" of certainty without collapsing the search space.

Luke Lea's avatar

Possibly irrelevant to the point you are trying to make (I'm not sure), but the phenomenon of "crossing over" is maybe an even more important advantage of sexual reproduction: shuffling the genes on a particular chromosome between parents results in a situation in which every gene is evolving independently, as it were, instead of being stuck as a part of a fixed ensemble. At lease this is a conclusion I came to when I thought about it many years ago.

4 more comments...

No posts

Ready for more?