Simulating 500 million years of evolution with a language model
Protein molecule (screenshot from the EvolutionaryScale announcement video)


"Biology is the most advanced technology that has ever been created, far beyond anything that people have engineered. The ribosome is programmable—it takes the codes of proteins in the form of RNA and builds them up from scratch"

EvolutionaryScale, a company founded by former members of Meta's Fundamental AI Research lab, have just announced ESM3, the first generative model for biology that simultaneously reasons over the sequence, structure, and function of proteins.

ESM3 is a game-changer. It allows scientists to not just better understand proteins, but to create new ones.

The model was trained with an astonishing 1 trillion teraFLOPs of compute (about 10^24 floating-point operations in total), which EvolutionaryScale describes as more than any other biological model to date. The dataset? A massive 2.78 billion proteins from all corners of the Earth’s natural diversity.

What makes ESM3 truly revolutionary is that it’s the first generative model in biology that can handle the sequence, structure, and function of proteins all at once. This opens up a whole new frontier for scientific innovation.

How does it work?

ESM3 has a straightforward goal. For each protein, it looks at its sequence, structure, and function. These parts are broken down and partially hidden. ESM3’s job is to guess the hidden parts, much like language models guess missing words. To do this, ESM3 must deeply understand how sequence, structure, and function are connected.
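To make the "hide and guess" idea concrete, here is a minimal sketch of the masking step only. It is a toy illustration, not EvolutionaryScale's code: the function name `mask_tracks`, the `<mask>` token, and the structure and function labels are all invented for this example, and the real model uses its own tokenisation.

```python
import random

MASK = "<mask>"  # hypothetical mask token, chosen for this sketch


def mask_tracks(tracks, mask_rate, rng):
    """Hide a random subset of positions in every track.

    Returns (masked_tracks, targets), where targets maps
    (track_name, position) to the original token that was hidden.
    A model would be trained to predict those targets.
    """
    masked = {}
    targets = {}
    for name, tokens in tracks.items():
        masked_tokens = list(tokens)
        for i, tok in enumerate(tokens):
            if rng.random() < mask_rate:
                masked_tokens[i] = MASK
                targets[(name, i)] = tok
        masked[name] = masked_tokens
    return masked, targets


# Toy protein with three aligned tracks. All labels are placeholders.
protein = {
    "sequence": list("MSKGEELFTG"),
    "structure": ["s3", "s3", "s1", "s1", "s7", "s7", "s7", "s2", "s2", "s2"],
    "function": ["f0"] * 5 + ["f1"] * 5,
}

masked, targets = mask_tracks(protein, mask_rate=0.4, rng=random.Random(0))
for name in protein:
    print(name, masked[name])
```

Because every track is masked independently, a model trained this way has to use whatever is still visible in one track (say, structure) to fill the gaps in another (say, sequence), which is where the link between the three comes from.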

By working with billions of proteins and parameters, ESM3 learns to mimic evolution.

Once trained, ESM3 can generate new proteins based on prompts. Scientists can guide ESM3 to create proteins for various uses, like medicine, research, and clean energy.
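As a rough picture of what "generating from a prompt" can look like for a masked model, the sketch below fills in the blank positions of a partly specified sequence over a few rounds, committing the most confident guesses first. This is an assumption about the general technique, not a description of ESM3's actual decoding procedure, and `toy_predict` is a hypothetical random stand-in for a trained model.

```python
import random

MASK = "_"  # placeholder for a position the model must fill in
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def toy_predict(tokens, position, rng):
    """Hypothetical stand-in for a trained model.

    Returns a (token, confidence) guess for one masked position.
    A real model would condition on the visible tokens; this one is random.
    """
    return rng.choice(AMINO_ACIDS), rng.random()


def generate(prompt, steps, rng):
    """Fill every masked position in `prompt` within `steps` rounds."""
    tokens = list(prompt)
    for step in range(steps):
        masked = [i for i, t in enumerate(tokens) if t == MASK]
        if not masked:
            break
        # Spread the remaining blanks evenly over the remaining rounds.
        remaining_steps = steps - step
        n_fill = -(-len(masked) // remaining_steps)  # ceiling division
        guesses = [(i, *toy_predict(tokens, i, rng)) for i in masked]
        guesses.sort(key=lambda g: g[2], reverse=True)
        # Commit only the most confident guesses; the rest stay masked.
        for i, tok, _conf in guesses[:n_fill]:
            tokens[i] = tok
    return "".join(tokens)


# The scientist fixes some residues (the prompt) and leaves the rest blank.
prompt = "MSK____FTG"
designed = generate(prompt, steps=3, rng=random.Random(0))
print(designed)
```

The fixed letters in the prompt are never overwritten, which is the sense in which the scientist keeps control over the design while the model fills in the rest.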

ESM3’s ability to understand and combine sequence, structure and function allows scientists to create new proteins with great control.

Say Hi To A New Green Fluorescent Protein

In their scientific preprint, EvolutionaryScale announce that they have synthesised a new Green Fluorescent Protein (GFP) with significantly improved brightness (am I alone in wanting protein-based lighting for my home?), which they've dubbed esmGFP.

The power of the generative model is illustrated by how far removed this new protein is from the GFPs occurring in nature, where new GFPs take a very long time to emerge. To quote their press release:

"The process of evolution that gives rise to new fluorescent proteins takes epochs of time—the story of this protein family reaches back into depths of natural history and geologic time where somewhere in the distant past nature invented the first fluorescent protein. Natural fluorescent proteins have diverged over 100s of millions of years from ancestral sequences in deep history to become the proteins they are today."

In effect, ESM3 is simulating hundreds of millions of years of natural evolution!

Rendering of esmGFP, a brand new Green Fluorescent Protein, courtesy of ESM3.

Why does this matter?

In the 1970s, molecular biology changed dramatically with the start of the recombinant DNA era, when scientists invented genetic engineering. This led to a revolution in our understanding of genetics, the decoding of the human genome, and the creation of groundbreaking new medicines.

Today, making biology programmable and exploring the possible sequences, structures, and functions of molecules signals the start of a similar revolution. This will lead to numerous medical advances and significant scientific progress.


That's it for now, I'm all out of coffee! Talk soon!








