Machine learning cracked the protein-folding problem and won the 2024 Nobel Prize in chemistry

The 2024 Nobel Prize in chemistry recognised Demis Hassabis, John Jumper and David Baker for using machine learning to tackle one of biology’s biggest challenges: predicting the 3D shape of proteins and designing them from scratch.

This year’s award stood out because it honoured research that originated at a tech company: DeepMind, an AI research startup that was acquired by Google in 2014. Most previous chemistry Nobel Prizes have gone to researchers in academia. Many laureates went on to form startup companies to further expand and commercialise their groundbreaking work – for instance, CRISPR gene-editing technology and quantum dots – but the research, from start to end, wasn’t done in the commercial sphere.

Although the Nobel Prizes in physics and chemistry are awarded separately, there is a fascinating connection between the winning research in those fields in 2024. The physics award went to two computer scientists who laid the foundations for machine learning, while the chemistry laureates were rewarded for their use of machine learning to tackle one of biology’s biggest mysteries: how proteins fold.

The 2024 Nobel Prizes underscore both the importance of this kind of artificial intelligence and how science today often crosses traditional boundaries, blending different fields to achieve groundbreaking results.

The challenge of protein folding

Proteins are the molecular machines of life. They make up a significant portion of our bodies, including muscles, enzymes, hormones, blood, hair, and cartilage.

Proteins are chains of amino acid molecules that form a 3D shape based on their atoms’ interactions. ©Johan Jarnestad/The Royal Swedish Academy of Sciences

Understanding proteins’ structures is essential because their shapes determine their functions. Back in 1972, Christian Anfinsen won the Nobel Prize in chemistry for showing that the sequence of a protein’s amino acid building blocks dictates the protein’s shape, which, in turn, influences its function. If a protein folds incorrectly, it may not work properly and could lead to diseases such as Alzheimer’s, cystic fibrosis or diabetes.

A protein’s overall shape depends on the tiny interactions, the attractions and repulsions, between all the atoms in the amino acids its made of. Some want to be together, some don’t. The protein twists and folds itself into a final shape based on many thousands of these chemical interactions.

For decades, one of biology’s greatest challenges was predicting a protein’s shape based solely on its amino acid sequence. Although researchers can now predict the shape, we still don’t understand how the proteins manoeuvre into their specific shapes and minimise the repulsions of all the interatomic interactions in a few microseconds.

To understand how proteins work and to prevent misfolding, scientists needed a way to predict the way proteins fold, but solving this puzzle was no easy task.

In 2003, University of Washington biochemist David Baker wrote Rosetta, a computer program for designing proteins. With it he showed it was possible to reverse the protein-folding problem by designing a protein shape and then predicting the amino acid sequence needed to create it.

It was a phenomenal jump forward, but the shape chosen for the calculation was simple, and the calculations were complex. A major paradigm shift was required to routinely design novel proteins with desired structures.

A new era of machine learning

Machine learning is a type of AI where computers learn to solve problems by analysing vast amounts of data. It’s been used in various fields, from game-playing and speech recognition to autonomous vehicles and scientific research. The idea behind machine learning is to use hidden patterns in data to answer complex questions.

This approach made a huge leap in 2010 when Demis Hassabis co-founded DeepMind, a company aiming to combine neuroscience with AI to solve real-world problems.

Hassabis, a chess prodigy at age 4, quickly made headlines with AlphaZero, an AI that taught itself to play chess at a superhuman level. In 2017, AlphaZero thoroughly beat the world’s top computer chess program, Stockfish-8. The AI’s ability to learn from its own gameplay, rather than relying on preprogrammed strategies, marked a turning point in the AI world.

Soon after, DeepMind applied similar techniques to Go, an ancient board game known for its immense complexity. In 2016, its AI program AlphaGo defeated one of the world’s top players, Lee Sedol, in a widely watched match that stunned millions.

In 2016, Hassabis shifted DeepMind’s focus to a new challenge: the protein-folding problem. Under the leadership of John Jumper, a chemist with a background in protein science, the AlphaFold project began. The team used a large database of experimentally determined protein structures to train the AI, which allowed it to learn the principles of protein folding. The result was AlphaFold2, an AI that could predict the 3D structure of proteins from their amino acid sequences with remarkable accuracy.

This was a significant scientific breakthrough. AlphaFold has since predicted the structures of over 200 million proteins – essentially all the proteins that scientists have sequenced to date. This massive database of protein structures is now freely available, accelerating research in biology, medicine and drug development.

Designer proteins to fight disease

Understanding how proteins fold and function is crucial for designing new drugs. Enzymes, a type of protein, act as catalysts in biochemical reactions and can speed up or regulate these processes. To treat diseases such as cancer or diabetes, researchers often target specific enzymes involved in disease pathways. By predicting the shape of a protein, scientists can figure out where small molecules – potential drug candidates – might bind to it, which is the first step in designing new medicines.

In 2024, DeepMind launched AlphaFold3, an upgraded version of the AlphaFold program that not only predicts protein shapes but also identifies potential binding sites for small molecules. This advance makes it easier for researchers to design drugs that precisely target the right proteins.

Google bought Deepmind for reportedly around half a billion dollars in 2014. Google DeepMind has now started a new venture, Isomorphic Labs, to collaborate with pharmaceutical companies on real-world drug development using these AlphaFold3 predictions.

For his part, David Baker has continued to make significant contributions to protein science. His team at the University of Washington developed an AI-based method called “family-wide hallucination,” which they used to design entirely new proteins from scratch. Hallucinations are new patterns – in this case, proteins – that are plausible, meaning they are a good fit with patterns in the AI’s training data. These new proteins included a light-emitting enzyme, demonstrating that machine learning can help create novel synthetic proteins. These AI tools offer new ways to design functional enzymes and other proteins that never could have evolved naturally.

AI will enable research’s next chapter

The Nobel-worthy achievements of Hassabis, Jumper and Baker show that machine learning isn’t just a tool for computer scientists – it’s now an essential part of the future of biology and medicine.

By tackling one of the toughest problems in biology, the winners of the 2024 prize have opened up new possibilities in drug discovery, personalised medicine and even our understanding of the chemistry of life itself.

This article was authored by Marc Zimmer, Professor of Chemistry at Conneticut University. It is republished from The Conversation under a Creative Commons license. Read the original article.

Cookie	Duration	Description
_ga	1 year 1 month 4 days	Google Analytics sets this cookie to calculate visitor, session and campaign data and track site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors.
_ga_*	1 year 1 month 4 days	Google Analytics sets this cookie to store and count page views.
CONSENT	2 years	YouTube sets this cookie via embedded YouTube videos and registers anonymous statistical data.

Cookie	Duration	Description
OAID	1 year	Cookie set to record whether the user has opted out of the collection of information by the AdsWizz Service Cookies.
test_cookie	15 minutes	doubleclick.net sets this cookie to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	YouTube sets this cookie to measure bandwidth, determining whether the user gets the new or old player interface.
YSC	session	Youtube sets this cookie to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt-remote-device-id	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt.innertube::nextId	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.

Robotics & Automation – November 2024

Robotics & Automation – November 2024

Robotics & Automation – July 2024

Robotics & Automation – May 2024

Machine learning cracked the protein-folding problem and won the 2024 Nobel Prize in chemistry

AI will continue to grow in 2025. But it will face major challenges along the way

Survey on AI finds most people want it regulated, but trust in government remains low

Trump may cancel Nasa’s powerful SLS Moon rocket – here’s what that would mean for Elon Musk and the future of space travel

CHG-Meridian has established ISO-certified management systems throughout Europe

AI will continue to grow in 2025. But it will face major challenges along the way

Robotics & Automation: 2024 in review

2024 in review: Robotics & Automation in the USA

Upcoming Events

Machine learning cracked the protein-folding problem and won the 2024 Nobel Prize in chemistry

Related Stories

Upcoming Events