Scientists have developed artificial intelligence software that can create proteins that may be useful as vaccines, cancer treatments, or even tools for pulling carbon pollution out of the air.
This research, reported today in the journal Science, was led by the University of Washington School of Medicine and Harvard University. The article is titled"Scaffolding protein functional sites using deep learning."
The proteins we find in nature are amazing molecules, but designed proteins can do so much more, said senior author David Baker, an HHMI Investigator and professor of biochemistry at UW Medicine. In this work, we show that machine learning can be used to design proteins with a wide variety of functions.
For decades, scientists have used computers to try to engineer proteins. Some proteins, such as antibodies and synthetic binding proteins, have been adapted into medicines to combat COVID-19. Others, such as enzymes, aid in industrial manufacturing. But a single protein molecule often contains thousands of bonded atoms; even with specialized scientific software, they are difficult to study and engineer.
Inspired by how machine learning algorithms can generate stories or even images from prompts, the team set out to build similar software for designing new proteins. The idea is the same: neural networks can be trained to see patterns in data. Once trained, you can give it a prompt and see if it can generate an elegant solution. Often the results are compelling or even beautiful, said lead author Joseph Watson, a postdoctoral scholar at UW Medicine.
The team trained multiple neural networks using information from the Protein Data Bank, which is a public repository of hundreds of thousands of protein structures from across all kingdoms of life. The neural networks that resulted have surprised even the scientists who created them.
The team developed two approaches for designing proteins with new functions. The first, dubbed hallucination is akin to DALL-E or other generative A.I. tools that produce new output based on simple prompts. The second, dubbed inpainting, is analogous to the autocomplete feature found in modern search bars and email clients.
Most people can come up with new images of cats or write a paragraph from a prompt if asked, but with protein design, the human brain cannot do what computers now can, said lead author Jue Wang, a postdoctoral scholar at UW Medicine. Humans just cannot imagine what the solution might look like, but we have set up machines that do.
To explain how the neural networks hallucinate a new protein, the team compares it to how it might write a book: You start with a random assortment of words total gibberish. Then you impose a requirement such as that in the opening paragraph, it needs to be a dark and stormy night. Then the computer will change the words one at a time and ask itself Does this make my story make more sense? If it does, it keeps the changes until a complete story is written, explains Wang.
Both books and proteins can be understood as long sequences of letters. In the case of proteins, each letter corresponds to a chemical building block called an amino acid. Beginning with a random chain of amino acids, the software mutates the sequence over and over until a final sequence that encodes the desired function is generated. These final amino acid sequences encode proteins that can then be manufactured and studied in the laboratory.
The team also showed that neural networks can fill in missing pieces of a protein structure in only a few seconds. Such software could aid in the development of new medicines.
With autocomplete, or Protein Inpainting, we start with the key features we want to see in a new protein, then let the software come up with the rest. Those features can be known binding motifs or even enzyme active sites, explains Watson.
Laboratory testing revealed that many proteins generated through hallucination and inpainting functioned as intended. This included novel proteins that can bind metals as well as those that bind the anti-cancer receptor PD-1.
The new neural networks can generate several different kinds of proteins in as little as one second. Some include potential vaccines for the deadly respiratory syncytial virus,orRSV.
All vaccines work by presenting a piece of a pathogen to the immune system. Scientists often know which piece would work best, but creating a vaccine that achieves a desired molecular shape can be challenging. Using the new neural networks, the team prompted a computer to create new proteins that included the necessary pathogen fragment as part of their final structure. The software was free to create any supporting structures around the key fragment, yielding several potential vaccines with diverse molecular shapes.
When tested in the lab, the team found that known antibodies against RSV stuck to three of their hallucinated proteins. This confirms that the new proteins adopted their intended shapes and suggests they may be viable vaccine candidates that could prompt the body to generate its own highly specific antibodies. Additional testing, including in animals, is still needed.
I started working on the vaccine stuff just as a way to test our new methods, but in the middle of working on the project, my two-year-old son got infected by RSV and spent an evening in the ER to have his lungs cleared. It made me realize that even the test problems we were working on were actually quite meaningful, said Wang.
These are very powerful new approaches, but there is still much room for improvement, said Baker, who was a recipient of the 2021 Breakthrough Prize in Life Sciences. Designing high activity enzymes, for example, is still very challenging. But every month our methods just keep getting better! Deep learning transformed protein structure prediction in the past two years, we are now in the midst of a similar transformation of protein design.
This project was led by Jue Wang, Doug Tischer, and Joseph L. Watson, who are postdoctoral scholars at UW Medicine, as well as Sidney Lisanza and David Juergens, who are graduate students at UW Medicine. Senior authors include Sergey Ovchinnikov, a John Harvard Distinguished Science Fellow at Harvard University, and David Baker, professor of biochemistry at UW Medicine.
Compute resources for this work were donated by Microsoft and Amazon Web Services.
Funding was provided by the Audacious Project at the Institute for Protein Design; Microsoft; Eric and Wendy Schmidt by recommendation of the Schmidt Futures; the DARPA Synergistic Discovery and Design project (HR001117S0003 contract FA8750-17-C-0219); the DARPA Harnessing Enzymatic Activity for Lifesaving Remedies project (HR001120S0052 contract HR0011-21-2-0012); the Washington Research Foundation; the Open Philanthropy Project Improving Protein Design Fund; Amgen; the Human Frontier Science Program Cross Disciplinary Fellowship (LT000395/2020-C) and EMBO Non-Stipendiary Fellowship (ALTF 1047-2019); the EMBO Fellowship (ALTF 191-2021); the European Molecular Biology Organization (ALTF 139-2018); the la Caixa Foundation; the National Institute of Allergy and Infectious Diseases (HHSN272201700059C), the National Institutes ofHealth (DP5OD026389); the National Science Foundation (MCB 2032259); the Howard Hughes Medical Institute, the National Institute on Aging (5U19AG065156); the National Cancer Institute (R01CA240339); the Swiss National Science Foundation; the Swiss National Center of Competence for Molecular Systems Engineering; the Swiss National Center of Competence in Chemical Biology; and the European Research Council(716058).
Written by Ian Haydon, UW Medicine Institute for Protein Design
View post:
Biologists train AI to generate medicines and vaccines - UW Medicine Newsroom
- Machine learning provides a new picture of the great gray owl - Phys.org - April 2nd, 2024
- What is Machine Learning? Definition, Types, Tools & More - April 2nd, 2024
- Revolutionizing Industries: The Convergence of RFID, AI, and Machine Learning - yTech - April 2nd, 2024
- Layerwise Importance Sampled AdamW (LISA): A Machine Learning Optimization Algorithm that Randomly Freezes Layers of LLM Based on a Given Probability... - April 2nd, 2024
- Dimensionality reduction for images of IoT using machine learning | Scientific Reports - Nature.com - April 2nd, 2024
- 3 Machine Learning Stocks That Could Be Multibaggers in the Making: March Edition - InvestorPlace - April 2nd, 2024
- Researchers use machine learning to improve the taste of Belgian beers Physics World - physicsworld.com - April 2nd, 2024
- PM Modi Emphasizes The Importance Of Incorporating AI & Machine Learning To Enhance Digital Infra - Business Today - April 2nd, 2024
- Accurate and rapid antibiotic susceptibility testing using a machine learning-assisted nanomotion technology platform - Nature.com - March 21st, 2024
- Machine Learning Accelerates the Simulation of Dynamical Fields - Eos - March 21st, 2024
- Quantum Machine Learning: Exploring the Intersection of New Frontiers - DataScientest - March 21st, 2024
- Advancements in Pancreatic Cancer Detection: Integrating Biomarkers, Imaging Technologies, and Machine Learning ... - Cureus - March 21st, 2024
- Google Health Researchers Propose HEAL: A Methodology to Quantitatively Assess whether Machine Learning-based Health Technologies Perform Equitably -... - March 21st, 2024
- A change in the machine learning landscape - InfoWorld - March 21st, 2024
- Informing immunotherapy with multi-omics driven machine learning | npj Digital Medicine - Nature.com - March 21st, 2024
- Crypto Entities That Neglect AI and Machine Learning Investment Will Lag Behind, Warns Binance CTO Bitcoin News - Bitcoin.com News - March 21st, 2024
- MIT Researchers Developed an Image Dataset that Allows Them to Simulate Peripheral Vision in Machine Learning Models - MarkTechPost - March 21st, 2024
- BurstAttention: A Groundbreaking Machine Learning Framework that Transforms Efficiency in Large Language Models with Advanced Distributed Attention... - March 21st, 2024
- A machine learning system to identify progress level of dry rot disease in potato tuber based on digital thermal image ... - Nature.com - January 24th, 2024
- Mind the Gap Machine Learning, Dataset Shift, and History in the Age of Clinical Algorithms | NEJM - nejm.org - January 24th, 2024
- Cracking the Business Code of Clusters Machine Learning Times - The Machine Learning Times - January 24th, 2024
- Machine-learning-based models found to have predictive abilities no better than chance in out-of-sample evaluations - 2 Minute Medicine - January 24th, 2024
- Hybrid machine learning method boosts resolution of electrical impedance tomography - Tech Xplore - January 24th, 2024
- Cow moos and burps to be monitored using machine learning - FoodNavigator.com - January 24th, 2024
- Enhancing foveal avascular zone analysis for Alzheimer's diagnosis with AI segmentation and machine learning using ... - Nature.com - January 24th, 2024
- How to Use AI and Machine Learning for Academic Research - Innovation & Tech Today - January 24th, 2024
- Smart Use of Machine Learning Algorithms: Beyond the Hype, Into Real-World Solutions - Medium - January 24th, 2024
- How A.I./Machine Learning Is Boosting the Diversity of U.S. Med Students and Americas Future Doctors - Higher Education Digest - January 24th, 2024
- Weekly AiThority Roundup: Biggest Machine Learning, Robotic And Automation Updates - AiThority - January 24th, 2024
- How to Develop and Deploy Machine Learning Project in Python - Analytics Insight - January 24th, 2024
- Machine learning education | TensorFlow - January 7th, 2024
- How LinkedIn Uses Machine Learning to Address Content-Related Threats and Abuse - InfoQ.com - January 7th, 2024
- What is AI and Machine Learning? - GovernmentCIO Media & Research - January 7th, 2024
- Overview: Machine Learning Specialization by Andrew Ng (Course 1) - Medium - January 7th, 2024
- Study uses new tools, machine learning to investigate major cause of blindness in older adults - Medical Xpress - January 7th, 2024
- Leveraging AI and Machine Learning on AWS | by Be | Jan, 2024 - Medium - January 7th, 2024
- The Future at the Intersection of AI, Machine Learning, and Data Science - Medriva - January 7th, 2024
- Navigating the AI Landscape: From Machine Learning Foundations to Multimodal Advancements - Medium - January 7th, 2024
- Brake Noise And Machine Learning (3 of 4) - The BRAKE Report - January 7th, 2024
- 'Local' machine learning promises to cut the cost of AI development in 2024 - ITPro - January 7th, 2024
- Voice Recognition with Machine Learning on Arduino Nano 33 BLE Sense - Medium - January 7th, 2024
- This Paper from MIT and Microsoft Introduces LASER: A Novel Machine Learning Approach that can Simultaneously Enhance an LLMs Task Performance and... - January 7th, 2024
- How to Choose the Right Advanced Certification Program in AI & Machine Learning - TechGraph - January 7th, 2024
- What Is Machine Learning? | A Beginner's Guide - Scribbr - November 17th, 2023
- AI vs. Machine Learning vs. Deep Learning vs. Neural Networks ... - IBM - January 30th, 2023
- The Latest Google Research Shows how a Machine Learning ML Model that Provides a Weak Hint can Significantly Improve the Performance of an Algorithm... - January 30th, 2023
- What Is Machine Learning and Why Is It Important? - January 22nd, 2023
- Achieving Next-Level Value From AI By Focusing On The Operational Side Of Machine Learning - Forbes - January 22nd, 2023
- UCLA Researcher Develops a Python Library Called ClimateLearn for Accessing State-of-the-Art Climate Data and Machine Learning Models in a... - January 22nd, 2023
- Alto Neuroscience Presents New Data Leveraging EEG and Machine Learning to Predict Individual Response to Antidepressants at the 61st Annual Meeting... - December 12th, 2022
- Apple has released a Set of Optimizations that allow the Stable Diffusion AI Image Generator to be used on Apple Silicon, making use of Core ML,... - December 12th, 2022
- Genomic Testing Cooperative to Present Data at the American Society of Hematology Meeting on New Applications of its Proprietary Tests that Combine... - December 12th, 2022
- Astronomers at Caltech Have Used a Machine Learning Algorithm to Classify 1,000 Supernovae Completely Autonomously - MarkTechPost - December 4th, 2022
- Deep Learning | NVIDIA Developer - November 25th, 2022
- Check Out This Tool That Uses Machine Learning To Animate 3D Models In Real-Time And Will Soon Be Compatible With Unreal Engine - MarkTechPost - November 17th, 2022
- The NFT World is Evolving, and That's No Secret. Machine Learning and Algorithmic Tools ... - Latest Tweet - LatestLY - October 23rd, 2022
- Its Not Just About Accuracy - Five More things to Consider for a Machine Learning Model - AZoM - October 15th, 2022
- Machine learning operations offer agility, spur innovation - MIT Technology Review - October 15th, 2022
- Machine learning to predict the development of recurrent urinary tract infection related to single uropathogen, Escherichia coli | Scientific Reports... - October 15th, 2022
- The more data, the more deep learning capacity - Innovation Origins - October 15th, 2022
- Outlook on the Machine Learning in Life Sciences Global Market to 2027 - Featuring Alteryx, Anaconda, Canon Medical Systems and Imagen Technologies... - October 15th, 2022
- Forensic Discovery Taps Reveal-Brainspace to Bolster its Analytics, AI and Machine Learning Capabilities - Business Wire - October 15th, 2022
- Long-term exposure to particulate matter was associated with increased dementia risk using both traditional approaches and novel machine learning... - October 15th, 2022
- Machine Learning | Google Developers - October 7th, 2022
- Machine Learning in Oracle Database | Oracle - October 7th, 2022
- Learning on the edge | MIT News | Massachusetts Institute of Technology - MIT News - October 7th, 2022
- Study: Few randomized clinical trials have been conducted for healthcare machine learning tools - Mobihealth News - October 7th, 2022
- The Worldwide Industry for Machine Learning in the Life Sciences is Expected to Reach $20.7 Billion by 2027 - ResearchAndMarkets.com - Business Wire - October 7th, 2022
- Dominos MLops release focuses on GPUs and deep learning, offers multicloud preview - VentureBeat - October 7th, 2022
- MLOps Company Iterative Sees Steady Growth in First Half of 2022 - Business Wire - October 7th, 2022
- Machine learning tool could help people in rough situations make sure their water is good to drink - ZME Science - October 7th, 2022
- Developing Machine-Learning Apps on the Raspberry Pi Pico - Design News - October 7th, 2022
- Arctoris welcomes on board globally recognized experts in Machine Learning, Chemical Computation, and Alzheimer's Disease - Business Wire - October 7th, 2022
- Machine vision breakthrough: This device can see 'millions of colors' - Northeastern University - October 7th, 2022
- RBI plans to extensively use artificial intelligence, machine learning to improve regulatory supervision - ETCIO - October 7th, 2022
- Artificial intelligence may improve suicide prevention in the future - EurekAlert - October 7th, 2022
- Google turns to machine learning to advance translation of text out in the real world - TechCrunch - September 29th, 2022
- Machine learning has predicted the winners of the Worlds - CyclingTips - September 29th, 2022
- Peking University released the first open-source dataset for machine learning applications in fast chip design - EurekAlert - September 29th, 2022
- Predicting the effects of winter water warming in artificial lakes on zooplankton and its environment using combined machine learning models |... - September 29th, 2022