As we all know, the entire universe is filled with countless molecules.
How many of these molecules have potentially drug-like properties that could be used to develop life-saving drugs? Is it a million? Or billions? Or trillions?
The answer is: 10 to the 60th power.
Such a huge number has greatly delayed the development of new drugs. For rapidly spreading diseases such as COVID-19, there is currently no specific drug. This is also because the types and quantities of molecules are too large, far beyond what is currently available. The range of calculations that drug design models can make.
A research team at MIT doesn’t believe this evil. It doesn’t work out, right? Then it’s okay to accelerate the previous model, right?
This acceleration is 1200 times.
They studied a geometric deep learning model called "EquiBind", which is 1,200 times faster than the previous fastest computational molecular docking model "QuickVina2-W", and successfully combined drug-like molecules Binding to proteins reduces the chance and cost of drug trial failure.
The research paper will be published at ICML 2022.
"EquiBind" is developed based on its predecessor "EquiDock". "EquiDock" uses the late MIT The technology developed by Octavian-Eugen Ganea, an AI researcher at the college, combines two proteins. Ganea is also a co-author of the "EquiBind" paper.
Before drug development can begin, researchers must find promising drug-like molecules that can correctly bind, or "docked," to specific protein targets during the drug discovery process.
After successfully docking with the protein, combining the drug (ligand) can prevent the protein from functioning. If this happens to one of the bacteria's essential proteins, it can kill the bacteria and thus protect the body.
However, the process of drug discovery can be expensive, both from an economic and computational perspective. The research and development process often costs billions of dollars and will take more than 10 years before final approval by the FDA. Ten years of development and testing.
More importantly, 90% of drugs fail after human trials because they have no effect or have too many side effects.
So one of the ways pharmaceutical companies recoup these costs is to raise the price of the drug they ultimately successfully develop.
Currently, the computational process for finding promising drug candidate molecules is as follows: most state-of-the-art computational models rely on a large number of Candidate samples, coupled with methods such as scoring, ranking and fine-tuning, to obtain the best "match" between ligand and protein.
Hannes Stärk, a first-year graduate student in MIT's Department of Electrical Engineering and Computer Science and the lead author of the paper, likened the typical "ligand-protein" binding method to "trying to put the key into something." Many keyholes in the lock."
# Typical models spend time scoring each "fit" before selecting the best model. In contrast, “EquiBind” does not need to know the target pocket of the protein in advance, and can directly predict the precise key position in just one step, which is called “blind docking”.
Unlike most models that require multiple attempts to find the favorable position of a ligand in a protein, “EquiBind” already has built-in geometric reasoning capabilities that help the model learn the underlying physical properties of the molecule and successfully to summarize. This allows for successful generalization to make better predictions when encountering new or unrecognizable data.
The release of these findings quickly attracted the attention of industry professionals, including Relay Therapeutics chief data officer Pat Walters.
Walters suggested that the team could try their model on an existing drug and protein used in lung cancer, leukemia and gastrointestinal tumors. Although most traditional docking methods fail to successfully bind ligands on these proteins, EquiBind succeeds.
Walters said: "EquiBind provides a unique solution to the docking problem, which combines pose prediction and binding site identification."
"And this method utilizes data from "The information from thousands of published crystal structures has the potential to impact the field in new ways." Stärk said: "We were surprised when all the other approaches were completely wrong or only one was right, because EquiBind was able to put it into the right pocket, and we are very excited to see this result!"
Help "EquiBind"
Stärk said: "The feedback I am most looking forward to is suggestions on how to further improve the model."
"I would like to discuss with these researchers, tell them what I think the next steps can be, and encourage them to move forward and use the model in their own papers and methods. We have already been contacted by many researchers, Ask us if this model would be useful for their problem."
In addition, this article also commemorates Octavian-Eugen Gane, who made crucial contributions to geometric machine learning research and generously He mentored many students and was an outstanding scholar with a humble soul.
In the first half of this year, he left us forever during a hiking trip.
The above is the detailed content of 1200 times faster! MIT develops a new generation of drug research and development AI to defeat the old model. For more information, please follow other related articles on the PHP Chinese website!