Professional Documents
Culture Documents
Aidd Bii
Aidd Bii
Aidd Bii
EXAMPLES:
1. British AI startup BenevolentAI identified baricitinib, a drug
developed by Elli Lily for the treatment of rheumatoid arthritis,
as a potentially effective COVID-19 drug in just a few days. The
medication has been approved as a COVID-19 treatment in the
U.S. and Japan.
WHAT HAPPENED
In the eight years since, IBM has trumpeted many more
high-profile efforts to develop AI-powered medical
technology—many of which have fizzled, and a few of
which have failed spectacularly. The company spent
billions on acquisitions to bolster its internal efforts, but
insiders say this did not work.
Multiple of methods:
Heuristic search over large spaces
Decision trees
Clustering
Machine learning and genetic algorithms were the best tools for these
sets
https://www.epa.gov/sites/production/files/2013-
20 Minute Break
Read the Articles Linked in the Slides
What metrics can AI improve in DDD?
> Shorter Development Time
> Decreased Failure Rate
> Greater Novelty, Better Starting Points
> Lower Cost
Highlights
•Deep learning technology has gained remarkable success.
•Shows recent applications of deep learning in drug discovery research.
•Some popular deep learning architectures are introduced in the current study.
•Future development of deep learning in drug discovery is discussed.
https://www.linkedin.com/pulse/deep-learning-drug-
discovery-new-frontier-oscar-bastidas/?
trackingId=qjsgI9S3T9ewRSAVWb3jFA%3D%3D
One of the most appealing abilities that DeepNetts® has to
contribute is its freedom from quantitative structure activity
relationship (QSAR) input as an obligatory predictor variable.
Although extremely useful in considering molecular
characteristics as part of a data analytics campaign, there are
situations where obtaining structural information such as in
the form of the simplified molecular-input line-entry system
(SMILES) is extremely challenging. This is of particular concern
especially for thousands of molecule data points that might
not be registered in databases such as ChEMBL (or the lab
might have them listed by more esoteric names not used in
such databases). In such cases, being able to carry out a deep
learning analysis of the data using physical and chemical
properties already provided as part of the compound library,
can prove an invaluable supplement or even alternative to
QSAR.
Tensorflow PyTorch
https://github.com/tensorflow/tensorflow https://pytorch.org/
For Deep Learning Easy to Use
https://pytorch.org/tutorials/
Online Tutorials
May be used on AWS
Makes use of a Generative Adversarial Network
GAN
• First, artificial intelligence has been applied to finding molecular targets, being a molecular structure with which an active pharmaceutical ingredient interacts
and thus triggers its effects. Most drug targets are proteins such as receptors, enzymes, transporters and ion channels. Nucleic acids and structural proteins can
also be targets. Scientists have recently developed a machine learning algorithm capable of predicting biological targets of prospective drug molecules. An
example is the Bayesian analysis for the determination of drug interaction targets (BANDIT), which is designed to use virtually all available data on any potential
drug to predict which enzyme or receptor or other target in the cells it interacts with to achieve its therapeutic effect. BANDIT can streamline the process of target
identification by limiting the possibilities to perhaps only one or two targets, which can then be studied.
• Second, artificial intelligence has been applied to finding a “hit” or “lead”. Automated high-throughput screening is a method for finding so-called hits and thus
candidates for lead structures. A lead is a chemical substance that is investigated in the course of drug design as a starting point for the development of a drug
candidate. A lead already shows a desired biological effect in vitro, but does not yet have the qualities required for a drug substance in terms of potency, selectivity
and pharmacokinetics. A lead differs from any hit from high-throughput screening in that its structure allows the synthesis of analogue molecules. A lead can also
be a natural substance whose pharmacological properties must be improved by chemical modifications.
• Third, artificial intelligence has been used in drug repurposing. Many drugs are small, flexible molecules that not only interact with the originally determined
target structure, but can also interact with other biological structures. Such associated effects can lead to the identification of new applications.
• Fourth, artificial intelligence can help in identifying candidate substances by screening existing databases. Scientists have applied machine learning models to
identify completely new antibiotically-active substances. The machine learning models can analyse molecular structures and correlate them with certain
properties, such as the ability to kill bacteria, and convert them into algorithms.
CONTINUED....
• Fifth, artificial intelligence has been used in polypharmacology. Drug development in recent decades has been guided by the paradigm “one drug – one target
structure – one disease”. This principle is based on the approach that a pharmacological agent is used to selectively address a target structure in the body in order
to treat a specific disease pattern. A large number of serious illnesses, such as cancer, mental disorders or cardiovascular diseases, often cannot be explained by a
single biological defect, but only the interplay of complex and usually networked malfunctions that together produce the clinical picture. Artificial intelligence can
assist.
• Sixth, artificial intelligence is a useful tool in the development of vaccines. Artificial intelligence can help with gene sequencing, so as to decode the genetic
material of the virus. Artificial intelligence can also help with the simulation of vaccines. In a simulation, possible vaccines can be “released” on the virus. The data
on the nature of the virus and the vaccine may be sufficient to simulate how they react to each other. This makes it possible to exclude less promising vaccines
even before the actual tests begin.
• Seventh, artificial intelligence can assist in determining the three-dimensional structure of proteins including their folding forms. The Alphabet subsidiary
DeepMind used deep learning, a subfield of machine learning, to develop the AlphaFold system, which generates 3D models of proteins based on their genetic
sequences. AlphaFold has been used to predict protein structures of the SARS-CoV-2 virus. This is important in research for a vaccine. The antibodies of a vaccine
must act precisely on the proteins of the virus in order to neutralise it.
• Eighth, let me say something about databases. There are many databases that are available to integrate diverse information of molecular pathways, crystal
structures, binding affinities, drug targets, disease relevance, chemical properties and biological activities. Artificial intelligence could be used to harvest
information from these databases to design new compounds both for single targets but also in polypharmacology.
• Ninth, artificial intelligence can also be used to support clinical trials by detecting the disease in patients, identifying the gene targets and predicting the effect of
the designed molecule and the on- and off-target effects. Further, patient selection is a critical process in clinical trials. In clinical studies, the relationship between
human biomarkers and in vitro phenotypes is essential. It allows a more predictable, quantifiable evaluation of the uncertainty of therapeutic responses in a given
patient. The development of artificial intelligence approaches for the identification and prediction of human relevant disease biomarkers enables the recruitment
of a specific patient population in Phase II and Phase III clinical trials. The success rate in clinical trials can thus be increased by predictive modelling using artificial
intelligence in the selection of a patient population.
What new problems in drug discovery and
development can we apply Machine Learning to to
save time or money, or to increase value, or decrease
failure rate? How? What data sources could be
used?
✦ Novo Nordisk uses NLP-based text mining to better understand the concerns that patients and providers have about the use
of its products. The company looks for data on such topics as safety, efficacy, pharmacokinetic/pharmacodynamics modeling,
randomized controlled trials, patient populations, dosing and devices. According to Novo Nordisk, use of NLP for data extraction
has replaced scanning of manual real-world data sources, allowing for the elimination of two full-time equivalent positions and
automating the process of generating insights.
✦ GlaxoSmithKline plc drug safety teams use NLP text mining to streamline the literature review process for pharmacovigilance
and ensure better patient safety. They also use NLP search tools to review medical literature for safety signals. In one project, the
company found an average of 60 new adverse event references per day over a 300-day period that included a review of 11,000
documents and 20 marketed products. The company calculated that similar searches, if performed manually, would have taken
thousands of hours and included less precise findings.
✦ Sanofi also used NLP to help identify linkages between specific genes, diseases and drug sensitivity for a specific multiple
sclerosis biomarker project. Using NLP, the company’s search identified all of the 22 previously published autoimmune diseases
and drug sensitivities associated with HLA alleles and haplotypes, but also uncovered another 33 novel unpublished disease and
drug-sensitivity associations.
https://www.linguamatics.com/solutions/drug-
discovery
See Case Studies for:
Target Prioritization
Target Selection
Text Extraction
https://www.outsourcing-
pharma.com/Article/2019/01/30/Sanofi-NLP-text-
mining-and-other-AI-tech-to-advance-findings-
improve-data-use
https://techcrunch-
com.cdn.ampproject.org/c/s/techcrunch.com/2020/11/30/alphabets-
deepmind-achieves-historic-new-milestone-in-ai-
based-protein-structure-prediction/amp/
The DeepMind database consists of 350,000
newly-predicted protein structures and the
company plans to release an additional 100
million protein structures over the next few
months – as Demis Hassabis, the CEO and Co-
founder of DeepMind puts it: “Almost every
sequenced protein known to science – over 100
million structures … a veritable protein almanac
of the world.” This massive database has been
made available in a searchable format and the
best part? It’s completely f ree!
https://www.longevity.technology/3d-structures-
of-all-protein-youll-have-to-google-that/
In Session Discussion
How can this resource be applied now?
Target Identification deals with identification of function of possible molecular target (genes/proteins of a small molecule) and its
role in a disease, which is aimed at finding the efficacy target of a drug. This requires evaluation of functional genomics,
structural genomics, proteomics, and cell based assays (in-vitro), animal research (in-vivo) assays.AI is being used to analyse
Drug Information Bank (contains drug candidates, gene expressions, protein-protein interactions and clinical data records) from
a public library for predicting therapeutic potential. For example, applying feature engineering by deep autoencoder, relief
algorithm and binary classification by Xgboost algorithm on ‘genome-wide protein interaction network, drugs and their targets
information’ to produce scores for potential targets to enable target prioritization.For finding of drug target site, discrete
chemicals can be encoded into continuous latent vector space, which permits gradient-based optimization in molecular space,
and allows predictions using graph convolutional network based on binding affinity and others properties.In the effort to
understand the detailed spatial 3D structure of proteins and molecular complexes, AI platform also rely on training computer
vision and machine learning models on cryo-EM microscope data (2D structure).Selection of drug candidates requires series of
desired properties especially pharmacokinetics, pharmacological and toxicity profile. AI drug design algorithm Reinforcement
Learning can be successfully deployed for a ‘Simplified molecular input line-entry system’string, which is a specification in the
form of line notation for describing Chemical species using short ASCII strings. It includes potential energy measurements,
molecular graphs with varying weights for atoms or bonds, coulomb matrices, molecular fragments or bonds, atomic
coordinates in 3D etc.
https://www.wipro.com/holmes/4-application-areas-of-artificial-intelligence-in-drug-discovery/
Tuesday, June 22, 2021 - Insilico
Medicine is pleased to announce that it
has entered into a collaboration with
Teva Branded Pharmaceutical Products
R&D, Inc. to utilize Insilico's generative
machine learning technology and
proprietary PandaOmics Drug Discovery
Platform, which aims at identifying
novel therapeutic targets implicated in
a variety of diseases.
Session 5
Survey of Companies in
AI Drug Discovery
230 Companies ??
https://blog.benchsci.com/startups-using-artificial-intelligence-in-drug-discovery
AI in Preclinical Research,
Biomarker Discovery and
Pipeline Management
Can we:
1. Reduce time?
2. Reduce cost?
3. Fail early?
4. Pick the best compounds?
5. Pick the best programs?
6. Increase success rate?
https://atomscience.org/
Industry Consortium
Atom is a partnership for collecting data for AI in drug discovery, particularly around problems with Pk and toxicity.
We will explore the Atom partnership and consortium.
Where does the data come
from?
DISCUSSION
https://www.outsourcing-
pharma.com/Article/2015/04/09/Predicting-oral-
bioavailability-is-better-with-AI-says-
PharmaInformatic
Preclinical CROs Using AI
https://electronichealthreporter.com/how-preclinical-cros-are-now-
using-future-technology-to-conduct-health-research/
DISCUSSION
https://blog.benchsci.com/startups-using-
artificial-intelligence-in-drug-
discovery#design_preclinical_experiments
-
https://blog.benchsci.com/startups-using-
artificial-intelligence-in-drug-
discovery#validate_and_optimize_drug_candidates
20 Minute Break
Download and Read the White Paper During break - to warm up
with AI in Biomarkers
https://www.iqvia.com/library/white-papers/breaking-new-ground-with-digital-
biomarkers
To be discussed
https://www.innoplexus.com/artificial-
intelligence-driven-biomarker-discovery/
41% of New Drug Approvals are Associated with
Biomarkers
Pay attention to the data sources used
https://www.innoplexus.com/technology/
Technologies Used
AI for Biomarker Discovery
an Active Research Topic
https://www.frontiersin.org/research-topics/18505/artificial-
intelligence-a-step-forward-in-biomarker-discovery-and-
integration-towards-improved-cance
An Example Application: Dermatitis https://www.pnas.org/content/117/52/33474
https://www.sciencedirect.com/science/article/pii/S2405471221002507
Biomarkers with AI for
Alzheimer's
https://www.technologynetworks.com/neuroscience/news/using-
advanced-ai-to-discover-diagnostic-biomarkers-and-drug-
targets-for-alzheimers-341638
https://www.tandfonline.com/doi/full/10.1080/17460441.2021.1918096
AI for Pipeline Management
What programs should I go forward with? Which ones should fail
early? AI can help....
https://www.iqvia.com/solutions/commercialization/brand-
strategy-and-management/product-pipeline
https://www.bloomberg.com/news/articles/2021-
02-05/out-of-grief-mit-s-andrew-lo-invented-a-
better-way-to-finance-biomedical-innovation
-
ROI on Pipeline is 12%, but beta is enormous. Need
150 programs (in oncology) to reduce the variance of
program success to allow financing by the bond
market.
Possible error in assumptions relate to correlation of
programs
This would require $30B.
How can AI do better?
Active Research...not fully solved
20 Minute Break
Brainstorm on Application of AI to Pipeline Management
Session 9
Management
More Accurate Assessment of Variance of Patient
Data Results in More Optimal Trial Design –
Reduce Cost by Including the Right Number of
Patients
Detection of Signals in Data That Humans
Cannot – to Better Assess Efficacy
https://deep6.ai/
Search EMR for Matching Patients at
Ways AI Potential Sites
Enhances NLP and Ontologies for
Clinical Extracting Info f rom EMR
Trial AI Based Advertising on Online
Enrollment Health and Wellness Sites
Better management of
multisite trials
ASSESS SITE REAL TIME AUTOMATE
AI for PERFORMANCE
(ENROLLMENTS,
DROPOUTS,
COMPLIANCE)
ANALYSIS OF
DIGITAL
BIOMARKERS
SHARING ACROSS
MULTIPLE SITES
[DOES NOT
REQUIRE AI]
Trial
Conduct SMART PHONE
ALERTS, E-
ALERTS FOR
MISSED VISITS,
TRACKING OF NONCOMPLIANCE
MEDICATION
COMPLIANCE
A Phase 3 failure is typically the end for a drug
program
Can We
Save a But what if a drug is safe and efficacious for a
subset of the population but not the entire
Drug if population?
Phase 3 Ai can analyze clinical data and EMR / Genomics
and determine if a drug can be saved for a portion
Fails? of the population
If this can be effectively done, the cost of drug
development goes down by eliminating a total
failure and providing a product for some people
Not yet proven, but a worthy exploration
Session 10
Future Directions
Targeted therapeutics are on-Off Switches
Modulated Biology Allows a New Paradigm
Optimal Controls of Pathways
Requires High Fidelity Systems Biology
AI Will Make a Difference Here
Draw f rom Literature Extraction, Pathways
Databases (ie KEGG), Protein Databases (ie
DeepMind)
Forge Network Model
Biochemical Reaction Chemistry forms Basis of
Dynamic Models
AI for System Identification
Drugs are Polypharmacy with multiple releases or
delivery mechanisms
Open Discussion
Future Directions of AI in DDD
Map data ©2021 Google