Professional Documents
Culture Documents
jm7b00935 Si 001
jm7b00935 Si 001
§
Roche Pharma Research and Early Development, Roche Innovation Center Basel,
Switzerland, ‡Genentech Inc, 1 DNA Way, South San Francisco, CA 94080, •AstraZeneca plc,
Milton Rd, Milton, Cambridge, CB4 0FZ, †MedChemica Ltd, Biohub Alderley Park, Macclesfield.
Cheshire SK10 4TG.
Contents
Additional detail and diagrams showing the capture of the local chemical environment round a
The data merging approach & Flowchart showing the method for assigning rules as significant
(S5 – S7)
S1
Importance of capturing the chemical environment (S8 – S13)
Solubility vs logD plots, confusion tables, and unexpected rules (S14 – S21)
PPB: Comparison with logD confusion matrix and unexpected rules (S32 – S35)
S2
MMP Generation
Examples of encoding different levels of environments for functional group inter-conversions
Molecule Pair
Molecule A → Molecule B
1 bond environment
[c:1]([H])>>[c:1][C]([H])([H])([H])
2 bond environment
[c:1]([H])[c:2]([H])[c:3]([H])>>[c:1]([H])[c:2]([c:3]([H]))[C]([H])([H])([H])
3 bond environment
[c:1]([H])[c:2]([H])[c:3]([H])[c:4]([H])[n:5]>>[c:1]([H])[c:2]([H])[c:3]([c:4]([H])
[n:5])[C]([H])([H])([H])
4 bond environment
[c:1]1([H])[c:2]([H])[c:3]([H])[n:4][c:5][c:6]1([H])>>[C]([H])([H])([H])[c:2]1[c:
1]([H])[c:6]([H])[c:5][n:4][c:3]1([H])
Figure S1: Functional group transformations with different environment specification. The green
groups are those being changed in the transformation, the red portions of the transformations
shows the increasing level of environment specification increasing in all directions from the point
of change. The blue atom numbers correspond to the atom mappings in the SMIRKS
transformation encoding, these change with environment as the SMIRKS are canonicalized.
S3
The encoding of a linker exchange (see Figure S2) and a core replacement (see Figure S3) are
also captured with the analogous environments to the functional group inter-conversion. In the
example shown with the amide to sulfonamide conversions the SMARTS specification used in
the Hussain and Rea method avoids cutting the amide N-C(=O)single bond or the sulfonamide
Molecule A → Molecule B
[c:1][C](=[O])[N]([H])[C:2]([H])>>[c:1][S](=[O])(=[O])[N]([H])[C:2]([H])
Molecule A → Molecule B
S4
[C:1]([H])([H])[C]1([H])[C]([H])([C](=[O])[N]1[C:2])[C:3]([H])>>
[C:1]([H])([H])[C]1([H])[C]([H])([S](=[O])(=[O])[N]1[C:2])[C:3]([H])
Data Merging
The data for each endpoint was aggregated and filtered according to the procedure shown
in Figure S4.
S5
Four classes of rule are defined: increase, decrease, neutral and No Effect Determined
(NED). The latter indicates that although there was sufficient data, there was not sufficient
The neutral rules were those that reproducibly made very little change to an endpoint. One
of the principles of our analysis was not to assume in advance any distribution for the underlying
data, therefore a simple binomial test was used to indicate if a transformation had statistical
support for significantly increasing or decreasing a property. This has the considerable
range to an out of range, such as some measured hERG inhibition to out of range inactive
hERG inhibition is highly valuable knowledge. A consequence of choosing the binomial test and
a p value of 0.05, is that all transformations require a minimum of 6 supporting examples as this
is the first point at which, if every example pair is in the same direction, there is a <=5% chance
In using the rules extracted, care needs to be exercised in how NED rules are interpreted.
For the comparison of sets of rules between assays, for example, in relating human liver
S6
increase in one assay are more variable in another assay and so yield a NED result. However,
when exploring the effect of a specific transformation, if the transformation is a NED in one
assay system the message is that the distribution of the current data is too variable to be
classified with a greater than 95% level of certainty that it is not merely random variation. Under
these circumstances any rule that is classed as a NED should be treated with caution when
logD7.4 153,449
Solubility 46,655
S7
Cardiac ion channels 15,636
environments. These allow us to explore the effect chemical environment exerts upon the effect
of chemical modifications. For example the replacement H→Me has been explored in 1,550
different local chemical environments as stored in the GRD. For the 225 different
average change in solubility can be as different as 3.4 log units, depending on the environment.
As shown in Figure S7, the median change in solubility for H→Me is -0.22 log units, however
the range is more than 3.4 log units. Even discounting the transformations that involve
methylation of an anion i.e. CO2H→CO2Me (Figure S6, left), there is still a 2.5 log range in the
S8
Figure S6: H→Me transformations effect on solubility
The benefit of specificity of the environments is also shown by the histograms below (the
overall median effect is shown as a solid vertical line). The larger magnitude changes in
solubility(environment level 4) are associated with the more specific rules, the blending of these
examples together in the less specific rules(environment level 1,2) would be misleading as it
would miss the opportunities and risks inherent to specific environments. This form of analysis
S9
Figure S7: Deeper environment level specification for H Me transformations reveals rules
Three of the examples of H→Me transformations giving large increases in solubility are
shown in Figure S8 and are consistent with the importance of conformational changes.
[c:1]([H])[c:2]([c:3])[N:4]([H])[C:5](=[O:6])[C:7]([H])([H])>>
[c:1]([H])[c:2]([c:3])[N:4]([C]([H])([H])([H]))[C:5](=[O:6])[C:7]([H])([H])
S10
[C:1][c:2]1[n:3]([H])[c:4]([n:5])[c:6][n:7]1>>[
C]([H])([H])([H])[n:3]1[c:2]([n:7][c:6][c:4]1[n:5])[C:1]
[c:1]1([H])[c:2]([c:3][n:4][n:5]1)[N:6]([H])>>
[C]([H])([H])([H])[c:1]1[c:2]([c:3][n:4][n:5]1)[N:6]([H])
Figure S8: HMe transformation examples that give a large increase in solubility
For the same transformation, 278 different environments are observed that give a
distribution around a median increase of 0.28 log Clint, but again with a few environments
where the H→Me transformation gives a consistent decrease in metabolic clearance. Where the
presumably associated with the ester hydrolysis or demethylation pathways now becoming
available.
S11
Figure S9: H→Me transformations effect on human microsomal clearance
The highly popular functional group replacement of H→F as a “metabolic fix” has 37
different environments where there is sufficient statistical support to comment. Although the
median change is precisely 0, implying in general no effect on metabolism, there are specific
environments where a H→F replacement is either highly effective or highly deleterious (see
Figures S10, S11). This may explain the continuing popularity with medicinal chemists of trying
H→F, in that it works well under highly specific circumstances, modestly under more, but
generally is a disappointment. The human bias to remember the successful cases can be
S12
[c:1]1([H])[c:2]([H])[c:3][c:4]([H])[c:5]([c:6]1([H]))[C:7](=[O:8])[N:9]([H])>>
[c:2]1([H])[c:1]([H])[c:6]([c:5]([c:4]([H])[c:3]1)[C:7](=[O:8])[N:9]([H]))[F]
[c:1][N:2]([H])[c:3]1[c:4]([H])[c:5][c:6]([H])[c:7]([c:8]1([H]))[C:9]([H])([F:10])[F:11]>>
[c:1][N:2]([H])[c:3]1[c:4]([H])[c:5][c:6]([H])[c:7]([c:8]1[F])[C:9]([H])([F:10])[F:11]
Figure S11: HF transformations have very different clearance effects in different environments
S13
Solubility
Figure S12: Solubility vs logD effects, min 50 pairs per rule. Red line = line of equality.
Linear fit Blue line: R2 = 0.70, slope = -1.4, green density ellipses at 50% and 99% of the data.
S14
Figure S13: Solubility vs logD effects, min. 20 pairs per rule. Black line = line of equality.
S15
Solubility
NED 33 33 128
Table S2: Confusion Matrix for Solubility-logD increase/neutral/decrease/NED rules for all
Solubility
neutral 5 4 5 30
Table S3: Confusion Matrix for Solubility-logD increase/neutral/decrease/NED rules for all
S16
Solubility
neutral 7 12 7 48
S17
Figure S14: Effect of change in Donor Count on logD/ Solubility (number of acceptors kept
constant). Increase in Donors = orange, decrease in donors = blue. Outliers are almost
exclusively related to changing a tertiary amine (unprotonated, does not count as donor in our
analysis) into something with more donors, e.g. hydroxyl group. Only rules based on more than
Figure S15: Effect of change in acceptor count on logD/ Solubility (number of donors kept
constant). Increase in acceptors = orange, decrease in acceptors = blue. Only rules based on
some exceptions. Table S5 shows five transformations which on average are logD neutral but
S18
ΔlogD ±std ΔlogSol ±std
Transformation (nPairs) (nPairs)
[C:1]([H])([H])[C:2]([H])([H])[N]1[C]([H])([H])[C]([H])([
H])[O][C]([H])([H])[C]([H])([H])1>>
[C:1]([H])([H])[C:2]([H])([H])[N]1[C]([H])([H])[C]([H])([
H])[C]([H])([H])[C]([H])([H])[C]([H])([H])1
[c:1]([H])[c:2]([n:3])[C]#[N]>>
[c:1]([H])[c:2]([H])[n:3]
[c:1][N]1[C]([H])([H])[C]([H])([H])[N]([C]([H])([H])[C]([
H])([H])1)[C](=[O])[C]([H])([H])([H])>>
[c:1][N]1[C]([H])([H])[C]([H])([H])[N]([C]([H])([H])[C]([
H])([H])1)[C]([H])([H])([H])
[c:1][S](=[O])(=[O])[C]([H])([H])([H])>>[c:1][C](=[O])[
N]([C]([H])([H])([H]))[C]([H])([H])([H])
solubility. Note that although the std. of the effects can be rather large, the mean is estimated
with high certainty, because there are more than 50 pairs for each transformation.
S19
There are also transformations that significantly reduce logD without affecting solubility on
[C]([H])([H])([H])[C]([C]([H])([H])([H]))([C:1]([H])([H]))[
O:2]([H])>>
[C:1]([H])([H])[C]([H])([H])[O:2]([H])
[c:1]([H])[c:2]([H])[c:3]([c:4]([H])[c:5])[C]([H])([H])([H])
>>
[c:1]([H])[c:2]([H])[c:3]([c:4]([H])[c:5])[C]#[N]
[c:1][C](=[O])[N]([H])[C]([H])([C]([H])([H])([H]))[C]([H])
([H])([H])>>
[c:1][C](=[O])[N]([H])[C]1([H])[C]([H])([H])[C]([H])([H])[
O][C]([H])([H])[C]([H])([H])1
[c:1][N]([H])[C]([H])([H])([H])>>
[c:1][N]([H])[C]1([H])[C]([H])([H])[O][C]([H])([H])1
[c:1]([H])[c:2]([c:3])[F]>>
[c:1]([H])[c:2]([c:3])[C]#[N]
S20
Table S6: Transformations that on average keep solubility constant and significantly decrease
logD.
Among all rules based on more than 50 pairs, there is only one transformation that on
average increases both logD and solubility by more than 0.3 log units. This is shown in Table
S7.
[c:1]1([H])[c:2]([c:3]([H])[c:4]([c:5][c:6]1[Cl:7])[Cl:8])[
C]#[N]>>
[c:2]1([H])[c:1]([H])[c:6]([c:5][c:4]([c:3]1([H]))[Cl:8])[Cl
:7]
Table S7: Transformation that on average increases Solubility and logD.
S21
Clearance
ID Transformation Hu LM Clearance Hepatocyte
median change ± Clearance median
std ( nPairs, change ± std
percentage of (nPairs,
indicated percentage of
changes) indicated changes)
direction direction
[c:1][N]([H])[c]1[c]([H])[n][n]([c]([H])1)[C]([
H])([H])([H])>>[c:1][N]([H])[c]1[c]([H])[c]([
H])[n][n]1[C]([H])([H])([H])
[c:1][S](=[O])(=[O])[N]([C]([H])([H])([H]))[C
]([H])([H])([H])>>[c:1][S](=[O])(=[O])[C]([H
])([H])([H])
[c:1][c]1[c]([H])[n][c]([H])[n][c]([H])1>>[c:1]
[C](=[O])[N]([H])[C]([H])([H])([H])
S22
5 -0.61±0.52 -0.95±0.28 (7,100)
(80,85) decrease decrease
[c:1][C]([H])([H])[O]([H])>>[c:1][C](=[O])[O
]([H])
6 -0.52±0.51 -0.73±0.39
(33,84) decrease (11,100) decrease
[c:1][O][C]([H])([H])[c]1[c]([H])[c]([H])[c]([H
])[c]([H])[c]([H])1>>[c:1][O][C]([H])([H])[C](
[H])([H])([H])
[c:1][C]([H])([H])([H])>>[c:1][S](=[O])(=[O]
)[C]([H])([H])([H])
[c:1][N]([H])[C](=[O])[C]1([H])[C]([H])([H])[
C]([H])([H])1>>[c:1][N]([H])[C](=[O])[N]([H
])[C]([H])([H])([H])
[c:1][O][C]([H])([H])([H])>>[c:1][C](=[O])[N
]([H])([H])
S23
[C]([H])([H])([H])[C:1]>>[C:1][O]([H])
[c:1][C](=[O])[C]([H])([H])([H])>>[c:1][C](=[
O])[N]([H])[C]([H])([H])([H])
[c:1][C]([H])([H])[N]1[C]([H])([H])[C]([H])([
H])[O][C]([H])([H])[C]([H])([H])1>>[c:1][C](
[H])([H])[N]1[C]([H])([H])[C]([H])([H])[N]([C
]([H])([H])[C]([H])([H])1)[C]([H])([H])([H])
[c:1][N]1[C]([H])([H])[C]([H])([H])[N]([C]([H
])([H])[C]([H])([H])1)[C]([H])([H])([H])>>[c:
1][N]1[C]([H])([H])[C]([H])([H])[N]([H])[C]([
H])([H])[C]([H])([H])1
14 -0.39±0.48 -0.61±0.26
(116,85) (30,100) decrease
decrease
[c:1]SC([H])([H])([H])>>[c:1]OC([H])([H])([
H])
S24
O])[C]([H])([H])([H])
[C]([H])([H])([H])[C]([H])([C]([H])([H])([H]))[
C]([H])([H])[C:1]>>[C]([H])([H])([H])[C:1]
[H][C@@]1([C]([H])([H])[O][C]([H])([H])[C]
([H])([H])[N]1[c:1])[C]([H])([H])([H])>>[c:1][
N]1[C]([H])([H])[C]([H])([H])[O][C]([H])([H])
[C]([H])([H])1
[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H]
)1>>[C:1][C]1([H])[C]([H])([H])[C]([H])([H])
[O][C]([H])([H])[C]([H])([H])1
Table S8: The top 20 transformations that significantly decrease both human liver
microsomal and hepatocyte clearance. The rules were selected by the following criteria: 1.the
rule has more than 20 examples for human liver microsomal clearance. 2.the rule decreases
both human liver microsomal clearance and hepatocyte clearance. 3.the rule has example pairs
from multiple companies 4.only single atom environments are were included to balance
generality and specificity for the rule. 5.very similar rules were also removed to maximize
S25
ID Transformation Hu LM Clearance logD
median change ± median change ±
std (nPairs) std (nPairs)
direction direction
[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H]
)1>>[C:1][c]1[c]([H])[c]([H])[c]([c]([H])[c]([H
])1)[Cl]
[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H]
)1>>[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([c]([H
])1)[Cl]
[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H]
)1>>[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c
]1[Cl]
[c:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H])
1>>[c:1][c]1[c]([H])[c]([H])[c]([c]([H])[c]([H])
1)[Cl]
[c:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H])
1>>[c:1][c]1[c]([H])[c]([H])[c]([H])[c]([c]([H])
S26
1)[Cl]
[c:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H])
1>>[c:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]
1[Cl]
[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H]
)1>>[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c
]1[F]
[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H]
)1>>[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([c]([H
])1)[F]
[C:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H]
)1>>[C:1][c]1[c]([H])[c]([H])[c]([c]([H])[c]([H
])1)[F]
[c:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H])
1>>[c:1][c]1[c]([H])[c]([H])[c]([c]([H])[c]([H])
1)[F]
S27
11 0.0±0.32 (47) 0.2±0.44 (92)
NED increase
[c:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H])
1>>[c:1][c]1[c]([H])[c]([H])[c]([H])[c]([c]([H])
1)[F]
[c:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]([H])
1>>[c:1][c]1[c]([H])[c]([H])[c]([H])[c]([H])[c]
1[F]
Table S9. The Summary of the in-vitro human microsomal clearance change of ortho, meta
microsomal
logD
Clearance median
ID median change ± std
Transformation change ± std (nPairs)
(nPairs) direction
direction
S28
-0.23±0.36 (18) 0.73±0.61 (26)
3
[c:1][c]1[c]([H])[c]([H])[n][c]([H])[c]([H]) decrease increase
1>>
[c:1][c]1[c]([H])[c]([H])[c]([c]([H])[c]([H])
1)[C]#[N]
microsomal
logD median
Clearance median
ID Transformation change ± std
change ± std (nPairs)
(nPairs) direction
direction
S29
)([H])[C]([H])([H])[N]([C]([H])([H])[C]([H
])([H])1)[C]([H])([H])[C:1]>>
[C]([H])([H])([H])[N]1[C]([H])([H])[C]([H]
)([H])[N]([C]([H])([H])[C]([H])([H])1)[C]([
H])([H])[C:1]
ID microsomal
logD
Transformation Clearance median
median change ± std
change ± std (nPairs)
(nPairs) direction
direction
S30
[c:1][C](=[O])[N]([H])[S](=[O])(=[O])[N]( -1.14±0.6 (14) -0.33±0.59 (13)
2
[H])[C]([H])([H])([H])>> decrease decrease
[c:1][C](=[O])[N]([H])[S](=[O])(=[O])[C](
[H])([H])[C]([H])([H])[O][C]([H])([H])([H]
)
S31
Plasma Protein Binding
neutral 15 42 15 46
(nPairs) (nPairs)
[c:1][c:2]1[c:3]([H])[c:4]([c:5]([H])[n:6][c:7]1([H]))[C](
=[O])[O]([H])>>
[c:1][c:2]1[c:3]([H])[c:4]([c:5]([H])[n:6][c:7]1([H]))[C](
=[O])[N]([H])([H])
1.84 ± 0.52 (20) 0.57 ± 0.33 (13)
[C:1]([H])([H])[C]([H])([H])[C](=[O])[O]([H])>>
[C:1]([H])([H])[C]([H])([H])[C](=[O])[N]([H])([H])
S32
1.56 ± 0.76 (60) 0.93 ± 0.56 (30)
[c:1][c]1[n]([H])[n][n][n]1>>
[c:1][C](=[O])[N]([H])([H])
2.32 ± 0.90 (151) 0.35 ± 0.54 (74)
[c:1][C](=[O])[O]([H])>>
[c:1][C]([H])([H])[O]([H])
2.36 ± 0.62 (64) 0.35 ± 0.24 (38)
[c:1][c]1[c]([H])[c]([c]([H])[n][c]([H])1)[C](=[O])[O]([H
])>>
[c:1][c]1[c]([H])[n][c]([H])[n][c]([H])1
Table S14. Five selected rules that increases logD and increases the log(free/bound) for plasma
protein binding
(11)
[c:1]CCNc1ccc(cc1)[c:2]>>
[c:1]CCNC(=O)c1ccc(cc1)[c:2]
[c]1([H])[c]([c]([c]([H])[c]([c]1[F])[Cl])[F])[N:1]>>
[c]1([H])[c]([H])[c]([c]([c]([H])[c]1[C]([F])([F])[F])[N:1])[
F]
[c:1][N:2]([C:3]([H])([H])([H]))[c]1[c]([H])[c]([H])[c]([c]([
S33
c]([H])1)[C]([H])([H])[O]([H]))[C]([H])([H])([H])>>
[c:1][N:2]([C:3]([H])([H])([H]))[c]1[c]([H])[c]([c]([H])[c]([
H])[c]1[C]([H])([H])([H]))[C]([H])([H])[O]([H])
[c:1][c:2]([n:3])[N]([H])[c]1[c]([H])[c]([H])[c]([H])[c]([H])[
c]1[F]>>
[c:1][c:2]([n:3])[N]([H])[C]([H])([H])[C]1([H])[C]([H])([H]
)[C]([H])([H])1
[c:1][n:2]([c:3])[C]1([C]([H])([H])[C]([H])([H])[O][C]([H])
([H])[C]([H])([H])1)[C]([H])([H])([H])>>
[H][C@]1([C]([H])([H])[C@@]([C]([H])([H])1)([O][C]([H
])([H])([H]))[H])[n:2]([c:1])[c:3]
Table S15. Five selected rules that are logD neutral but increase the free proportion in human
plasma.
(nPairs) (nPairs)
[c:1][N:2]([H])[S:3]>>
[c:1][N:2]([C]([H])([H])([H]))[S:3]
0.48 ± 0.29 (11) 0.42 ± 0.28 (12)
[c:1][O][c]1[c]([H])[c]([H])[c]([c]([H])[c]([H])1)[C:2]([H])>>
[c:1][C]([H])([H])[c]1[c]([H])[c]([H])[c]([c]([H])[c]([H])1)[C:
2]([H])
S34
0.69 ± 0.63 (22) 0.36 ± 0.36 (21)
[c:1][c]1[c]([H])[c]([c]([n][c]([H])1)[O][C:2]([H])([H])([H]))[
C:3]>>
[c:1][c]1[c]([H])[c]([c](=[O])[n]([c]([H])1)[C:2]([H])([H])([H
]))[C:3]
0.33 ± 0.14 (11) 0.32 ± 0.34 (11)
[c:1][n:2]([c:3])[C]1([C]([H])([H])[C]([H])([H])[O][C]([H])([
H])[C]([H])([H])1)[C]([H])([H])([H])>>
[H][C@@]1([C]([H])([H])[C]([H])([H])[C@]([C]([H])([H])1
)([H])[O][C]([H])([H])([H]))[n:2]([c:1])[c:3]
Table S16: Four selected rules that are increase logD but increase the log(free/bound)human
PPB.
S35
Comparison with rules published in Papadatos’2010 paper
In 2010, Papadatos et al published the first MMP paper highlighting the importance of the
chemical context within which transformations take place. In Table S17, we compare some of
the rules they found using their specific context description with our findings. Note that
Papadatos et al use a different scheme for encoding the chemical context, so it is not possible
to directly compare the rule statistics from GRDv3 with their rule statistics. Nevertheless, Table
S17 shows that the rules we find are in qualitative agreement with the rules found by Papadatos
et al. There is very good qualitative agreement between the effect on solubility of the piperidine
morpholine has a detrimental effect on solubility, whereas within an aromatic environment, this
has a beneficial effect. The structural reason for this is most probably the change of the pKa
within the aliphatic environment. This is the strongest context dependence seen, and there is a
very good agreement between Papadatos et al’s and our data. If the effects are weaker (the
other two examples in Table S17), the agreement is qualitatively still good, but it is hard to judge
more details due to the different encoding of the environment and the absence of quantitative
S36
Hydrophobic aromatic ring neutral 0.01 ± 0.29
(p-phenyl) (137)
[c:1]1([H])[c:2]([H])[c:3]([H])[
c:4][c:5]([H])[c:6]1([H])>>[C]([
H])([H])([H])[O][c:1]1[c:2]([H])
[c:3]([H])[c:4][c:5]([H])[c:6]1([
H])
[c:1]1([H])[c:2]([H])[c:3][c:4]([
H])[n:5][c:6]1([H])>>[C]([H])([
H])([H])[O][c:6]1[c:1]([H])[c:2]
([H])[c:3][c:4]([H])[n:5]1
Solubility Piperidine >> Polar aromatic ring increase 0.78 ± 0.87 (62)
Morpholine (aromatic ring)
[c:1][N]1[C]([H])([H])[C]([H])(
[H])[C]([H])([H])[C]([H])([H])[C
]([H])([H])1>>[c:1][N]1[C]([H])
([H])[C]([H])([H])[O][C]([H])([H
])[C]([H])([H])1
[C:1]([H])([H])[N]1[C]([H])([H
])[C]([H])([H])[C]([H])([H])[C]([
H])([H])[C]([H])([H])1>>[C:1]([
H])([H])[N]1[C]([H])([H])[C]([H
])([H])[O][C]([H])([H])[C]([H])([
H])1
S37
logD H >> CN Aromatic ring with H-bond Neutral - -0.10 ± 0.33
acceptor (p-3-N-pyridine) increase (43)
[c:1]1([H])[c:2]([H])[c:3]([H])[
n:4][c:5][c:6]1([H])>>[c:6]1([H
])[c:1]([H])[c:2]([c:3]([H])[n:4][
c:5]1)[C]#[N]
[C:1]([H])([H])([H])[C:2]([H
])([H])>>[C:2]([H])([H])[C:1]([
H])([H])[C]#[N]
Table S17: Comparison with context-specific MMP rules previously identified by Papadatos
S38
Single Company versus GRDv3 rules
Roche
GRD
neutral 204 7041 204 967
Table S18: Qualitative change in rule direction for logD rules based Roche pairs alone and
S39
Roche
neutral 0 80 0 0
NED 15 42 15 799
Table S19: Qualitative change in rule direction for logD rules based on Roche pairs alone and
S40
Roche
Table S20: Qualitative change in rule direction for human microsomal clearance rules based on
S41