Professional Documents
Culture Documents
4 Ethyl 40 Toxicity Report
4 Ethyl 40 Toxicity Report
C23H23N2O2[?]
Molecular Weight: 359.44091 Actual Endpoint Non-Degradable Non-Degradable Non-Degradable
ALogP: 5.163 Predicted Endpoint Non-Degradable Non-Degradable Non-Degradable
Rotatable Bonds: 6 Distance 0.617 0.659 0.690
Acceptors: 4 Reference Environmental Toxicology Environmental Toxicology Environmental Toxicology
Donors: 1 & Chemistry 18(9), 1763- & Chemistry 18(9), 1763- & Chemistry 18(9), 1763-
1768, 1999. 1768, 1999. 1768, 1999.
Model Prediction
Model Applicability
Prediction: Non-Degradable
Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
Probability: 0.118 in the training set.
Enrichment: 0.271
1. All properties and OPS components are within expected ranges.
Bayesian Score: -10.6
Mahalanobis Distance: 18.5
Mahalanobis Distance p-value: 4.07e-025
Feature Contribution
Prediction: Positive if the Bayesian score is above the estimated Top features for positive contribution
best cutoff value from minimizing the false positive and false
negative rate. Fingerprint Bit/Smiles Feature Structure Score Degradable in
Probability: The esimated probability that the sample is in the training set
positive category. This assumes that the Bayesian score follows
a normal distribution and is different from the prediction using a SCFP_12 1242547645 0.501 4 out of 5
cutoff.
Enrichment: An estimate of enrichment, that is, the increased
likelihood (versus random) of this sample being in the category.
Bayesian Score: The standard Laplacian-modified Bayesian
score.
Mahalanobis Distance: The Mahalanobis distance (MD) is the
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly
inaccurate.
SCFP_12 392579710 0.422 3 out of 4
C23H23N2O2[?]
Actual Endpoint Moderate_Severe Mild Mild
Molecular Weight: 359.44091
Predicted Endpoint Moderate_Severe Mild Mild
ALogP: 5.163
Distance 0.577 0.621 0.625
Rotatable Bonds: 6
Reference CIGET* -;-;77 28ZPAK-;90;72 Prehled Prumyslove
Acceptors: 4 Toxikologie; Organicke
Donors: 1 Latky; Marhold; J. pp
648;86
Model Prediction
Model Applicability
Prediction: Mild
Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
Probability: 0.543 in the training set.
Enrichment: 0.788
1. All properties and OPS components are within expected ranges.
Bayesian Score: -5.69 2. Unknown FCFP_2 feature: -1151884458: ['?'][c](:[*]):[c](N):n:[*]
Mahalanobis Distance: 7.12
Mahalanobis Distance p-value: 0.997
Prediction: Positive if the Bayesian score is above the estimated
Feature Contribution
best cutoff value from minimizing the false positive and false
negative rate. Top features for positive contribution
Probability: The esimated probability that the sample is in the Fingerprint Bit/Smiles Feature Structure Score Moderate_Severe
positive category. This assumes that the Bayesian score follows
a normal distribution and is different from the prediction using a in training set
cutoff.
Enrichment: An estimate of enrichment, that is, the increased FCFP_10 -497728148 0.356 24 out of 25
likelihood (versus random) of this sample being in the category.
Bayesian Score: The standard Laplacian-modified Bayesian
score.
Mahalanobis Distance: The Mahalanobis distance (MD) is the
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly
inaccurate.
FCFP_10 1028934530 0.256 2 out of 2
C23H23N2O2[?]
Actual Endpoint Irritant Irritant Irritant
Molecular Weight: 359.44091
Predicted Endpoint Irritant Irritant Non-Irritant
ALogP: 5.163
Distance 0.573 0.613 0.616
Rotatable Bonds: 6
Reference CIGET* -;-;77 28ZPAK-;90;72 Prehled Prumyslove
Acceptors: 4 Toxikologie; Organicke
Donors: 1 Latky; Marhold; J. pp
648;86
Model Prediction
Model Applicability
Prediction: Irritant
Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
Probability: 0.976 in the training set.
Enrichment: 1.15
1. All properties and OPS components are within expected ranges.
Bayesian Score: 0.142 2. Unknown FCFP_2 feature: -1151884458: ['?'][c](:[*]):[c](N):n:[*]
Mahalanobis Distance: 6.84
Mahalanobis Distance p-value: 0.999
Prediction: Positive if the Bayesian score is above the estimated
Feature Contribution
best cutoff value from minimizing the false positive and false
negative rate. Top features for positive contribution
Probability: The esimated probability that the sample is in the Fingerprint Bit/Smiles Feature Structure Score Irritant in training
positive category. This assumes that the Bayesian score follows
a normal distribution and is different from the prediction using a set
cutoff.
Enrichment: An estimate of enrichment, that is, the increased FCFP_12 1747237384 0.208 44 out of 44
likelihood (versus random) of this sample being in the category.
Bayesian Score: The standard Laplacian-modified Bayesian
score.
Mahalanobis Distance: The Mahalanobis distance (MD) is the
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly
inaccurate.
FCFP_12 17 0.189 48 out of 49
C23H23N2O2[?]
Actual Endpoint Carcinogen Non-Carcinogen Non-Carcinogen
Molecular Weight: 359.44091
Predicted Endpoint Carcinogen Non-Carcinogen Non-Carcinogen
ALogP: 5.163
Distance 0.622 0.626 0.673
Rotatable Bonds: 6
Reference NTP364 NTP433 TR-446
Acceptors: 4
Donors: 1
Model Applicability
Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
Model Prediction in the training set.
Prediction: Non-Carcinogen
1. OPS PC11 out of range. Value: 3.8229. Training min, max, SD, explained variance: -2.3897,
Probability: 0.438 3.1905, 1.314, 0.0302.
Enrichment: 0.961
Bayesian Score: -2.82
Feature Contribution
Mahalanobis Distance: 10.9
Mahalanobis Distance p-value: 6.13e-005 Top features for positive contribution
Prediction: Positive if the Bayesian score is above the estimated Fingerprint Bit/Smiles Feature Structure Score Carcinogen in
best cutoff value from minimizing the false positive and false training set
negative rate.
Probability: The esimated probability that the sample is in the FCFP_12 -1861645784 0.387 6 out of 9
positive category. This assumes that the Bayesian score follows
a normal distribution and is different from the prediction using a
cutoff.
Enrichment: An estimate of enrichment, that is, the increased
likelihood (versus random) of this sample being in the category.
Bayesian Score: The standard Laplacian-modified Bayesian
score.
Mahalanobis Distance: The Mahalanobis distance (MD) is the
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly
inaccurate.
FCFP_12 342000123 0.344 1 out of 1
C23H23N2O2[?]
Molecular Weight: 359.44091 Actual Endpoint Irritant Irritant Irritant
ALogP: 5.163 Predicted Endpoint Irritant Non-Irritant Non-Irritant
Rotatable Bonds: 6 Distance 0.608 0.642 0.669
Acceptors: 4 Reference BCFAAI Bollettino 85JCAE "Prehled 85JCAE "Prehled
Donors: 1 Chimico Farmaceutico. Prumyslove Toxikologie; Prumyslove Toxikologie;
(Societa Editoriale Organicke Latky," Organicke Latky,"
Farmaceutica, Vi a Marhold, J., Prague , Marhold, J., Prague ,
Model Prediction Ausonio 12, 20123 Milan,
Italy) V.33- 1894-
Czechoslovakia,
Avicenum, 1986
Czechoslovakia,
Avicenum, 1986
Prediction: Irritant Volume(issue)/page/year: Volume(issue)/page/year: Volume(issue)/page/year:
Probability: 0.973 107,3 10,1968 -,725,1986 -,648,1986
Enrichment: 1.06
Bayesian Score: -0.742
Model Applicability
Mahalanobis Distance: 6.82 Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
in the training set.
Mahalanobis Distance p-value: 0.998
Prediction: Positive if the Bayesian score is above the estimated 1. All properties and OPS components are within expected ranges.
best cutoff value from minimizing the false positive and false
negative rate.
Probability: The esimated probability that the sample is in the
positive category. This assumes that the Bayesian score follows
Feature Contribution
a normal distribution and is different from the prediction using a
cutoff. Top features for positive contribution
Enrichment: An estimate of enrichment, that is, the increased Fingerprint Bit/Smiles Feature Structure Score Irritant in training
likelihood (versus random) of this sample being in the category.
Bayesian Score: The standard Laplacian-modified Bayesian set
score.
Mahalanobis Distance: The Mahalanobis distance (MD) is the
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly
inaccurate.
FCFP_12 -1038421835 0.0795 9 out of 9
C23H23N2O2[?]
Actual Endpoint Strong-Sensitizer Strong-Sensitizer Strong-Sensitizer
Molecular Weight: 359.44091
Predicted Endpoint Strong-Sensitizer Strong-Sensitizer Strong-Sensitizer
ALogP: 5.163
Distance 0.663 0.675 0.708
Rotatable Bonds: 6
Reference SAR and QSAR in Env Howard I Maibach (priv SAR and QSAR in Env
Acceptors: 4 Res (1994) 2:159 comm) Res (1994) 2:159
Donors: 1
Model Applicability
Model Prediction Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
Prediction: Strong-Sensitizer in the training set.
Probability: 0.992 1. All properties and OPS components are within expected ranges.
Enrichment: 1.28 2. Unknown FCFP_2 feature: -1861645784: [*]:[cH]:[c](:[cH]:[*])[c](:[*]):[*]
Bayesian Score: 4.26 3. Unknown FCFP_2 feature: 690511177: [*]:[cH]:[c](:n:[*])[c](:[*]):[*]
Mahalanobis Distance: 6.5
Mahalanobis Distance p-value: 0.243 Feature Contribution
Prediction: Positive if the Bayesian score is above the estimated
best cutoff value from minimizing the false positive and false Top features for positive contribution
negative rate.
Probability: The esimated probability that the sample is in the Fingerprint Bit/Smiles Feature Structure Score Strong-Sensitizer
positive category. This assumes that the Bayesian score follows in training set
a normal distribution and is different from the prediction using a
cutoff. FCFP_12 16 0.232 165 out of 165
Enrichment: An estimate of enrichment, that is, the increased
likelihood (versus random) of this sample being in the category.
Bayesian Score: The standard Laplacian-modified Bayesian
score.
Mahalanobis Distance: The Mahalanobis distance (MD) is the
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly
inaccurate.
FCFP_12 1618154665 0.232 164 out of 164
C23H23N2O2[?]
Actual Endpoint (-log C) 5.49293 5.39369 2.8543
Molecular Weight: 359.44091
Predicted Endpoint (-log 4.9569 4.27874 3.40838
ALogP: 5.163 C)
Rotatable Bonds: 6 Distance 0.563 0.564 0.629
Acceptors: 4 Reference CPDB CPDB CPDB
Donors: 1
Model Applicability
Model Prediction Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
Prediction: 0.726 in the training set.
Unit: mg/kg_body_weight/day 1. OPS PC19 out of range. Value: -3.4242. Training min, max, SD, explained variance: -2.9709,
Mahalanobis Distance: 15.9 5.6065, 1.282, 0.0158.
Mahalanobis Distance p-value: 5.61e-014 2. OPS PC20 out of range. Value: -4.2597. Training min, max, SD, explained variance: -3.9266,
5.5565, 1.236, 0.0147.
Mahalanobis Distance: The Mahalanobis distance (MD) is a
generalization of the Euclidean distance that accounts for
correlations among the X properties. It is calculated as the
distance to the center of the training data. The larger the MD, the Feature Contribution
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
Top features for positive contribution
given sample, assuming normally distributed data. The smaller Fingerprint Bit/Smiles Feature Structure Score
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly FCFP_6 136627117 0.69
inaccurate.
FCFP_6 -1861645784 0.359
FCFP_6 17 -0.149
Molecule TOPKAT_Chronic_LOAEL
Structural Similar Compounds
Name ISOXABEN ASSURE RHODAMINE 6G
Structure
Feature Contribution
Top features for positive contribution
Fingerprint Bit/Smiles Feature Structure Score
ECFP_6 1559650422 0.129
FCFP_6 3 0.0924
Feature Contribution
Top features for positive contribution
Fingerprint Bit/Smiles Feature Structure Score
ECFP_6 -1059365320 0.165
FCFP_2 0 -0.275
C23H23N2O2[?]
Molecular Weight: 359.44091
ALogP: 5.163
Actual Endpoint (-log C) 2.2548 1.6031 1.7472
Rotatable Bonds: 6
Predicted Endpoint (-log 1.69815 2.17352 2.15944
Acceptors: 4 C)
Donors: 1 Distance 0.620 0.683 0.685
Reference MDACAP Medicamentos PEMNDP Pesticide NNGADV Nippon Noyaku
Model Prediction de Actualidad. (J.R.
Prous; S.A.; Apartado de
Manual. (The British Crop
Protection Council; 20
Gakkaishi. Journal of the
Pesticide Science Society
Prediction: 1.49e+004 Correos 54 0; 08080 Bridport R d.; Thornton of Japan. (Nippon Noyaku
Unit: mg/m3/h Barcelona; Spain) V.1- Heath CR4 7QG; UK) Gakkai; 1-43-11;
1965- V.1- 1968- Komagome; Toshima-ku;
Mahalanobis Distance: 13.1 Volume(issue)/page/year: Volume(issue)/page/year: Tokyo 170; Japan) V.1-
21;227;1985 9;157;1 991 1976-
Mahalanobis Distance p-value: 3.83e-009 Volume(issue)/page/year:
Mahalanobis Distance: The Mahalanobis distance (MD) is a 15;125;1990
generalization of the Euclidean distance that accounts for
correlations among the X properties. It is calculated as the
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Model Applicability
Mahalanobis Distance p-value: The p-value gives the fraction of Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller in the training set.
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly 1. OPS PC21 out of range. Value: -3.4664. Training min, max, SD, explained variance: -3.0247,
inaccurate. 4.4972, 1.058, 0.0155.
2. Unknown ECFP_2 feature: -1305021906: [*]['?']
3. Unknown ECFP_2 feature: -428002189: [*]:[cH]:[c](:n:[*])[c](:[*]):[*]
4. Unknown ECFP_2 feature: -1734834311: ['?'][c](:[*]):[c](N):n:[*]
5. Unknown ECFP_2 feature: -1659020767: [*][c](:[*]):[c](['?']):[c]([*]):[*]
6. Unknown ECFP_2 feature: 432684389: ['?'][c](:[*]):[*]
Feature Contribution
Top features for positive contribution
Fingerprint Bit/Smiles Feature Structure Score
ECFP_2 642810091 0.214
FCFP_2 3 0.0737
FCFP_2 1 -0.0796
FCFP_2 16 -0.0512
Molecule TOPKAT_Rat_Maximum_Tolerated_Dose_Gavage
Structural Similar Compounds
Name PHENYLBUTAZONE PROMETHAZINE.HCL 8-METHOXYPSORALEN
Structure
FCFP_2 3 0.104
FCFP_2 0 -0.29
Molecule TOPKAT_Rat_Oral_LD50
Structural Similar Compounds
Name BUFEZOLAC INDOMETHAZINE DIFENAMIZOLE
Structure
Feature Contribution
Top features for positive contribution
Fingerprint Bit/Smiles Feature Structure Score
ECFP_6 642810091 0.281