Professional Documents
Culture Documents
4 Ethyl 39 Toxicity Report
4 Ethyl 39 Toxicity Report
C27H21N2[?]
Molecular Weight: 373.46903 Actual Endpoint Moderate_Severe Mild Mild
ALogP: 6.545 Predicted Endpoint Mild Mild Mild
Rotatable Bonds: 3 Distance 0.609 0.688 0.697
Acceptors: 2 Reference 28ZPAK-;242;72 28ZPAK-;74;72 Prehled Prumyslove
Donors: 1 Toxikologie; Organicke
Latky; Marhold; J. pp
648;86
Model Prediction
Prediction: Mild Model Applicability
Probability: 0.654 Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
Enrichment: 0.949 in the training set.
Bayesian Score: -4.27 1. All properties and OPS components are within expected ranges.
Mahalanobis Distance: 6.59 2. Unknown FCFP_2 feature: -1151884458: ['?'][c](:[*]):[c](N):n:[*]
Mahalanobis Distance p-value: 1
Prediction: Positive if the Bayesian score is above the estimated
best cutoff value from minimizing the false positive and false
Feature Contribution
negative rate.
Probability: The esimated probability that the sample is in the Top features for positive contribution
positive category. This assumes that the Bayesian score follows Fingerprint Bit/Smiles Feature Structure Score Moderate_Severe
a normal distribution and is different from the prediction using a
cutoff. in training set
Enrichment: An estimate of enrichment, that is, the increased
likelihood (versus random) of this sample being in the category. FCFP_10 -497728148 0.356 24 out of 25
Bayesian Score: The standard Laplacian-modified Bayesian
score.
Mahalanobis Distance: The Mahalanobis distance (MD) is the
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly
inaccurate.
FCFP_10 136120670 0.206 53 out of 65
C27H21N2[?]
Molecular Weight: 373.46903 Actual Endpoint Irritant Irritant Irritant
ALogP: 6.545 Predicted Endpoint Irritant Irritant Non-Irritant
Rotatable Bonds: 3 Distance 0.603 0.680 0.681
Acceptors: 2 Reference 28ZPAK-;242;72 CIGET* -;-;77 Prehled Prumyslove
Donors: 1 Toxikologie; Organicke
Latky; Marhold; J. pp
648;86
Model Prediction
Prediction: Irritant Model Applicability
Probability: 1 Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
Enrichment: 1.18 in the training set.
Bayesian Score: 0.94 1. All properties and OPS components are within expected ranges.
Mahalanobis Distance: 5.81 2. Unknown FCFP_2 feature: -1151884458: ['?'][c](:[*]):[c](N):n:[*]
Mahalanobis Distance p-value: 1
Prediction: Positive if the Bayesian score is above the estimated
best cutoff value from minimizing the false positive and false
Feature Contribution
negative rate.
Probability: The esimated probability that the sample is in the Top features for positive contribution
positive category. This assumes that the Bayesian score follows Fingerprint Bit/Smiles Feature Structure Score Irritant in training
a normal distribution and is different from the prediction using a
cutoff. set
Enrichment: An estimate of enrichment, that is, the increased
likelihood (versus random) of this sample being in the category. FCFP_12 1747237384 0.208 44 out of 44
Bayesian Score: The standard Laplacian-modified Bayesian
score.
Mahalanobis Distance: The Mahalanobis distance (MD) is the
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly
inaccurate.
FCFP_12 17 0.189 48 out of 49
C27H21N2[?]
Actual Endpoint Non-Carcinogen Non-Carcinogen Non-Carcinogen
Molecular Weight: 373.46903
Predicted Endpoint Non-Carcinogen Non-Carcinogen Non-Carcinogen
ALogP: 6.545
Distance 0.584 0.584 0.650
Rotatable Bonds: 3
Reference TR-446 NTP446 TR-40
Acceptors: 2
Donors: 1
Model Applicability
Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
Model Prediction in the training set.
Prediction: Carcinogen
1. OPS PC11 out of range. Value: 3.7268. Training min, max, SD, explained variance: -2.3897,
Probability: 0.534 3.1905, 1.314, 0.0302.
Enrichment: 1.17
Bayesian Score: 0.507
Feature Contribution
Mahalanobis Distance: 10.8
Mahalanobis Distance p-value: 9.93e-005 Top features for positive contribution
Prediction: Positive if the Bayesian score is above the estimated Fingerprint Bit/Smiles Feature Structure Score Carcinogen in
best cutoff value from minimizing the false positive and false training set
negative rate.
Probability: The esimated probability that the sample is in the FCFP_12 -1320007763 0.475 10 out of 14
positive category. This assumes that the Bayesian score follows
a normal distribution and is different from the prediction using a
cutoff.
Enrichment: An estimate of enrichment, that is, the increased
likelihood (versus random) of this sample being in the category.
Bayesian Score: The standard Laplacian-modified Bayesian
score.
Mahalanobis Distance: The Mahalanobis distance (MD) is the
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly
inaccurate.
FCFP_12 -387072142 0.436 7 out of 10
C27H21N2[?]
Actual Endpoint Irritant Irritant Irritant
Molecular Weight: 373.46903
Predicted Endpoint Non-Irritant Non-Irritant Irritant
ALogP: 6.545
Distance 0.624 0.684 0.685
Rotatable Bonds: 3
Reference 28ZPAK "Sbornik 85JCAE "Prehled 85JCAE "Prehled
Acceptors: 2 Vysledku Toxixologickeho Prumyslove Toxikologie; Prumyslove Toxikologie;
Donors: 1 Vysetreni Latek A Organicke Latky," Organicke Latky,"
Pripravku," Marhol d, J.V., Marhold, J., Prague , Marhold, J., Prague ,
Institut Pro Vychovu Czechoslovakia, Czechoslovakia,
Model Prediction Vedoucicn Pracovniku
Chemickeho Prumyclu
Avicenum, 1986
Volume(issue)/page/year:
Avicenum, 1986
Volume(issue)/page/year:
Prediction: Non-Irritant Praha, Cz echoslovakia, -,486,1986 -,536,1986
Probability: 0.96 1972
Volume(issue)/page/year:
Enrichment: 1.04 -,242,1
Bayesian Score: -1.54
Mahalanobis Distance: 6.7 Model Applicability
Mahalanobis Distance p-value: 0.999 Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
Prediction: Positive if the Bayesian score is above the estimated in the training set.
best cutoff value from minimizing the false positive and false
negative rate. 1. All properties and OPS components are within expected ranges.
Probability: The esimated probability that the sample is in the
positive category. This assumes that the Bayesian score follows
a normal distribution and is different from the prediction using a
cutoff. Feature Contribution
Enrichment: An estimate of enrichment, that is, the increased
likelihood (versus random) of this sample being in the category. Top Features for negative contribution
Bayesian Score: The standard Laplacian-modified Bayesian
score. Fingerprint Bit/Smiles Feature Structure Score Irritant in training
Mahalanobis Distance: The Mahalanobis distance (MD) is the set
distance to the center of the training data. The larger the MD, the
less trustworthy the prediction.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non-
normal X properties (e.g., fingerprints), the MD p-value is wildly
inaccurate.
FCFP_12 1069584379 -0.439 38 out of 65
FCFP_6 16 -0.354
FCFP_6 17 -0.149
Molecule TOPKAT_Chronic_LOAEL
Structural Similar Compounds
Name CLOTRIMAZOLE C.I.YELLOW 4 MIDAZOLAM.HCL
Structure
Feature Contribution
Top features for positive contribution
Fingerprint Bit/Smiles Feature Structure Score
ECFP_6 1559650422 0.129
FCFP_6 3 0.0924
Feature Contribution
Top features for positive contribution
Fingerprint Bit/Smiles Feature Structure Score
ECFP_6 642810091 0.148
FCFP_2 3 -0.198
Molecule TOPKAT_Rat_Inhalational_LC50
Structural Similar Compounds
Name 1H-Benzimidazole; 5- 1H-1;2;4-Triazole; 1- Propanamide; 2-(2-
chloro-6-(2;3- ((bis(4- naphthalenyloxy)-N-
dichlorophenoxy)-2- fluorophenyl)methylsilyl) phenyl-
(methylthio)- methyl)-
Structure
C27H21N2[?]
Molecular Weight: 373.46903 Actual Endpoint (-log C) 2.2548 1.5166 1.6396
ALogP: 6.545 Predicted Endpoint (-log 1.69815 1.21248 0.942087
Rotatable Bonds: 3 C)
Acceptors: 2 Distance 0.871 0.957 0.962
Donors: 1 Reference MDACAP Medicamentos NTIS** National Technical NNGADV Nippon Noyaku
de Actualidad. (J.R. Information Service. Gakkaishi. Journal of the
Prous; S.A.; Apartado de (Springfield; VA 22161) Pesticide Science Society
Model Prediction Correos 54 0; 08080
Barcelona; Spain) V.1-
Forme rly U.S.
Clearinghouse for
of Japan. (Nippon Noyaku
Gakkai; 1-43-11;
Prediction: 4.72e+004 1965- Scientific & Technical Komagome; Toshima-ku;
Unit: mg/m3/h Volume(issue)/page/year: Information. Tokyo 170; Japan) V.1-
21;227;1985 Volume(issue)/pag e/year: 1976-
Mahalanobis Distance: 14.9 OTS0543806 Volume(issue)/page/year:
12;563;1987
Mahalanobis Distance p-value: 1.35e-014
Mahalanobis Distance: The Mahalanobis distance (MD) is a
generalization of the Euclidean distance that accounts for
correlations among the X properties. It is calculated as the
Model Applicability
distance to the center of the training data. The larger the MD, the Unknown features are fingerprint features in the query molecule, but not found or appearing too infrequently
less trustworthy the prediction. in the training set.
Mahalanobis Distance p-value: The p-value gives the fraction of
training data with an MD greater than or equal to the one for the 1. Num_AromaticRings out of range. Value: 5. Training min, max, mean, SD: 0, 3, 0.57958, 0.795.
given sample, assuming normally distributed data. The smaller
the p-value, the less trustworthy the prediciton. For highly non- 2. OPS PC21 out of range. Value: -3.1422. Training min, max, SD, explained variance: -3.0247,
normal X properties (e.g., fingerprints), the MD p-value is wildly 4.4972, 1.058, 0.0155.
inaccurate.
3. Unknown ECFP_2 feature: -1305021906: [*]['?']
4. Unknown ECFP_2 feature: -428002189: [*]:[cH]:[c](:n:[*])[c](:[*]):[*]
5. Unknown ECFP_2 feature: -1734834311: ['?'][c](:[*]):[c](N):n:[*]
6. Unknown ECFP_2 feature: -1659020767: [*][c](:[*]):[c](['?']):[c]([*]):[*]
7. Unknown ECFP_2 feature: 432684389: ['?'][c](:[*]):[*]
8. Unknown ECFP_2 feature: -813997308: [*]:[c](:[*])[c](:[c](:[*]):[*]):[c](:[*]):[*]
Feature Contribution
Top features for positive contribution
Fingerprint Bit/Smiles Feature Structure Score
ECFP_2 642810091 0.214
FCFP_2 17 0.0441
FCFP_2 16 -0.0512
FCFP_2 0 -0.0314
Molecule TOPKAT_Rat_Maximum_Tolerated_Dose_Gavage
Structural Similar Compounds
Name PHENYLBUTAZONE PROMETHAZINE.HCL o-BENZYL-p-
CHLOROPHENOL
Structure
FCFP_2 0 -0.29
Molecule TOPKAT_Rat_Oral_LD50
Structural Similar Compounds
Name sym-DI-.beta.-NAPHTHYL- FENDOSAL ANTHRAQUINONE; 1;4-
p-PHENYLENEDIAMINE bis-(p-TOLYLAMINO)-
Structure
Feature Contribution
Top features for positive contribution
Fingerprint Bit/Smiles Feature Structure Score
ECFP_6 642810091 0.281