Download as pdf or txt
Download as pdf or txt
You are on page 1of 1

Association Rule Mining for the QSAR Problem

Langdon, W. B. & Barrett, S. J. (2004). Genetic Closure Operator: Let S be a set and c: ℘(S) →
Programming in Data Mining for Drug Discovery. ℘(S); c is a closure operator on S if ∀ X, Y ⊆ S, c
Evolutionary Computing in Data Mining, Springer, satisfies the following properties:
2004, Ashish Ghosh and Lakhmi C. Jain, 163, Studies
1. extension, X ⊆ c(X);
in Fuzziness and Soft Computing, 10, ISBN 3-540-
22370-3, pp. 211--235. 2. mononicity, if X⊆Y, then c(X) ⊆ c(Y);
Neagu, C.D., Benfenati, E., Gini, G., Mazzatorta, P., 3. idempotency, c(c(X)) = c(X).
Roncaglioni, A., (2002). Neuro-Fuzzy Knowledge
Representation for Toxicity Prediction of Organic Note: s◦t and t◦s are closure operators, when s and
Compounds. Proceedings of the 15th European Con- t are the mappings in a Galois connection.
ference on Artificial Intelligence, Frank van Harmelen Concept: The Galois connection of the (T, I, D)
(Ed.):, ECAI’2002, Lyon, France, July 2002. IOS Press context, a concept is a pair (X, Y), X⊆ T, Y⊆ I, that
2002: pp. 498-502. satisfies s(X)=Y and t(Y)=X. X is called the extent and
Pasquier, N., Bastide, Y., Taouil, R., & Lakhal, L. Y the intent of the concept (X,Y).
(1999, January). Discovering frequent closed itemsets Context: A triple (T, I, D) where T and I are sets
for association rules. Database Theory International and D ⊆T×I. The elements of T are called objects and
Conference, ICDT’99, Jerusalem, Israel, 398-416. the elements of I are called attributes. For any t ∈T
Pei, J., Han, J., & Mao, R. (2000, May). CLOSET: An and i ∈ I, we note tDi when t is related to i, i.e. ( t, i)
efficient algorithm for mining frequent closed itemsets. ∈ D.
Data Mining and Knowledge Discovery Conference, Frequent Itemset: Itemset with support higher than
DMKD 2000, Dallas, Texas, 11-20. a predefined threshold, denoted minsup.
Wang, Z., Durst, G., Eberhart, R., Boyd, D., & Ben- Galois Connection: Let (T, I, D) be a context.
Miled, Z., (2004). Particle Swarm Optimization and Then the mappings
Neural Network Application for QSAR. Proceedings
of the 18th International Parallel and Distributed s: ℘(T)→ ℘(I), s(X) = { i∈ I | (∀t ∈X) tDi }
Processing Symposium (IPDPS 2004), 26-30 April t: ℘(I)→ ℘(T), s(Y) = { t∈ T | (∀i ∈Y) tDi }
2004, Santa Fe, New Mexico, USA. IEEE Computer
Society 2004, ISBN 0-7695-2132-0. define a Galois connection between ℘(T) and ℘(I),
the power sets of T and I, respectively.
Zaki, M. J., & Ogihara, M. (1998, June). Theoretical
foundations of association rules. In 3rd Research Issues Itemset: Set of items in a Boolean database D,
in Data Mining and Knowledge Discovery ACM SIG- I={i1, i2, … in}.
MOD Workshop, DMKD’98, Seattle, Washington. Itemset Support: The ratio between the number of
Zaki, M. J., & Hsiao, C. J. (1999). CHARM: An Ef- transactions in D comprising all the items in I and the
ficient Algorithm for Closed Association Rule Mining, total number of transactions in D (support(I) = |{Ti∈D|
Technical Report 99-10, Department of Computer (∀ij∈I) ij∈ Ti }| / |D|).
Science, Rensselaer Polytechnic Institute. Pseudo-Intent: The set X is a pseudo-intent if X ≠
c(X), where c is a closure operator, and for all pseudo-
intents Q⊂ X, c(Q) ⊆X.

KEY TERMS

Association Rule: Pair of frequent itemsets (A,


B), where the ratio between the support of A∪B and
A itemsets is greater than a predefined threshold, de-
noted minconf.



You might also like