Identifying Interesting Association Rules
Data mining
[Diagram: too much data → data mining → association rules]
Apriori Algorithm
TID     List of item IDs
T100    I1, I2, I3
T200    I2, I4
T300    I2, I3
T400    I1, I2, I4
T500    I1, I3
T600    I2, I3
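The level-wise search that Apriori performs on a table like the one above can be sketched as follows. This is a minimal illustration, not the slides' own implementation: the TIDs are read as T100 to T600, and the minimum support count of 2 and the function names `support_count` and `apriori` are assumptions made for the example.

```python
from itertools import combinations

# Transactions from the table above (TIDs read as T100..T600).
TRANSACTIONS = {
    "T100": {"I1", "I2", "I3"},
    "T200": {"I2", "I4"},
    "T300": {"I2", "I3"},
    "T400": {"I1", "I2", "I4"},
    "T500": {"I1", "I3"},
    "T600": {"I2", "I3"},
}


def support_count(itemset, transactions):
    """Count the transactions that contain every item of `itemset`."""
    return sum(1 for items in transactions.values() if itemset <= items)


def apriori(transactions, min_count):
    """Return {frequent itemset: support count}, built level by level."""
    items = sorted({i for t in transactions.values() for i in t})
    current = [frozenset([i]) for i in items]  # candidate 1-itemsets
    frequent = {}
    k = 1
    while current:
        counted = {c: support_count(c, transactions) for c in current}
        level = {c: n for c, n in counted.items() if n >= min_count}
        frequent.update(level)
        k += 1
        # Join step: union pairs of frequent (k-1)-itemsets into k-itemsets.
        candidates = {a | b for a, b in combinations(level, 2) if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        current = [
            c for c in candidates
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        ]
    return frequent


# min_count=2 is an assumed threshold, chosen only for illustration.
result = apriori(TRANSACTIONS, min_count=2)
for itemset, count in sorted(result.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

With this threshold the sketch keeps all four single items and four pairs; the only candidate triple that survives pruning, {I1, I2, I3}, occurs in a single transaction and is dropped.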
[Diagram: data → data mining → too many association rules]
Interestingness criteria
Comprehensibility.
Conciseness.
Diversity.
Generality.
Novelty.
Utility.
...
Interestingness measures
Subjective measures
The data and the user's prior knowledge are
considered.
Comprehensibility, novelty, surprisingness, utility.
Objective measures
The structure of an association rule is considered.
Conciseness, diversity, generality, peculiarity.
Example: Support
It represents the generality of a rule.
It counts the number of transactions containing both A
and B.
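As a small illustration of the support measure described above, the sketch below counts the transactions that contain both A and B. The transaction list reuses the item sets from the Apriori example; the function names and the relative-support variant (the count divided by the number of transactions) are assumptions added for the example.

```python
# Transactions as sets of item IDs (item sets from the Apriori example).
transactions = [
    {"I1", "I2", "I3"},
    {"I2", "I4"},
    {"I2", "I3"},
    {"I1", "I2", "I4"},
    {"I1", "I3"},
    {"I2", "I3"},
]


def support_count(a, b, transactions):
    """Number of transactions containing every item of both A and B."""
    both = set(a) | set(b)
    return sum(1 for t in transactions if both <= t)


def support(a, b, transactions):
    """Relative support: the count as a fraction of all transactions."""
    return support_count(a, b, transactions) / len(transactions)


print(support_count({"I2"}, {"I3"}, transactions))  # 3
print(support({"I2"}, {"I3"}, transactions))        # 0.5
```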
Drawbacks of objective measures
Database dependence
Problem
o Large number of disk I/O operations
Database independence
Genetic algorithm-based learning (ARMGA)
1. Initialize population
ARMGA Modeling
Given an association rule X → Y
Requirement
Conf(X → Y) > Supp(Y)
Aim is to maximise
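The requirement Conf(X → Y) > Supp(Y) can be checked directly from transaction data. The sketch below assumes the usual definitions, Supp as the fraction of transactions containing an itemset and Conf(X → Y) = Supp(X ∪ Y) / Supp(X); the transaction list and the helper names `supp`, `conf` and `meets_requirement` are illustrative and not taken from the slides. It covers only the requirement, not the quantity being maximised.

```python
from fractions import Fraction

# Illustrative transactions (item sets from the Apriori example).
TRANSACTIONS = [
    {"I1", "I2", "I3"},
    {"I2", "I4"},
    {"I2", "I3"},
    {"I1", "I2", "I4"},
    {"I1", "I3"},
    {"I2", "I3"},
]


def supp(itemset, transactions=TRANSACTIONS):
    """Fraction of transactions that contain every item of `itemset`."""
    hits = sum(1 for t in transactions if set(itemset) <= t)
    return Fraction(hits, len(transactions))


def conf(x, y, transactions=TRANSACTIONS):
    """Conf(X -> Y) = Supp(X u Y) / Supp(X); undefined when Supp(X) is 0."""
    return supp(set(x) | set(y), transactions) / supp(x, transactions)


def meets_requirement(x, y, transactions=TRANSACTIONS):
    """True when Conf(X -> Y) > Supp(Y)."""
    return conf(x, y, transactions) > supp(y, transactions)


print(meets_requirement({"I4"}, {"I2"}))  # True:  conf = 1,   supp(Y) = 5/6
print(meets_requirement({"I2"}, {"I3"}))  # False: conf = 3/5, supp(Y) = 2/3
```

Exact fractions are used so that borderline cases, where confidence equals Supp(Y), are not misjudged through floating-point rounding.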
ARMGA Encoding
Michigan Strategy
Given an association k-rule X → Y, where
ARMGA Operators
Select
Crossover
Mutation
ARMGA Operators-Select
Select(c, ps): Acts as a filter on the
chromosome
c: chromosome
ps: pre-specified probability
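One way to read Select(c, ps) as a filter is sketched below. The slide does not say how, or whether, the chromosome's fitness enters the decision, so this sketch makes the simplest assumption: a chromosome passes with probability ps, independent of its content. The `rng` parameter and the list-based chromosome layout are likewise assumptions for illustration.

```python
import random


def select(c, ps, rng=random):
    """Return True if chromosome `c` passes the filter.

    Assumption: `c` passes with probability `ps`, regardless of its
    content; a fitness-dependent test could be substituted here.
    """
    return rng.random() < ps


# Hypothetical population: each chromosome is a list of item indices.
population = [[1, 2, 3], [2, 4, 1], [3, 1, 4], [4, 2, 3]]
rng = random.Random(0)  # seeded so that runs are repeatable
survivors = [c for c in population if select(c, 0.5, rng)]
print(len(survivors), "of", len(population), "chromosomes kept")
```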
ARMGA Operators-Crossover
This operation uses a two-point strategy
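A generic two-point crossover is sketched below as an illustration of the strategy named above. It treats chromosomes as equal-length Python lists and picks the two cut points uniformly at random; both choices are assumptions, and the sketch does not repair children in which the same item ends up appearing twice.

```python
import random


def two_point_crossover(parent_a, parent_b, rng=random):
    """Swap the segment between two random cut points of two parents."""
    if len(parent_a) != len(parent_b):
        raise ValueError("parents must have the same length")
    n = len(parent_a)
    # Two distinct cut points with 0 <= i < j <= n.
    i, j = sorted(rng.sample(range(n + 1), 2))
    child_a = parent_a[:i] + parent_b[i:j] + parent_a[j:]
    child_b = parent_b[:i] + parent_a[i:j] + parent_b[j:]
    return child_a, child_b


# Hypothetical chromosomes, written as lists of item labels.
a = ["a1", "a2", "a3", "a4", "a5"]
b = ["b1", "b2", "b3", "b4", "b5"]
child_a, child_b = two_point_crossover(a, b, random.Random(1))
print(child_a)
print(child_b)
```

Each position of the two children holds exactly the two genes the parents had at that position, so no gene is lost; only the middle segment changes owner.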
ARMGA Operators-Mutate
ARMGA Initialization
ARMGA Algorithm
Visual C++
Use WEKA to produce interesting
association rules
Compare the results