Abstract: Data, Information, and Knowledge
... mining provides the link between the two.

... is an example of associative mining.

1. Extract, transform, and load transaction data onto the data warehouse system.
2. Store and manage the data in a multidimensional database system.
3. Provide data access to business analysts and information technology professionals.
4. Analyze the data by application software.
5. Present the data in a useful format, such as a graph or table.

Different levels of analysis are available:

Artificial neural networks:

... rules for the classification of a dataset. Specific decision tree methods include Classification and Regression Trees (CART). CART and CHAID are decision tree techniques used for classification of a dataset. They provide a set of rules that you can apply to a new (unclassified) dataset to predict which records will have a given outcome.

Nearest neighbor method: A technique that classifies each record in a dataset based on a combination of the classes of the k record(s) most similar to it in a historical dataset. Sometimes called the k-nearest neighbor technique.
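
The nearest neighbor description above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the function name knn_classify, the use of Euclidean distance as the similarity measure, a simple majority vote as the "combination" of classes, and the toy historical dataset are all hypothetical choices made for the example and do not come from the original text.

```python
import math
from collections import Counter


def knn_classify(record, historical, k=3):
    """Classify `record` from the classes of its k most similar historical records.

    `historical` is a list of (features, label) pairs. Similarity is assumed
    to be Euclidean distance, and the classes are combined by majority vote.
    """
    # Keep the k historical records closest to the new record.
    neighbors = sorted(
        historical, key=lambda item: math.dist(record, item[0])
    )[:k]
    # Combine the neighbors' classes: the most frequent class wins.
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


# Hypothetical historical dataset: (monthly_visits, average_basket) -> outcome
historical = [
    ((1.0, 10.0), "no_purchase"),
    ((2.0, 12.0), "no_purchase"),
    ((1.5, 9.0), "no_purchase"),
    ((8.0, 40.0), "purchase"),
    ((9.0, 45.0), "purchase"),
    ((7.5, 38.0), "purchase"),
]

# Classify a new, unclassified record.
print(knn_classify((8.5, 42.0), historical, k=3))
```

In practice the features would usually be scaled to comparable ranges first, since an unscaled distance lets the largest-valued attribute dominate the choice of neighbors.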
Genetic algorithms: Optimization techniques that use processes such as genetic combination, mutation, and natural selection in a design based on the concepts of natural evolution.

Data visualization: The visual interpretation of complex relationships in multidimensional data. Graphics tools are used to illustrate data relationships.

Technological infrastructure required:
Today, data mining applications are available on systems of all sizes for mainframe, ... prices range from several thousand dollars for the smallest applications up to $1 million per terabyte for the largest. Enterprise-wide applications generally range in size from 10 gigabytes to over 11 terabytes. NCR has the capacity to deliver applications exceeding 100 terabytes.

There are two critical technological drivers:

The more data being processed and maintained, the more powerful the system ...

... Parallel Processors (MPP) to achieve order-of-magnitude improvements in query time.

ADVANTAGES OF DATA MINING

Marketing/Retailing

Data mining can aid direct marketers by providing them with useful and accurate trends about their customers' purchasing behavior. Based on these trends, marketers can direct their marketing attention to their ... can also benefit from data mining in similar ways. For example, through the trends ...

... access to their personal information and then ...

... successful applications of data mining.