Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 14

DATA MINING

DEFINITION
 Data mining, the extraction of hidden predictive
information from large databases, is a powerful
new technology with great potential to help
companies focus on the most important
information in their data warehouses.
EVOLUTIONARY STEP
 Data Collection (1960s)
 Data Access (1980s)
 Data Warehousing & Decision Support(1990s)
 Data Mining (Emerging Today)
GOALS OF DATA MINING

 Prediction
 How certain attributes within the data behave in future.
 Identification
 Data patterns used to identify the existence of an item, an event
or an activity.
 Classification
 Data partition to identify different classes or patterns based on
combination of parameters.
 Optimization
 Optimize the use of limited resources like time, space, money or
material and maximize sale & profit under given constraints.
PREDICTION

 What customers buy with discount.


 How much sale value a store generates in a given
period.
 Whether deleting a sale line yield more profit.
 Uses techniques like Regression, correlation etc.
IDENTIFICATION

 Intruders trying to break the computer system


may be identified by the program executed, files
accessed and CPU time per session.
 Existence of gene is identified by certain
sequence of nucleotide symbols present in the
DNA sequence.
 Authentication
CLASSIFICATION

 Customers can be identified as discount seekers,


shoppers in a rush, loyal regular customers,
shoppers attached to name brands etc.
 Classification can help in categorizing food as
health food, party food, school lunch food etc.
LEVELS OF ANALYSIS
 Regression
 Decision Trees
 Nearest Neighbor Classification
 Neural Networks
 Rule Induction –If - else - then
TECHNOLOGICAL INFRASTRUCTURE
REQUIRED
 Depends on
 Size of the database
 Query complexity

 Relational database storage


 extensive indexing capabilities
 Massively Parallel Processors (MPP)
APPLICATIONS OF DATA MINING

 Marketing
 Consumer behavior based on buying pattern
 Determination of market strategy
 Targeted mailing
 Advertisement campaigns
 Design of catalogue
 Store layout
APPLICATIONS OF DATA MINING CONT..
 Finance
 Analysis of trustworthiness of a client
 Segmentation of account receivables
 Performance analysis of financial investments like
stocks, bonds & mutual funds
 Evaluation of financing options
 Fraud detection
APPLICATIONS OF DATA MINING CONT..
 Manufacturing
 Optimization of resources like machine, manpower &
material
 Optimal design of manufacturing processes
 Shop floor layout
 Product design
 Packaging design
APPLICATIONS OF DATA MINING CONT..
 Healthcare
 Discovery pattern in radiological images
 Analysis of micro array to relate to diseases
 Analysis of side effects of drugs.
 Effectiveness of certain treatments.
 Optimizing processes within hospitals.
 Relating patients wellness data with doctor’s
qualification.
APPLICATIONS OF DATA MINING CONT..
 Web site personalization
 Credit card fraud detection
 SAS lie detector
 Market based analysis

You might also like