Professional Documents
Culture Documents
Data Mining Present Paper
Data Mining Present Paper
Data Mining Present Paper
arajacs1983@yahoo.co.in
Abstract:
Although data mining is a relatively new term, the technology is not. Companies have
used powerful computers to sift through volumes of supermarket scanner data and
analyze market research reports for years. However, continuous innovations in
computer processing power, disk storage, and statistical software are dramatically
increasing the accuracy of analysis while driving down the cost.
DATA MINING:
Most companies already collect and refine massive quantities of data. Data
mining techniques can be implemented rapidly on existing software and hardware
platforms to enhance the value of existing information resources, and can be integrated
with new products and systems as they are brought on-line. When implemented on high
performance client/server or parallel processing computers, data mining tools can analyze
massive databases to deliver answers to questions such as, "Which clients are most likely
to respond to my next promotional mailing, and why?"
APPLICATIONS:
Data
Data are any facts, numbers, or text that can be processed by a computer. Today,
organizations are accumulating vast and growing amounts of data in different formats and
different databases. This includes:
• operational or transactional data such as, sales, cost, inventory, payroll, and
accounting
• nonoperational data, such as industry sales, forecast data, and macro economic
data
• meta data - data about the data itself, such as logical database design or data
dictionary definitions
Information
The patterns, associations, or relationships among all this data can provide
information. For example, analysis of retail point of sale transaction data can yield
information on which products are selling and when.
Knowledge
Information can be converted into knowledge about historical patterns and future
trends. For example, summary information on retail supermarket sales can be analyzed in
light of promotional efforts to provide knowledge of consumer buying behavior. Thus, a
manufacturer or retailer could determine which items are most susceptible to promotional
efforts.
Data Warehouses:
While large-scale information technology has been evolving separate transaction and
analytical systems, data mining provides the link between the two. Data mining software
analyzes relationships and patterns in stored transaction data based on open-ended user
queries. Several types of analytical software are available: statistical, machine learning,
and neural networks. Generally, any of four types of relationships are sought:
• Classes: Stored data is used to locate data in predetermined groups. For example,
a restaurant chain could mine customer purchase data to determine when
customers visit and what they typically order. This information could be used to
increase traffic by having daily specials.
• Sequential patterns: Data is mined to anticipate behavior patterns and trends. For
example, an outdoor equipment retailer could predict the likelihood of a backpack
being purchased based on a consumer's purchase of sleeping bags and hiking
shoes.
• Extract, transform, and load transaction data onto the data warehouse system.
• Rule induction: The extraction of useful if-then rules from data based on
statistical significance.
• Data visualization: The visual interpretation of complex relationships in
multidimensional data. Graphics tools are used to illustrate data relationships.
Today few data mining languages are commercially available in the market
like Microsoft’s SQL server 2005, IBM Intelligent Miner, AS Enterprises Miner, SGI
Minest, Clementine, DBMiner and money more but a standard data mining language or
other standardization efforts will provide the orderly development of data mining
solutions, improved interpretability among multiple data mining systems and functions.
Web Mining:
The massive growth of data from terabytes to perabytes is due to the wide
availability of data in automated form from various sources as WWW, Business, science,
Society and many more. But we are drowning in data but efficient of knowledge Data is
useless, if it cannot deliver knowledge. That is why data mining is gaining wide
acceptance in today’s world. A lot has been done in this field and lot more need to be
done.
CONCLUSION
REFERENCES
1. Data Mining Concepts and techniques –Jiawei Han & Michein Kamber.
2. Modern Data Warehousing, Mining and visualization Core Concepts by
George M.Marakas.