Download as ppt, pdf, or txt
Download as ppt, pdf, or txt
You are on page 1of 11

Data Warehouse and Data Mining …

DATA WAREHOUSE ..

• A Data warehouse is a system for storing and delivering


massive quantities of data .
•It is a centralized data that stores and provides already
transformed and summarized data, therefore, making it an
appropriate environment for more effective DSS( Decision
Support System ).
•It stores data that have been extracted from the various
external and other databases of an organization.
Characteristics of a Data Warehouse ..

1. SUBJECT-ORIENTED : data are organized according to


subject instead of application e.g., an insurance company
using a data warehouse would organize their data by
customer, premium and claim, instead of by different
products.
2. INTEGRATED : A data warehouse integrates data from
multiple data sources. For example, source A and source B
may have different ways of identifying a product, but in a
data warehouse, there will be only a single way of identifying
a product.
Characteristics of a Data Warehouse ..

3. TIME-VARIENT : historical data is kept in a data


warehouse.it contains a place for storing data that are 5
to 10 years old , these data are not updated .
4. NON-VOLATILE : data are not updated or changed in any
way once they enter the data warehouse, but are only
loaded and accessed . So, historical data in a data
warehouse should never be altered.
DEVELOPING DATA WAREHOUSE..

1. INSULATE OPERATIONAL INFORMATION : first phase


is to insulate current operational information from the data
in data warehouse.

2. RETRIEVE AND TRANSFORM DATA : next data is then


transformed and delivered to the data warehouse based on
the selection of relevant information.
DEVELOPING DATA WAREHOUSE..

3. CLEANSE DATA : creating an efficient data warehouse and


remove duplication of data collection and inconsistent
data.

4. TRANSFER DATA INTO DATA WAREHOUSE : once data


has been cleaned, it is than transferred to the data
warehouse for an organization to use its database
information and sub-divided into data-marts.
DATA MINING ..

• Data mining, the extration of hidden information patters


from large databases.

• This helps managers to make decision about strategic


changes in business operations to gain competetive
advantagesa in the market place.
• Data mining consists of five major elements:

• Extract, transform, and load transaction data onto the


data warehouse system.
• Store and manage the data in a multidimensional database
system.
• Provide data access to business analysts and information
technology professionals.
• Analyze the data by application software.
• Present the data in a useful format, such as a graph or
table.
What technological infrastructure is required?

• Size of the database: the more data being processed and


maintained, the more powerful the system required.

• Query complexity: the more complex the queries and the


greater the number of queries being processed, the more
powerful the system required.
APPLICATIONS OF DATA MINING..

• Companies use data mining to –

1. perform market basket analysis to identify new product


bundles.
2. Find root causes to quality or manufacturing problems.
3. Prevent customer attribution and acquire new customers.
4. Cross-sell to existing customers.
5. Profile customers with more accuracy.
THANK YOU ..!!

PRESENTED BY : Sneh {c-02}


Nidhi {c-05}
Rishi {c-30}
Anup {c-48}

You might also like