Professional Documents
Culture Documents
Etl Data Warehouse Tools
Etl Data Warehouse Tools
Etl Data Warehouse Tools
THEORY:
While working with databases, it is essential to
properly format and prepares data in order to load it
into data storage systems. ETL are three separate but
crucial functions combined into a single
programming tool that helps in preparing data and in
the management of databases.
Extract, Transform, Load each denotes a process in
the movement of data from its source to a data
storage system, often referred to as a data
warehouse.
1.)Extract: The extract function reads data from a
source database and extracts the desired subset of
data. The purpose of this step is to retrieve all the
required data from the source system with minimum
resources.
2.)Transform: This function filter cleanses and
prepares the extracted data using lookup tables or
rules or by creating combinations with other data
and converts it to the desired state. The transform
step includes validation of records, rejection of data
(if they are not acceptable) and data integration. The
commonly used processes for transformation are
conversion, sorting, filtering, clearing the duplicates,
standardizing, translating and looking up or verifying
the consistency of data sources.
3.)Load: The loading is the last stage of an ETL
process. The load function writes the resulting data,
i.e. the extracted and transformed data, (all of the
subset or just the changes) to a target data
repository.