Professional Documents
Culture Documents
ETL Talend: Doing IN
ETL Talend: Doing IN
ETL IN
TALEND
Talend is a popular open-source data
integration and ETL (Extract, Transform,
Load) software platform used for connecting,
transforming, and managing data from
various sources to target destinations.
e c t i o n
Conn onent
Comp
Query can be
changed depending Column remaping & renaming
on condition
ADDING UNIQUE ID
A unique ID for each row can be useful to the database to calculate some
aggregation data. But the original data doesn’t provide a unique ID to each row
of the data. Therefore it would be a good idea to add a unique ID to the data.
This can be using tAddCRCRow component.
n u s e d f o r
Colum ting unique ID
calcula
DATA QUALITY
AND VALIDATION
After adding a unique ID, data quality and validation are done to the data by
using tSchemaComplianceCheck component. This component validates all input
rows against a reference schema or check types, nullability, length of rows
against reference values.
DATA
TRANSFORMATION
After the data quality and validation were checks out, we can proceed to data
transformation. We use tMap component to transform data in talend. The
transformation thats done to the data in this project were car_age,
profit_range, sales_year
tMap EXPRESSION
car_age
This column calculate the age of the car at the time of purchase
profit_range
Create buckets or categories for "Sale Price" ranges (e.g., low, medium, high)
to analyze sales distribution.