Components Description

You might also like

Download as txt, pdf, or txt
Download as txt, pdf, or txt
You are on page 1of 1

Sort : Orders your data according to a key specifier.

Sort within Groups: Refines the sorting of data records already sorted according to
one key specifier: it sorts the records within the groups formed by the first sort
according to a second key specifier.
Partition by Key and Sort: Repartitions data records by key values and then sorts
the records within each partition.
Aggregate: Generates summary records for groups of input records. It maximizes
performance by keeping intermediate results in main memory.
Dedup Sorted: Separates one specified data record in each group of data records
from the rest of the records in the group.
Denormalize Sorted: Consolidates groups of related records in the input to a single
data record with a vector field for each group, and optionally computes summary
fields for each group. It's the inverse of Normalize.
Filter by Expression: Filters data records according to a specified DML expression.
FUSE: Applies transform to corresponding records from each input flow. The
transform is first applied to the first record on each flow, then to the second,
and so on. The result of the transform is sent out of the out port.
Join: Performs inner, outer, and semi-joins with multiple flows of data records.
It maximizes performance by loading input data records into main memory.
Match Sorted: Combines and performs transform operations on multiple flows of data
records.
Normalize: Generates multiple output data records from each input data record.
Normalize can separate a data record with a vector field into several individual
records, each containing one element of the vector.
Reformat: Changes the record format of your data by dropping fields or by using DML
expressions to add fields, combine fields, or modify the data.
Rollup: Generates data records that summarize groups of data records. Rollup in
Memory maximizes performance by keeping intermediate results in main memory.
SCAN: Generates a series of cumulative summary records--such as year-to-date
totals--for groups of data records. Scan Sorted requires grouped input.
Compress: Reduces the volume of data in a flow. This component is not available on
Windows NT/2000. To reverse the effect, use a GUnzip or Uncompress component.
Concatenate: Appends multiple flow partitions of data records one after another.
Gather: Combines data records from multiple flow partitions arbitrarily.
Interleave: Combines blocks of data records from multiple flow partitions in round-
robin fashion.
Merge : Combines data records from multiple flow partitions that have been sorted
according to the key specifier, and maintains the sort order.
Gather Logs: Collects the output from log ports of components for analysis of a
graph after execution.
Redefine Format : Copies data records from its input to its output without changing
the values. Use Redefine Format to change a record format or rename fields.
Replicate: Arbitrarily combines all the data records it receives into a single flow
and writes a copy of that flow to each of its output flows.

You might also like