Intro 2
Ab Initio Software
• Constructing Applications
• Parallelism
• Data Partitioning
• Multifiles
3. Choose InputFile
Choose OutputFile
Enter expression
record
  string("|") node;
  string("|") timestamp;
  string("|") component;
  string("|") subcomponent;
  string("|") event_type;
  string("|\n") event_text;
end
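The record format above can be read as: five fields each terminated by "|", then a final field terminated by "|" followed by a newline. A minimal Python sketch of that reading follows. It is an illustration only, not how Ab Initio itself parses data; the names Event and parse_events and the sample input are hypothetical.

```python
from typing import List, NamedTuple


class Event(NamedTuple):
    node: str
    timestamp: str
    component: str
    subcomponent: str
    event_type: str
    event_text: str


# One delimiter per field, in declaration order (taken from the record format).
DELIMITERS = ["|", "|", "|", "|", "|", "|\n"]


def parse_events(data: str) -> List[Event]:
    """Read fields one after another, each up to its own delimiter."""
    events = []
    pos = 0
    while pos < len(data):
        fields = []
        for delim in DELIMITERS:
            end = data.find(delim, pos)
            if end == -1:
                raise ValueError(f"missing delimiter {delim!r} after offset {pos}")
            fields.append(data[pos:end])
            pos = end + len(delim)
        events.append(Event(*fields))
    return events


if __name__ == "__main__":
    # Hypothetical sample data, not taken from the course material.
    sample = "host1|09:00:01|graph|sort|start|Sort started|\n"
    for event in parse_events(sample):
        print(event)
```

Because the last field ends only at "|" plus newline, a "|" inside event_text that is not followed by a newline stays part of the text in this sketch.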
Sorts it by city.
• Open figure-06.
• Component parallelism
• Pipeline parallelism
• Data parallelism
Sorting Customers
Sorting Transactions
• Limitation:
• Scales only with the number of “branches” in a graph.
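Component parallelism can be sketched as two independent branches that run at the same time, such as sorting customers and sorting transactions. The Python sketch below assumes two small in-memory datasets; the function names and field names are hypothetical. CPython threads are used only to show the structure and will not necessarily speed up CPU-bound sorting.

```python
from concurrent.futures import ThreadPoolExecutor


def sort_customers(rows):
    # Branch 1: sort customer records by name (hypothetical field).
    return sorted(rows, key=lambda r: r["name"])


def sort_transactions(rows):
    # Branch 2: sort transaction records by amount (hypothetical field).
    return sorted(rows, key=lambda r: r["amount"])


def run_branches(customers, transactions):
    """Run the two independent branches concurrently and collect both results."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        customers_future = pool.submit(sort_customers, customers)
        transactions_future = pool.submit(sort_transactions, transactions)
        return customers_future.result(), transactions_future.result()


if __name__ == "__main__":
    sorted_customers, sorted_transactions = run_branches(
        [{"name": "Smith"}, {"name": "Black"}],
        [{"amount": 56}, {"amount": 7}],
    )
    print(sorted_customers)
    print(sorted_transactions)
```

With only two branches there are only two tasks to run side by side, which is the limitation noted above.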
Processing Record: 99
• Limitations:
• Scales only with the length of “branches” in a graph.
• Some operations, like sorting, do not pipeline.
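The difference between a step that pipelines and one that does not can be sketched with Python generators. This models only the record-at-a-time flow between steps; real pipeline parallelism runs the steps simultaneously, which single-threaded generators do not. All function names are hypothetical.

```python
from typing import Iterable, Iterator, List


def read_records(count: int, log: List[str]) -> Iterator[int]:
    # Upstream step: emits one record at a time.
    for i in range(count):
        log.append(f"read {i}")
        yield i


def reformat(records: Iterable[int], log: List[str]) -> Iterator[int]:
    # Pipelines: each record is passed on as soon as it arrives.
    for record in records:
        log.append(f"reformat {record}")
        yield record * 10


def sort_descending(records: Iterable[int], log: List[str]) -> Iterator[int]:
    # Does not pipeline: every input record must arrive before any output.
    buffered = sorted(records, reverse=True)
    log.append("sort done")
    yield from buffered


if __name__ == "__main__":
    streamed_log: List[str] = []
    list(reformat(read_records(2, streamed_log), streamed_log))
    print(streamed_log)  # reads and reformats alternate

    blocked_log: List[str] = []
    list(reformat(sort_descending(read_records(2, blocked_log), blocked_log), blocked_log))
    print(blocked_log)  # all reads happen before the sort releases anything
```

In the first run the downstream step starts work after the first record. In the second, the sort holds everything back until its input is complete.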
Partitions
Global View:
Expanded View:
Global View:
Degree of Parallelism
Fan-out Flow
• Open figure-04.
• Create a copy of the Reformat and the Simple-Out dataset (use Edit…Copy and Edit…Paste).
Degree of Parallelism
(Abstract)
mfile://host1/u/jo/mfs/dir2/b.dat
1. Drill into
multidirectory
2. Type in filename
• Open figure-04.
• Run the application and examine the results (use the “Partition”
option in View Data).
0345Smith  Bristol   56
0121Forth  Bristol    7     Bristol   63
0322Jones  Compton   12     Compton   12
0212Spade  London     8
0492West   London    23     London    31
0221Black  New York  42     New York  42
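The right-hand column of the sample data is the per-city total of the amounts on the left. A minimal Python sketch of that aggregation follows, using the rows shown above. The split of each row into an ID-and-name field, a city, and an amount is an assumption about the layout, and rollup_by_city is a hypothetical name.

```python
from typing import Dict, List, Tuple

# Rows from the sample data: (id and name, city, amount), already sorted by city.
RECORDS: List[Tuple[str, str, int]] = [
    ("0345Smith", "Bristol", 56),
    ("0121Forth", "Bristol", 7),
    ("0322Jones", "Compton", 12),
    ("0212Spade", "London", 8),
    ("0492West", "London", 23),
    ("0221Black", "New York", 42),
]


def rollup_by_city(records: List[Tuple[str, str, int]]) -> Dict[str, int]:
    """Sum the amount for each city, keeping cities in first-seen order."""
    totals: Dict[str, int] = {}
    for _customer, city, amount in records:
        totals[city] = totals.get(city, 0) + amount
    return totals


if __name__ == "__main__":
    for city, total in rollup_by_city(RECORDS).items():
        print(city, total)
```

The totals match the sample: Bristol 63, Compton 12, London 31, New York 42.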