Professional Documents
Culture Documents
Inbound Interface Design Template
Inbound Interface Design Template
Inbound Interface Design Template
Design
Project
<<Project Name>>
Project No:
<<Project Name>>
Source System:
<<Source System Name>>
Prepared by:
<<Author>>
Teradata Professional Services
REVISION LIST
Ver. #
0.01
Date
19-11-2010
Author
Description
Initial Draft
Page 2 of 13
<<Feed Name>> Inbound Interface Design
Table of Contents
DOCUMENT VERSION............................................................................................................................. 1
REVISION LIST......................................................................................................................................... 2
1
INTRODUCTION................................................................................................................................. 4
1.1
1.2
1.3
INTERFACE REQUIREMENTS..........................................................................................................5
2.1
IN SCOPE EXTRACTS........................................................................................................................................ 5
OVERVIEW....................................................................................................................................................... 6
GENERAL CONTACT INFORMATION..................................................................................................................... 6
FILE LOCATION INFORMATION............................................................................................................................ 6
FILE INTERFACE INFORMATION.......................................................................................................................... 7
EXTRACT DESIGN SCHEDULE............................................................................................................................ 7
APPENDICES................................................................................................................................... 13
5.1
5.2
QUESTION LOG.............................................................................................................................................. 13
SOURCE RCIS EXTRACT RUN BOOK............................................................................................................... 13
Page 3 of 13
<<Feed Name>> Inbound Interface Design
INTRODUCTION
1.1
The Enterprise Data Warehouse (EDW) will ultimately provide <<Client Name>> with a single integrated database
to support both enterprise and business unit analytics and reporting across all lines of business, while phasing out
or consolidating areas of redundancy.
EDWs Data Acquisition Layer refers to the area that resides between Source and Data Integration Layers, and is
used as a collection area for source data. This layer is off-limits to EDWs Information Consumers as source data is
not yet integrated, and the underlying supporting structures are not designed for query performance.
The objective of the Data Acquisition Activity is to design, build, and implement the data acquisition process to
transfer data from source feeds and applications to the EDW Data Acquisition Layer.
The purpose of this IID is to document the detailed description for the transport of RCIS data to the EDW
Acquisition Layers file system.
1.2
The purpose of the IID is to define, with the highest level of detail possible and all information available, the extract
files that will be provided to the DI stream developer.
1.3
Intended Audience
Title
Mapping team
Development
CUT team
Signature
Date
Page 4 of 13
<<Feed Name>> Inbound Interface Design
2
2.1
INTERFACE REQUIREMENTS
In Scope Extracts
Extract Criteria
DailyChanges
Full Snapshot
Full Snapshot
Frequency
Daily
Daily
Daily
Volume(MB)
4,356
<1
<1
Page 5 of 13
<<Feed Name>> Inbound Interface Design
# of
Attributes
37
4
3
3.1
Overview
3.2
Contact
Source Owner
IT BI/DW SME
Source System Business Contact Name & Telephone #
Source System Name
SSDA Development Representative
3.3
Information
Data Governance
Open Ticket with IBM
Location
Production or Development Files
Internal or External Source
Source ID
Information
Development
Internal
Data Governance To Supply
DEV:
QA:
PROD:
Flat Files
edw_rcis_ec_account_yyyymmddhhmiss.dat
edw_rcis_ec_account_yyyymmddhhmiss.ctl
edw_rcis_ec_acct_role_yyyymmddhhmiss.dat
edw_rcis_ec_acct_role_yyyymmddhhmiss.ctl
edw_rcis_ec_bill_cycle_yyyymmddhhmiss.dat
edw_rcis_ec_bill_cycle_yyyymmddhhmiss.ctl
\\Rsoesndeveci\IDE_FTP_Shared\DataProfiling\RCIS
Page 6 of 13
<<Feed Name>> Inbound Interface Design
3.4
Interface
Source Type (e.g. DB, File)
Type of Transmission (e.g. FTP, OBDC)
Character Code (ASCII, Binary, Hex)
File format (fixed, variable, delimited)
Extract file archiving
Extract file encrypting
3.5
Information
File
Delivered via Control M job running UNIX shell scripts
ASCII
Pipe Delimited
Yes
No
Definition around when the extract will arrive in the development area.
Note: Profile and Unit Test Dates apply to all files below
Source File/Table Name
Profile date
EC_ACCOUNT
22 Mar 2011
EC_ACCT_ROLE
22 Mar 2011
EC_BILL_CYCLE
22 Mar 2011
Masking Date
Page 7 of 13
<<Feed Name>> Inbound Interface Design
4
4.1
Detail
The detail records will contain the actual data that is to be loaded. This data will be in a pre-defined format as
shown in this document.
Control file
ETM data files must have an accompanying control file with the information to perform audit and reconciliation of
the data file. The control file name should match the name of the data file with a ctl file extension and it is piped
delimited. Contents of the required control file should contain the following:
Field
SRC_FL
SRC_FL_GEN_DTTM
Format
File name convention:
edw_<system>_<table_name>_yyyymmddhhmiss.dat
Sample file name:
edw_rcis_ec_cust_name_20110218114257.dat
DATETIME: MMDDYYYY:HH:MI:SS
SRC_SYS_ID
SRC_COL_SUM
NAME_ID
SRC_COL_POS
2
SRC_REC_CNT
NUMBER
SRC_COL_SUM_VAL
NUMBER(30,2)
Description
Name of source file
Example
For example edw_rcis_ec_account.ctl contains
edw_rcis_ec_account.dat|04302010:14:04:05|RCIS|PRIMARY_ACCT_ID|6|1413415|220147867957.00
The list of control files and what is expected in each field in each file are listed on the next two pages.
Page 8 of 13
<<Feed Name>> Inbound Interface Design
Table
SRC_FL
EC_ACCOUNT
edw_rcis_ec_account_yyyymmddhhmiss.dat
EC_ACCT_ROLE
edw_rcis_ec_acct_role_yyyymmddhhmiss.dat
EC_BILL_CYCLE
edw_rcis_ec_bill_cycle_yyyymmddhhmiss.dat
SRC_FL_GE
N_DTTM
MMDDYYYY
:HH:MI:SS
MMDDYYYY
:HH:MI:SS
MMDDYYYY
:HH:MI:SS
SRC_
SYS_
ID
SRC_COL_SUM
SRC _
COL_
POS
SRC_REC_CN
T
SRC_COL_SUM_
VAL
<checksum total>
RCIS
ACCT_ID
<record count>
RCIS
NONE
<record count>
RCIS
BILL_CYCLE
<record count>
Page 9 of 13
<<Feed Name>> Inbound Interface Design
0
<checksum total>
4.2
Value
RCIS.06.04
RCIS_CICSK
TBD
edw_rcis_ec_account_yyyymmddhhmiss.dat
Details of the layout of this record type are in the table below.
Field Name
Field Type
ACCT_ID
NUMBER(TBD,TBD)
NAME_ID
NUMBER(TBD,TBD)
SRC_CD
VARCHAR2(20)
SYSTEM_SRC_CD
VARCHAR2(20)
SRC_ACCT_KEY
VARCHAR2(60)
PRIMARY_ACCT_ID
NUMBER(TBD,TBD)
CONSOL_ACCT_ID
NUMBER(TBD,TBD)
SRC_ACCT_EFFECTIVE_DT
DATE
SRC_ACCT_EXPIRY_DT
DATE
BILL_CYCLE
NUMBER(3,0)
BILL_LANGUAGE
VARCHAR2(3)
SRC_ACCT_TYPE
VARCHAR2(5)
SRC_ACCT_SUB_TYPE
VARCHAR2(5)
SRC_ACCT_BUS_TYPE
VARCHAR2(5)
PARTY_TYPE
CHAR(1)
SRC_ACCT_STATUS
CHAR(1)
ACCT_STATUS
VARCHAR2(10)
CONVERSION_TYP
VARCHAR2(8)
CREATE_DT
DATE
UPDATE_DT
DATE
UPDATE_USER_ID
VARCHAR2(20)
SRC_BILLING_ENTITY
BT_ACCT_NUMBER
BT_CONV_STATUS
BT_PLAN_CONV_DT
BT_ACTUAL_CONV_DT
MIG_DEFER_REASON_DESC
SRC_CBP_NUMBER
VARCHAR2(4)
NUMBER(TBD,TBD)
CHAR(1)
DATE
DATE
VARCHAR2(80)
NUMBER(TBD,TBD)
PK
Y
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
Unique
Y
N
N
Y
Y
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
Nulls
N
N
N
Y
Y
Y
Y
N
Y
Y
Y
Y
Y
Y
Y
Y
Y
Y
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
N
Y
Y
Y
Y
Y
Y
Y
Page 10 of 13
<<Feed Name>> Inbound Interface Design
4.3
File Format
This section outlines the characteristics of the extract file that are operating system dependant.
Character set
Record Terminator
Column Separator
The column separator used for the project will be the pipe character (|). This is ASCII character x7C. This
character will be removed from the body of any field.
Null Fields
Fields that are null values in the source will be treated the same as fields that are empty in the source. Any such
fields will be sent as column separators with no value between. To demonstrate, the fifth field in the example below
will be treated as null in the target.
001|12345233|HP|2007-12-14 08:34:32||YES|The quick brown fox jumps over the lazy dog.
Key fields that have invalid or null values will result in the record being rejected by the load program. Other fields
that have invalid values will be sent to the target replaced with null by default. The mapping document will describe
any actions other than the default behaviour. The extract should prevent these values from appearing.
See the table below for a description of data types and the expected format.
Data Type
Numeric
Date
Timestamp
Text fields
Format
Scientific notation is not to be sent. Expand the data to decimal format.
Negative numbers are to be proceeded with a - sign, not enclosed in brackets.
Numbers may be zero left padded, but this is not required.
MM/DD/YYYY
MM/DD/YYYY HH:MI:SS
ASCII characters x20 through x7E.
Fields are not to be enclosed by quotation marks.
Leading, trailing and multiple embedded blanks may be trimmed by the load
program.
No embedded record termination characters or column separator characters.
If a particular file has no data to contribute at extract time, the file will still be produced with a control file indication 0
record counts. The load programs are quite happy to process with no data.
Page 11 of 13
<<Feed Name>> Inbound Interface Design
Automation of Extract
Duplicate Rows
Source system extracts are requested to remove all primary key duplicates.
Data Masking
Data masking to remove any personally identifiable information will be performed by the extract programs. Masking
will have sufficient intelligence to maintain any relationships between tables for which the masked field is the foreign
key.
History
TBD
Delta Mechanism
Extracts will be date driven using some control mechanism. Each record type should be individually controlled in
terms of the time period of rows to be extracted. This mechanism will allow the extract to reach back further in time
than the normal processing period if required when in production. Interim changes to the rows will not be captured,
only the state of the row at extract time will be sent.
Data Consolidation
As much as possible, the data in the extract will be combined data from multiple tables in the source to the benefit
of the load programs. If it makes sense to combine columns in a where clause in the source, perform the join. If a
filter will reduce the number of rows, use the filter. If data combination results in additional logic necessary in the
load, leave the data in the format of the source. These principals will be reflected in the mapping document.
Each record type will hold the all the data it requires to load the target table. No look ups between data in different
record types will be performed by the load programs.
Source
Load Order
Record Type
Dependency
To get Foreign Keys information along with Source System Tables DDLs.
Page 12 of 13
<<Feed Name>> Inbound Interface Design
5
5.1
APPENDICES
Question Log
Question
5.2
Whom?
Date
The link below is to the Run book for the sources that exist as production feeds as of 3/16/11.
<<Link to External ops / scheduling documents>>
Page 13 of 13
<<Feed Name>> Inbound Interface Design
Closed