Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

INFOT 1: INTEGRATIVE PROGRAMMING AND TECHNOLOGIES

UNIVERSITY OF ANTIQUE
LIBERTAD CAMPUS MODULE 5: DATA MAPPING AND EXCHANGE

CHAPTER OBJECTIVES:

1. Learn data representation and exchange techniques and their appropriate use.
2. Construct and understand an XML document and Parse an XML document.
3. Use XML schemas and DTDs to construct data structure, constraints and type enforcement.
4. Use XSL for transformations and formatting.
5. Exchange data among data sources where data structure and/or model is not the same.

INTRODUCTION

Data mapping is the life blood of any data integration process. Without a proper data mapping strategy, data
transformation and filtration errors can occur that can lead to poor quality data. This directly impacts business
analysis, forecasting and business decision making. Therefore, it is crucial to maintain integrity throughout
the data mapping process.

Enterprise data is getting more dispersed and voluminous by the day, and at the same time, it has become
more important than ever for businesses to leverage data and transform it into actionable insights. However,
enterprises today collect information from an array of data points, and they may not always speak the same
language. So, data mapping process is used to integrate all the disparate data sources and make sense of
them.

In summary, data mapping is the process of establishing relationships between separate data models from
disparate sources or systems.

MOTIVATION

Take a good look on the image above. Think about what it might mean and how it is related to our lessons.
Keep those observations and ideas as we discuss the different topics on this chapter and see if what you have
thought was either related to the lessons or if it’s not what you thought it was.

INCULCATING CONCEPTS

What is Data Mapping?


Data mapping is the process of extracting data fields from one or multiple source files and matching
them to their related target fields in the destination.

BSINFOTECH 2A, 2B & 2C ARIANNE MAE A.GALLARDO, MIT 1|Page


A.Y. 2022-2023 FIRST SEMESTER INSTRUCTOR I
INFOT 1: INTEGRATIVE PROGRAMMING AND TECHNOLOGIES
UNIVERSITY OF ANTIQUE
LIBERTAD CAMPUS MODULE 5: DATA MAPPING AND EXCHANGE

Data mapping means that different data sets, with varying ways of defining similar points, can be
combined in a way that makes it accurate and usable at the end destination.
Data mapping is used to accomplish a range of data integration and transformation tasks depending
on the data management needs of an enterprise and the capabilities of data conversion mapping
software it uses. Some common known use cases of mapping business data include database schema
mapping for pre-integration, data cleansing from disparate data stores, and data conversion from
legacy systems.
Data mapping is crucial to the success of many data processes. One misstep in data mapping can
ripple throughout your organization, leading to replicated errors, and ultimately, to inaccurate
analysis.

Data mapping tools help easily map data from the source to the destination through a GUI. These
database mapping tools can be classified into three broad types:

 On-Premise data mapping tools:


Such tools are hosted on a company’s server and native computing infrastructure. Many on-premise
database mapping tools eliminate the need for hand-coding to create complex mappings, and automate
repetitive tasks in the data conversion mapping process.

 Cloud-Based data mapping tools:


Cloud based data mapping tools allow legacy to modern and on-premise to cloud data integration using
a cloud-based integration platform. In summary, these tools leverage cloud technology to help a business
perform its data binding projects.

 Open-Source data mapping tools:


Open-source business mapping tools provide a low-cost alternative to on-premise data mapping
solutions. These tools work better for small businesses with lower data volumes and simpler use-cases.

Why is data mapping essential?


Data mapping is essential for any company that processes data. It’s mainly used to integrate data,
build data warehouses, transform data, or migrate data from one place to another. The process of
matching data to a schema is a fundamental part of the flow of data through any organization.
Data mapping is the key to good data management. Unmapped or poorly mapped data will cause
issues as data flows to different endpoints within an organization. Mapping is the first step to getting
the most out of your data when it reaches integrations, transformations, and when it is stored for
future use.

What are the Steps of Data Mapping?


Step 1: Define — Define the data to be moved, including the tables, the fields within each table, and the
format of the field after it’s moved. For data integrations, the frequency of data transfer is also defined.

Step 2: Map the Data — Match source fields to destination fields.

Step 3: Transformation — If a field requires transformation, the transformation formula or rule is coded.

Step 4: Test — Using a test system and sample data from the source, run the transfer to see how it works
and make adjustments as necessary.

BSINFOTECH 2A, 2B & 2C ARIANNE MAE A.GALLARDO, MIT 2|Page


A.Y. 2022-2023 FIRST SEMESTER INSTRUCTOR I
INFOT 1: INTEGRATIVE PROGRAMMING AND TECHNOLOGIES
UNIVERSITY OF ANTIQUE
LIBERTAD CAMPUS MODULE 5: DATA MAPPING AND EXCHANGE

Step 5: Deploy — Once it’s determined that the data transformation is working as planned, schedule a
migration or integration go-live event.

Step 6: Maintain and Update — For ongoing data integration, the data map is a living entity that will
require updates and changes as new data sources are added, as data sources change, or as requirements at
the destination change.

XML DTD and XML Schema

How does an XML processor check your xml document?


XML – Extensible Markup Language
There are two main checks that XML processors make: DTD – Document Type Definition
XSD – XML Schema Definition
1. Checking that your document is well formed (Syntax
rule)
2. Checking that it’s valid (syntax-check your XML either in XML DTD or XSD)

Why need XML Validator?

Use our XML validator to syntax-check your XML.


Errors in XML documents will stop your XML applications unlike HTML browser.

XML DTD

An XML document with correct syntax is called “Well Formed”.


An XML document validated against a DTD is “Well Formed” and “Valid”.

How you add a DTD to our XML document?

1. DTDs can be separate documents (or)


2. They can be built into an XML document using a special element named <!DOCTYPE>.

An XML Document with a DTD (example1.xml)

<?xml version=”1.0” encoding=”UTF-8”?>


<?xml-stylesheet type=”text/css” href=”css1.cs”?>
<!DOCTYPE document
[ <!ELEMENT document (heading, message)>
<!ELEMENT message (#PCDATA)>
]>
<document>
<heading>Hello From XML </heading>
<message> This is an XML document! </message>
</document>

Valid XML Document with DTD (example2.xml)

The DOCTYPE declaration is a reference to an external DTD file “Note.dtd”


<?xml version=”1.0” encoding=”UTF-8”?>

BSINFOTECH 2A, 2B & 2C ARIANNE MAE A.GALLARDO, MIT 3|Page


A.Y. 2022-2023 FIRST SEMESTER INSTRUCTOR I
INFOT 1: INTEGRATIVE PROGRAMMING AND TECHNOLOGIES
UNIVERSITY OF ANTIQUE
LIBERTAD CAMPUS MODULE 5: DATA MAPPING AND EXCHANGE

<!DOCTYPE note SYSTEM “Note.dtd”>


<note>
<to>Tove</to>
<from>Jani</from>
<heading>Reminder</heading>
<body>Don’t forget me this weekend!</body>
</note>
Note.dtd
<!DOCTYPE note
[
<!ELEMENT note (to,from,heading,body)>
<!ELEMENT to (#PCDATA)>
<!ELEMENT from (#PCDATA)>
<!ELEMENT heading (#PCDATA)>
<!ELEMENT body (#PCDATA)>
]>

The DTD above is interpreted like this:

!DOCTYPE note defines that the root element of the document is note.
!ELEMENT note defines that the note element contains four elements: “to, from, heading, body”
!ELEMENT to defines the to element to be of type “#PCDATA”
!ELEMENT from defines the from element to be of type “#PCDATA”
!ELEMENT heading defines the heading element to be of type “#PCDATA”
!ELEMENT body defines the body element to be of type “#PCDATA”

Note:
#PCDATA means parse-able text data.

When NOT to use a Document Definition?

When you are working with small XML files, creating document definitions may be a waste of time.

XML Schema

Another way of validating XML documents: using XML schemas.


The XML Schema language is also referred to as XML Schema Definition (XSD), describes the
structure of an XML document.
Defines the legal building blocks (elements and attributes) of an XML documents like DTD.
Defines which elements are child elements.
Defines the number and order of child elements.
Defines whether an element is empty or can include text.
Defines data types for elements and attributes.
Defines default and fixed values for elements and attributes.

BSINFOTECH 2A, 2B & 2C ARIANNE MAE A.GALLARDO, MIT 4|Page


A.Y. 2022-2023 FIRST SEMESTER INSTRUCTOR I
INFOT 1: INTEGRATIVE PROGRAMMING AND TECHNOLOGIES
UNIVERSITY OF ANTIQUE
LIBERTAD CAMPUS MODULE 5: DATA MAPPING AND EXCHANGE

XML Schemas will be used in most Web applications as a replacement for DTDs.
Here are some reasons:

XML Schemas are extensible to future additions.


XML Schemas are richer and more powerful than DTDs.
XML Schemas are written in XML.
XML Schemas support data types and namespaces.

Creating XML Schemas by Using XML Schema-Creation Tools

HiT Software
xmlArchitect
XMLspy
XML Ray
Microsoft Visual Studio .NET

XSD Simple Element

The syntax for defining a simple element


Default and Fixed Values for Simple Elements

XSD Attributes

The syntax for defining an attribute


Default and Fixed Values for Attributes
Optional and Required Attributes

XSD Complex Elements

How to Define a Complex Element using XML Scheme


XSD Empty Elements

XSD Indicators
Order indicators are:

All
Choice
Sequence

ENRICHMENT

Sathish, K. (2012), Basic Integrative Programming Technologies: Data Integration Technology/ Architecture,
Lambert Academic Publishing

https://www.slideshare.net/vijipriyacse/ipt-chapter-5

https://www.tutorialspoint.com/java/java_inheritance.htm

https://www.tutorialspoint.com/java/java_polymorphism.htm

https://www.tutorialspoint.com/java/java_interfaces.htm

BSINFOTECH 2A, 2B & 2C ARIANNE MAE A.GALLARDO, MIT 5|Page


A.Y. 2022-2023 FIRST SEMESTER INSTRUCTOR I

You might also like