Professional Documents
Culture Documents
Managing Data: College of Accountancy and Business Administration
Managing Data: College of Accountancy and Business Administration
DATA RESOURCE
MANAGING DATA
Organizations win by making good decisions fast, and organizations cannot do so without a highquality
data resource.
Managing data as a resource has many general business dimensions, it is also important for the
costeffective development and operation of information systems. Poor systems development productivity is
frequently due to a lack of data management, and some methods, such as prototyping, cannot work unless
the source of data is clear and the data are available. Systems development time is greatly enhanced by
the reuse of data and pro- grams as new applications are designed and built. Unless data are cataloged,
named in standard ways, protected but accessible to those with a need to know, and maintained with high
quality, the data and the programs that capture and maintain them cannot be reused.
The methodology includes the steps that are followed to identify and describe organizational data entities,
and the notation is a way to show these findings, usually graphically. Managers must be integrally involved
in these methodologies to ensure that the data you need are planned for inclusion in organizational
databases and that the data captured and stored have the required business meaning. Several possible
methodologies are introduced in the following paragraphs, but the reader is referred to texts on database
management for a detailed discussion of data modeling notations.
Entity-Relationship Diagram - captures entities (i.e., customer, order, product) and their relationships (i.e.,
submits, includes).
- is the most commonly accepted notation for representing the data needs in an organization.
- consists of:
1. entities, or the things about which data are collected;
2. attributes, the actual elements of data that are to be collected;
3. relationships, the relevant associations between organizational entities.
- is only part of metadata, or data about data, needed to unambiguously describe data for the enterprise.
Entity-Relationship Diagram
Metadata - documents the meaning and all the business rules that govern data.
Example: About an attribute of customer name would define this term, state its properties such as
maximum length and the type of data (alphanumeric characters) that a value of this attribute might have,
whether every customer has to have a name to be stored in the database, whether the name can change in
value over time, whether there can be multiple instances of the name, and who has rights to enter and
change the name.
- These metadata rules come from the nature of the organization, so business
managers are typically the source of the knowledge to develop these rules.
- using the concept of data steward, to actively manage the metadata for each
subject area of the business. This allowed subtle differences in customer data to be recognized and
accepted and for data to be stored accurately.
2. Bottom-up or view integration – the data elements in each user view are identified and put into
a basic structure.
- After each user view has been normalized, they are all
combined (or integrated) into one comprehensive description. Ideally, this integrated set of
entities from normalization will match those from enterprise modeling.
User view - each report, computer screen, form, document, and so on to be produced from
organizational databases Normal form - basic structure
Normalization - the process of creating simple data structures from more complex ones
- consists of a set of rules that yields a data structure that is very stable
and useful across many different requirements.
- is used as a tool to rid data of troublesome anomalies associated with
inserting, deleting, and updating data.
Many organizations choose to use both approaches because they are complementary
methods that emphasize different aspects of data and, hence, check and balance each other.
A business manager often simply wants access to needed data and is not interested in
waiting for an entire data model to be built.
• Shared data - are those that are exchanged between different user groups, and hence there must be
agreements on the definition, format, and timing for exchange of these data among those sharing the
data.
- exists because of a dependency between different organizational units or functions. • Core
data - are those that require an organization-wide definition and sourcing (in other words, core data is
enterprise-shared data).
- there may be one copy of core data, but if there are multiple copies, the creation of
these copies are carefully planned and managed.
The concept of application independence suggests that different data processing applications can be
classified into three groups, based upon their role in managing data:
1. data capture - gather data and populate the database. They store and maintain data
- ach datum is captured once and fully tested for accuracy and
completeness.
2. data transfer and integration applications - move data from one database to another or otherwise
bring together data from various databases to meet some processing need.
- often called bridges or interfaces because they
connect related databases.
3. data analysis and presentation - provide data and information to authorized persons.
Almost all information systems and databases refer to common subject areas of data
Example: people,
things, places
often enhance common data with local data relevant to only that application or database.
Master data management (MDM) - refers to the disciplines, technologies, and methods to ensure the
currency, meaning, and quality of reference data within and across various subject areas (White and
Imhoff, 2006).
- ensures that everyone knows the current description of
product, the current salary of an employee, and the current billing address of a customer.
- does not address sharing transactional data, such as
customer purchases.
- supports all uses of data, from operational to business intelligence
- to be successful, an organization must create a strong data governance process, often
including data stewards.