Databases and Information Management

You might also like

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 2

System and Information Technology

Summary of Chapter 6: Databases and Information Management

6-1
 An effective information system provides users with accurate, timely, and
relevant information.
 File organization terms and concepts:
A computer system organizes data in a hierarchy that starts with bits (smallest
unit of data a computer can handle) and bytes (a group of bits) and progresses to
fields (grouping characters into a word), records (a group of related fields), files
(a group of record), and databases (a group of related files).
A record describes an entity, which is a person, place, thing, or event on which
we store and maintain information.
 The problems with the traditional file environment is that when the application
in the business required its own files and its own computer program to operate
and as this process goes for a long time, the organization is saddled with
hundreds of programs and applications that are very difficult to maintain and
manage.
 The resulting problems: Data redundancy, which is the presence of duplicate
data in multiple files. It wastes storage resources and leads to data
inconsistency; Program data dependence; Lack of flexibility; Poor security;
Lack of data sharing and availability.

6-2
Database is a collection of data organized to serve many applications
efficiently by centralizing the data and controlling redundant data.
 A database management system (DBMS) is software that permits an
organization to centralize data, manage them efficiently, and provide access to
the stored data by application programs. The DBMS relieves the programmer
or end user from the task of understanding where the logical view presents
data as end users or business specialists would perceive them, whereas the
physical view shows how data are actually organized and structured on
physical storage media.
 DBMS help to solve the problems of the traditional file environment by
reducing data redundancy and inconsistency, enabling data to stand on their
own, causing access and availability of information increase and program
development also maintenance costs reduced, and allowing data sharing
 DBMS includes capabilities and tools for organizing, managing, and accessing
the data in the database, which are data definition language, data dictionary,
and data manipulation language. The most prominent data manipulation is
Structured Query Language (SQL).
 The database requires both a conceptual design and a physical design.
 Database designers document their data model with an entity-relationship
diagram,
 Non-relational database management systems use a more flexible data model
and are designed for managing large data sets across many distributed
machines and for easily scaling up or down
6-3
To improve business performance and decision making is by understanding:
 The challenge of big data
 Business Intelligence Infrastructure:
⁃ Data warehouse: a database that stores current and historical data of
potential interest to decision makers throughout the company
⁃ A data mart is a subset of a data warehouse in which a summarized or
highly focused portion of the organization’s data is placed in a separate
database for a specific population of users.
⁃ Hadoop is an open source software framework hat enables distributed
parallel processing of huge amounts of data across inexpensive
computers.
⁃ In-memory processing makes it possible for very large sets of data,
amounting to the size of a data mart or small data warehouse, to reside
entirely in memory.
⁃ Analytic platforms using both relational and non-relational technology
that are optimized for analyzing large data sets.
 Analytical tools:
⁃ OLAP (online analytical processing): OLAP supports
multidimensional data analysis, enabling users to view the same data in
different ways using multiple dimensions.
⁃ Data mining provides insights into corporate data that cannot be
obtained with OLAP by finding hidden patterns and relationships in
large databases and inferring rules from them to predict future
behavior.
⁃ Text mining tools help businesses analyze large unstructured data sets
consisting of text.Web mining tools focus on analysis of useful patterns
and information from the Web

6-4
 In order to make sure that the data for your business remain accurate, reliable,
and readily, your business need special policies and procedures:
 Establishing an information policy to specify the organization rules for
information.
 Ensuring data quality: Analysis of data quality often begins with a data quality
audit. Data cleansing, also known as data scrubbing, consists of activities for
detecting and correcting data in a database that are incorrect, incomplete,
improperly formatted, or redundant.

You might also like