Unit 3

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 28

Unit 3

Syllabus
• Managing Data Resources- The need for data management,
Challenges of data management, Data independence, Data
redundancy, Data consistency, Data administration. Database
Management System – Concepts and types of DBMS, Fields, Records,
Table, View, Reports and Queries. Datawarehouse and Data mining –
Characteristics and uses of Data warehouse, Techniques of Data
Mining, Business Intelligence
•Database
•A database is a collection of related data which represents
some aspect of the real world. A database system is designed to
be built and populated with data for a certain task.

• Data Management
•Data management involves collecting, storing, organizing,
protecting, verifying, and processing essential data and making
it available to an organization.
Need for data management

 Creation of a database.
 Retrieval of information from the database.
 Updating the database.
 Managing a database.
Following are the need of data management of any
organization

1. Increase productivity
2. Smooth operations
3. Reduce security risk
4. Cost effective
5. Minimal chance of data loss
6. Better decision making
Challenges of data management

• Accuracy
• Validity
• Reliability
• Timeliness
• Relevance
• Completeness
Data independence

•Data independence is the type of data transparency.

•Data independence is a database management system (DBMS) characteristic that lets programmers modify
information definitions and organization without affecting the programs or applications that use it. Such
property allows various users to access and process the same data for different purposes, regardless of changes
made to it.
•A database containing patient information, for example, could serve various purposes. A hospital’s billing
department can use the data to obtain patients’ charges, discounts, and insurance details. On the other hand, the
food services department would need the same data to see the patients’ nutritional requirements. How each
department uses the data should not affect the stored information regardless of the changes it undergoes, such as
where the patient details are stored or how they are labelled.
Data redundancy

•Data redundancy occurs when the same piece of data is stored in two or more separate places.

• Suppose you create a database to store sales records, and in the records for each sale, you enter the customer
address. Yet, you have multiple sales to the same customer so the same address is entered multiple times. The
address that is repeatedly entered is redundant data.
Data consistency

•In a single-user database, the user can modify data in the database without concern for other users modifying
the same data at the same time. However, in a multiuser database, the statements within multiple simultaneous
transactions can update the same data. Transactions executing at the same time need to produce meaningful and
consistent results.
 Data concurrency means that many users can access data at the same time.
 Data consistency means that each user sees a consistent view of the data, including visible changes made by
the user's own transactions and transactions of other users.
Data administration

•Data administration allows an organization to control its data assets, as well as their processing and interactions
with different applications and business processes. Data administration ensures that the entire life cycle of data
use and processing is on par with the enterprise’s objective.

•Since Data administration is a high-level function that is responsible for the overall management of data
resources in an organization, including:

 Data definition, policies, procedures and standards


 Data conflict resolution
 Database planning, analysis, design, implementation, and maintenance
 Data protection
 Data performance assurance
 User training, education, and consulting support
 Technical issues of database security, performance, backup and recovery
Database Management System

•Database management system is a software which is


used to manage the database. For example: MySQL,
Oracle, etc
•It is technology of storing and retrieving user’s data
with most efficiency along with appropriate security
measures.
Advantages of DBMS

o Controls database redundancy: It can control data redundancy because it stores all
the data in one single database file and that recorded data is placed in the database.
o Data sharing: In DBMS, the authorized users of an organization can share the data
among multiple users.
o Easily Maintenance: It can be easily maintainable due to the centralized nature of
the database system.
o Reduce time: It reduces development time and maintenance need.
o Backup: It provides backup and recovery subsystems which create automatic backup
of data from hardware and software failures and restores the data if required.
o Multiple user interface: It provides different types of user interfaces like graphical
user interfaces, application program interfaces
Disadvantages of DBMS

o Cost of Hardware and Software: It requires a high speed of data processor


and large memory size to run DBMS software.
o Size: It occupies a large space of disks and large memory to run them
efficiently.
o Complexity: Database system creates additional complexity and
requirements.
o Higher impact of failure: Failure is highly impacted the database because
in most of the organization, all the data stored in a single database and if
the database is damaged due to electric failure or database corruption then
the data may be lost forever.
Types of Database Management Systems

1. Hierarchical databases
2. Network databases
3. Relational databases
4. Object-oriented databases
5. Graph databases
6. ER model databases
Hierarchical databases
Network Databases
Relational Databases

• In a relational database management system (RDBMS), the relationship between data is relational and data is
stored in tabular form of columns and rows. Each column of a table represents an attribute and each row in a
table represents a record. Each field in a table represents a data value.
Object-Oriented Model
Graph Databases

• Graph Databases are NoSQL databases and use a graph structure for
semantic queries. The data is stored in the form of nodes, edges, and
properties. In a graph database, a Node represents an entity or instance
such as a customer, person, or car. A node is equivalent to a record in a
relational database system. An Edge in a graph database represents a
relationship that connects nodes.
ER Model Databases

•An ER model is typically implemented as a database. In a simple


relational database implementation, each row of a table represents one
instance of an entity type, and each field in a table represents an
attribute type. In a relational database, a relationship between entities is
implemented by storing the primary key of one entity as a pointer or
"foreign key" in the table of another entity.
Table/FIELD/ROW

ID Name AGE COURSE


1 Ajeet 24 B.Tech

2 aryan 20 C.A

3 Mahesh 21 BCA

4 Ratan 22 MCA

5 Vimal 26 BSC
Database Query

•A query is a way of requesting information from the database. A


database query can be either a select query or an action query. A select
query is a query for retrieving data, while an action query requests
additional actions to be performed on the data, like deletion, insertion,
and updating.

•For example, A manager can perform a query to select the employees


who were hired 5 months ago. The results could be the basis for creating
performance evaluations.
Data Warehouse

•A data warehouse is a technique for collecting and managing data from


varied sources to provide meaningful business insights. It is a blend of
technologies and components which allows the strategic use of data.
•Data Warehouse is electronic storage of a large amount of information
by a business which is designed for query and analysis instead of
transaction processing. It is a process of transforming data into
information and making it available to users for analysis.
Data Mining

•Data mining is looking for hidden, valid, and potentially useful patterns
in huge data sets. Data Mining is all about discovering unsuspected/
previously unknown relationships amongst the data. It is a multi-
disciplinary skill that uses machine learning, statistics, AI and database
technology.
Data Mining Techniques

•Data mining includes the utilization of refined data analysis tools to


find previously unknown, valid patterns and relationships in huge data
sets. These tools can incorporate statistical models, machine learning
techniques, and mathematical algorithms, such as neural networks or
decision trees. Thus, data mining incorporates analysis and prediction.

Business Intelligence
• BI (Business Intelligence) is a set of processes, architectures, and technologies
that convert raw data into meaningful information that drives profitable business
actions. It is a suite of software and services to transform data into actionable
intelligence and knowledge. BI has a direct impact on organization's strategic,
tactical and operational business decisions. BI supports fact-based decision
making using historical data rather than assumptions and gut feeling.

• The term Business Intelligence (BI) refers to technologies, applications and


practices for the collection, integration, analysis, and presentation of business
information. The purpose of Business Intelligence is to support better business
decision making. Essentially, Business Intelligence systems are data-driven
Decision Support Systems (DSS)
•THANK YOU

You might also like