
Database · Data Warehouse · Data Lake

What’s the Difference?

Member List

De Borja, Bryle Christian
El-Assmar, Khaled Waleed Noaman
Esteves, Marc Joshua
Laudato, Rafael
Ugalde, Zaimon

TABLE OF CONTENTS

01 Database: What is a database?
02 Data Warehouse: What is a data warehouse?
03 Data Lake: What is a data lake?
04 Differences: How are they different from one another?
05 Common Cons: What are the common cons of building a data warehouse?
06 Biggest Application: Group’s opinion
07 Cloud Computing: Details of cloud computing and distinction between the rest


01
Database
What is a database?

An organized collection of data
A database is designed for easy retrieval, management, and
manipulation of data. It acts as a digital archive that
supports a variety of tasks, such as data analysis,
reporting, and transactions. Databases are used in many
applications, from personal projects to large-scale
business systems.
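
As a rough illustration of retrieval, management, and manipulation, the sketch below uses Python's built-in sqlite3 module with an in-memory database. The customers table, its columns, and the sample rows are hypothetical and chosen only for this example.

```python
import sqlite3


def demo_database():
    # In-memory database, so nothing is written to disk.
    conn = sqlite3.connect(":memory:")

    # Management: define an organized structure for the data.
    # The table and column names are hypothetical.
    conn.execute(
        "CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, city TEXT)"
    )
    conn.executemany(
        "INSERT INTO customers (name, city) VALUES (?, ?)",
        [("Ana", "Manila"), ("Ben", "Cebu")],
    )

    # Manipulation: change an existing record.
    conn.execute("UPDATE customers SET city = ? WHERE name = ?", ("Davao", "Ben"))

    # Retrieval: read the records back in a defined order.
    rows = conn.execute("SELECT name, city FROM customers ORDER BY id").fetchall()
    conn.close()
    return rows
```

Calling demo_database() returns the two customer rows, with the second row reflecting the update.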

Uses in Real Life

E-COMMERCE
Product information, customer profiles, purchase histories, transaction records, etc.

HEALTHCARE
Patient records, medical histories, diagnostic reports, and treatment plans

TRANSPORTATION
Tracking schedules, routes, vehicle maintenance, and passenger information

02
Data Warehouse
What is a data warehouse?

A Centralized Data Repository
A data warehouse supports business insights, data analysis, and
reporting. It is a database optimized for analysis rather than for
transactional processing. Data warehouses pull information from
multiple sources, standardize it, and provide a foundation for
strategic decision-making.
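
The "multiple sources, standardize, analyze" flow can be sketched in a few lines of plain Python. This is only a toy model under stated assumptions: the two source systems, their field names, and the sales figures are all hypothetical, and a real warehouse would use dedicated storage and loading tools.

```python
from collections import defaultdict

# Two hypothetical source systems that describe sales differently.
STORE_SALES = [
    {"sku": "A1", "amount_usd": 10.0},
    {"sku": "B2", "amount_usd": 5.5},
]
WEB_SALES = [
    {"product_code": "a1", "amount_cents": 2500},
]


def standardize_store(record):
    # Store records already use dollars; only the SKU casing is normalized.
    return {"sku": record["sku"].upper(), "amount_usd": record["amount_usd"]}


def standardize_web(record):
    # Web records use a different key and store amounts in cents.
    return {
        "sku": record["product_code"].upper(),
        "amount_usd": record["amount_cents"] / 100,
    }


def load_warehouse():
    # Combine every source into one list of rows with a shared format.
    rows = [standardize_store(r) for r in STORE_SALES]
    rows += [standardize_web(r) for r in WEB_SALES]
    return rows


def revenue_by_sku(rows):
    # Analysis step: total revenue per product across all sources.
    totals = defaultdict(float)
    for row in rows:
        totals[row["sku"]] += row["amount_usd"]
    return dict(totals)
```

Here revenue_by_sku(load_warehouse()) combines the store and web figures for product A1 into a single total, which neither source could report on its own.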


Insurance
Assess risk profiles, process claims efficiently, and create actuarial models for accurate pricing of insurance policies.

Governments
Analyze census data, monitor public health trends, and make data-driven policy decisions.

Education
Track student performance, analyze enrollment trends, and improve academic planning.

03
Data Lake
What is a data lake?

Stores a large volume of untouched data
A data lake is a storage space that keeps data in its
original form until it is needed for analysis or other processing.
Unlike databases and data warehouses, which organize data
systematically, data lakes store data as it is, including
structured, semi-structured, and unstructured formats such
as text, images, and videos.
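
A minimal sketch of the "store as-is, interpret later" idea follows, using a temporary folder as a stand-in for a data lake. The file names, the sample payloads, and the ingest and read_json_events helpers are hypothetical; real data lakes typically sit on large-scale object storage rather than a local folder.

```python
import json
import tempfile
from pathlib import Path


def ingest(lake, name, payload):
    # Store the bytes exactly as received, with no parsing or schema check.
    path = lake / name
    path.write_bytes(payload)
    return path


def read_json_events(lake):
    # Structure is applied only now, at analysis time, and only to JSON files.
    events = []
    for path in sorted(lake.glob("*.json")):
        events.append(json.loads(path.read_text(encoding="utf-8")))
    return events


def demo_lake():
    with tempfile.TemporaryDirectory() as tmp:
        lake = Path(tmp)
        # Semi-structured, unstructured, and structured data side by side.
        ingest(lake, "sensor_001.json", b'{"device": "meter-7", "kwh": 1.5}')
        ingest(lake, "notes.txt", b"free-form maintenance note")
        ingest(lake, "readings.csv", b"device,kwh\nmeter-8,2.0\n")
        stored = sorted(p.name for p in lake.iterdir())
        events = read_json_events(lake)
    return stored, events
```

All three files are accepted regardless of format; only the analysis step decides which ones to read and how to interpret them.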


Uses of Data Lakes

DATA EXPLORATION
Data scientists and analysts can explore data without worrying about data structure or schema. This facilitates uncovering hidden patterns, correlations, and trends.

RISK MANAGEMENT
Financial institutions use data lakes to store and analyze historical trading data, helping with risk assessment, fraud detection, and compliance.

ENERGY AND UTILITIES
Data lakes store data from sensors and devices in energy and utility networks, supporting predictive maintenance, asset management, and energy consumption analysis.

04
Differences

Nature
Data lakes contain raw data. Databases hold data that has been organized into a logical, meaningful structure. Data warehouses draw data from multiple databases to create one large store built from many sources.

Size
The size and capacity of these systems are not determined by their category alone but also by their specific implementation, build, and technology choices. Businesses choose the storage option that best fits their data and analytics requirements, taking into consideration aspects such as data volume, performance, scalability, and cost.

05
Common Pitfalls of Building a Data Warehouse
1. Lack of Clear Goals and Strategy: Starting without a clear understanding of what the
data warehouse is meant to achieve can lead to misalignment between business needs
and the implementation, resulting in wasted effort and resources.


2. Inadequate Data Governance: Neglecting data governance, including data quality,
security, and metadata management, can lead to unreliable or inaccurate insights
derived from the warehouse.


3. Scalability Issues: Inadequate planning for data growth can lead to scalability problems
as the data warehouse expands, impacting performance and responsiveness.


4. Rapidly Changing Requirements: Shifting business needs and requirements can cause
the data warehouse to become outdated or misaligned with the evolving goals of the
organization.


06
Group Opinion
What’s the biggest application of a data warehouse in today’s world?

As a group, we believe that trend detection is what data
warehouses are most used for. Organizations are always
looking to anticipate the future and find new areas in
which to invest their resources. Some members argued
strongly for performance tracking instead. Although we
disagreed about whether it is the biggest application,
none of us denied its significance in today’s world.

07
Cloud Computing
Details of cloud computing and distinction between the rest

It’s all in the cloud
Cloud storage uses remote servers run by cloud service providers
to store and manage data over the internet. Instead of depending
on local storage such as on-site drives, businesses and individuals
can use cloud storage to store data securely, access it from any
location with an internet connection, and take advantage of the
scalability and flexibility that cloud technology provides.

