
ST. ANTHONY’S COLLEGE
Business Education Department
San Jose de Buenavista, 5700 Antique
Tel. No. (036) 5409238; 5400898; 5409971 Tel. No.: (036) 5409196
Website: www.sac.edu.ph Email: info@sac.edu.ph; bused@sac.edu.ph

AIS 306
DATA WAREHOUSING AND MANAGEMENT
(WEEK 1 ACTIVITY)
PRELIM, Week 1
WORKSHEET NO. 1
SCORE:

NAME: CEPEDA, HELEN ROSE S. (Last Name, First Name, M.I.)
Program, Year and Section: BSAIS 4

ACTIVITY 1: Answer the questions comprehensively. Write/encode your answer on this sheet.

Please submit your Worksheet No. 1 to Google Classroom on or before September 5, 2021, 11:59 p.m. Your Worksheet No. 1 may be in PDF or Word format.

1. Discuss the importance of a data warehouse in a manufacturing corporation.

(10 pts) Answer:

A data warehouse is a central repository for the data that a manufacturer collects from its various operational and
transactional systems. Having this data in one place allows the company to analyze it as a whole, improve its
operations, and gain a competitive advantage.

2. Give three examples of cloud data warehouse solution providers. State their best features.

(10 pts) Answer:

Amazon Redshift is one of the most popular data warehousing solutions on the market today. The service
powers the analytics of many businesses, from startups to Fortune 500 companies. Well-known brands
using Redshift include Intuit, Lyft, Yelp, and McDonald’s.
One of the best features of Redshift is its tight integration with a data lake and the rest of the AWS
environment. Redshift lets developers and business leaders query large volumes of structured and
semi-structured data from many sources, and it runs on Amazon’s AWS infrastructure. For data outside of
your S3 data lake, you can use AWS Glue to extract, transform, and load (ETL) data into the warehouse.
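
The extract, transform, and load flow described above can be sketched in plain Python. This is only an illustration under stated assumptions, not AWS Glue's API: the sample records, the `sales` table, and the function names are hypothetical, and an in-memory SQLite database stands in for the warehouse.

```python
import sqlite3

# Hypothetical raw records, as they might arrive from an operational system.
RAW_ORDERS = [
    {"order_id": "1001", "region": " north ", "amount": "250.00"},
    {"order_id": "1002", "region": "South", "amount": "99.50"},
    {"order_id": "1003", "region": "NORTH", "amount": "n/a"},  # unparseable amount
]


def extract():
    """Extract: read the raw records from the (hypothetical) source."""
    return list(RAW_ORDERS)


def transform(rows):
    """Transform: convert types, normalize region names, drop bad rows."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount cannot be parsed
        clean.append((int(row["order_id"]), row["region"].strip().title(), amount))
    return clean


def load(conn, rows):
    """Load: write the cleaned rows into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(order_id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


# SQLite stands in for the warehouse in this sketch.
warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(extract()))
loaded = warehouse.execute(
    "SELECT order_id, region, amount FROM sales ORDER BY order_id"
).fetchall()
print(loaded)
```

Only the two rows with a valid amount reach the `sales` table; the third is dropped in the transform step.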

Snowflake is a leading data warehousing solution that runs on a choice of public cloud platforms.
It helps a business become more data-driven, which in turn supports better customer experiences.
Snowflake offers per-second pricing, so you pay only for the compute that you actually use.
Its architecture simplifies the data pipeline and reduces unnecessary complexity.
You also get self-service access to the additional functionality that you need.
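
To show why per-second pricing matters, the short calculation below compares it with billing that rounds up to full hours. The hourly rate and both function names are hypothetical, chosen only for illustration; they do not reflect any provider's actual price list, and real providers may apply minimum charges that this sketch ignores.

```python
import math


def per_second_cost(seconds_used, rate_per_hour):
    """Charge only for the seconds actually used."""
    return seconds_used * rate_per_hour / 3600


def hourly_rounded_cost(seconds_used, rate_per_hour):
    """Charge for every started hour, for comparison."""
    return math.ceil(seconds_used / 3600) * rate_per_hour


RATE = 3.00            # hypothetical price per hour of compute
run_seconds = 15 * 60  # a 15-minute workload

print(per_second_cost(run_seconds, RATE))      # 0.75
print(hourly_rounded_cost(run_seconds, RATE))  # 3.0
```

Under these assumed numbers, a 15-minute workload costs a quarter of the hourly rate instead of a full hour.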

Google BigQuery is a component of the Google Cloud Platform environment. This highly scalable, serverless
cloud data warehouse is well suited to companies that want to keep costs low and need a quick way to
make informed decisions through data analysis.
BigQuery sets itself apart by its accessibility. In particular, querying data with SQL and through Open
Database Connectivity (ODBC) is straightforward. Additionally, its three-year total cost of ownership (TCO)
is reported to be up to 34% lower than that of other cloud data warehouse alternatives. Integration
with machine learning tools from Google is another key differentiator if you are interested in moving into
artificial intelligence.

3. What is the difference between Online Analytical Processing (OLAP) and Online Transaction Processing (OLTP)?

(10 pts) Answer:

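Online transaction processing (OLTP) enables a business to record and process its day-to-day transactions in
real time, using many short reads and writes. Online analytical processing (OLAP) analyzes the data collected
by OLTP systems, typically by running aggregate queries over large volumes of historical data.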
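
The contrast can be sketched with Python's built-in `sqlite3` module. This is a minimal illustration, assuming a hypothetical `orders` table and sample values: the first function performs an OLTP-style write, and the second runs an OLAP-style aggregate over the accumulated rows. In practice the two workloads usually run on separate systems.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders "
    "(id INTEGER PRIMARY KEY, product TEXT, qty INTEGER, price REAL)"
)


def record_order(product, qty, price):
    """OLTP-style work: one short write, committed as a single transaction."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO orders (product, qty, price) VALUES (?, ?, ?)",
            (product, qty, price),
        )


def revenue_by_product():
    """OLAP-style work: scan and aggregate many rows for analysis."""
    return conn.execute(
        "SELECT product, SUM(qty * price) FROM orders "
        "GROUP BY product ORDER BY product"
    ).fetchall()


# Hypothetical transactions arriving one at a time.
record_order("bolt", 10, 0.5)
record_order("nut", 20, 0.25)
record_order("bolt", 4, 0.5)

summary = revenue_by_product()
print(summary)  # [('bolt', 7.0), ('nut', 5.0)]
```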

4. There are several different ways to establish a data warehouse, and there are numerous data warehousing tools that businesses can use to upload and analyze their data. Give three examples of data warehousing tools and their functions (example: Google BigQuery). (10 pts)

Answer:

Redshift is a cloud-based data warehousing tool for enterprises. The fully managed platform can query
petabyte-scale data, which makes it suitable for high-speed data analytics. It also supports automatic
concurrency scaling, which increases or decreases query processing resources to match workload
demand. This way, you can execute hundreds of concurrent queries without the operational overhead.
Additionally, Redshift allows you to resize your cluster or switch between node types, so you can
optimize data warehouse performance and cut operational costs.

Azure SQL Data Warehouse (now part of Azure Synapse Analytics) is a cloud-based relational data warehouse
from Microsoft. You can optimize it for petabyte-scale data loading and processing and for real-time reporting.
The platform has a node-based system, and it employs massively parallel processing (MPP). This architecture
splits a query across nodes that work concurrently, so you can extract and visualize business insights much
faster. The data warehouse is compatible with hundreds of MS Azure resources. For example, you may build
intelligent apps with the platform's machine learning tools. Also, the platform lets you store different types of
structured and unstructured data. The data may come from diverse sources, such as on-premises SQL
databases and IoT devices.
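
The idea behind massively parallel processing can be sketched as follows. This is a simplified illustration, not Azure's implementation: the function names and sample data are hypothetical, and worker threads stand in for the separate compute nodes of a real MPP system. Each "node" aggregates its own partition of the rows, and a coordinator combines the partial results.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(rows, node_count):
    """Distribute rows round-robin across the (simulated) nodes."""
    return [rows[i::node_count] for i in range(node_count)]


def local_sum(rows):
    """Work done independently on one node: sum its own partition."""
    return sum(amount for _, amount in rows)


def parallel_total(rows, node_count=4):
    """Coordinator: fan the work out, then merge the partial sums."""
    parts = partition(rows, node_count)
    with ThreadPoolExecutor(max_workers=node_count) as pool:
        partials = list(pool.map(local_sum, parts))
    return sum(partials)


# Hypothetical (sale_id, amount) rows.
sales = [(i, i * 10) for i in range(1, 101)]
total = parallel_total(sales)
print(total)  # 50500
```

The result matches a single-node sum over all rows; the benefit of MPP is that each node only has to scan its own share of the data.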

Vertica is an SQL data warehouse available in the cloud on platforms like AWS and Azure. You may also
deploy it on-premises or as a hybrid. The tool supports columnar storage and uses MPP to increase query
speed. Its shared-nothing architecture reduces contention for shared resources. Vertica offers built-in
capabilities for analytics. These include machine learning, pattern matching, and time series. It also supports
standard programming interfaces, such as OLE DB. The software uses compression to optimize storage.
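
Columnar storage and compression work well together, which the sketch below tries to show. It is an illustration only and not Vertica's actual storage format: the sample rows and function names are hypothetical, and run-length encoding is used here as one simple example of a compression scheme that benefits from storing each column's values together.

```python
from itertools import groupby

# Hypothetical row-oriented records: (day, country, units).
rows = [
    ("2021-09-01", "PH", 120),
    ("2021-09-01", "PH", 80),
    ("2021-09-02", "PH", 95),
    ("2021-09-02", "US", 40),
]


def to_columns(rows, names):
    """Pivot row-oriented records into one list per column."""
    return {name: [row[i] for row in rows] for i, name in enumerate(names)}


def rle_encode(values):
    """Run-length encode a column as (value, run_length) pairs."""
    return [(value, len(list(run))) for value, run in groupby(values)]


def rle_decode(pairs):
    """Expand (value, run_length) pairs back into the original column."""
    return [value for value, count in pairs for _ in range(count)]


columns = to_columns(rows, ["day", "country", "units"])
encoded_country = rle_encode(columns["country"])
print(encoded_country)        # [('PH', 3), ('US', 1)]
print(sum(columns["units"]))  # 335, computed from the one column it needs
```

Because repeated values sit next to each other within a column, four country values shrink to two pairs, and an aggregate such as the total units reads only the `units` column.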
