Comparative Study Among AWS, Azure and HortonWorks
Microsoft HDInsight
Class: CA191
Jamahuriya University of Science and Technology
Figure 2: Components of Amazon EC2[1]

C. Microsoft Azure HDInsight
Microsoft Azure is the cloud computing service that Microsoft offers. The platform offers over 600 services and is a web-based platform on which you can build, test, deploy and manage applications and services[6]. Microsoft designed Azure HDInsight as a cloud-based service for processing and analyzing large volumes of streaming and historical data. Enterprises can further use HDInsight as a fully managed analytics service. HDInsight enables developers to build big data applications using open-source frameworks such as Apache Hadoop, Apache Spark, Apache Hive, Apache Kafka, Apache LLAP, Apache Storm, and Microsoft Machine Learning Server[7]. Azure HDInsight allows developers to build custom big data solutions and process massive amounts of data using implementations of widely used Apache products. Developers can run batch processing using Apache Pig, Apache Spark, or Apache Hive. Likewise, they can access NoSQL data using Apache HBase, and process millions of streaming events using Apache Storm, Apache Spark, or Apache Kafka. Users can further integrate Apache Spark with Hadoop MapReduce to extract, transform, and load (ETL) large data sets on demand[7]. Microsoft Azure offers Software as a Service (SaaS), Platform as a Service (PaaS) and Infrastructure as a Service (IaaS). It supports any kind of programming language, tool and framework, bringing customers a leading marketplace of services that they can use[1].

Figure 3: Cloud service models

2. Components of AWS[9]
1. Data Management and Data Transfer
2. Compute & Networking
3. Storage
4. Automation and Orchestration
5. Visualization
6. Operations and Management
7. Security and Compliance

B. HortonWorks
Hortonworks Data Platform (HDP) is an open-source framework for distributed storage and processing of large, multi-source data sets. Hortonworks is the Hadoop distribution that supports the Windows platform on premises, while helping you drive new revenue streams, improve customer experience, and control costs.

1. Highlights of Hortonworks[1]
1. The Hortonworks economic model is to sell support and training rather than licenses.
2. It is a big contributor to Hadoop.
3. It uses the existing data platform to embed Hadoop.

2. Components of Hortonworks
The Hortonworks Data Platform consists of three layers:
1. Core Hadoop 2: The basic components of Apache Hadoop version 2.x.
• Hadoop Distributed File System (HDFS)
• YARN
• MapReduce 2 (MR2)
2. Essential Hadoop: A set of Apache components designed to ease working with Core Hadoop. Some are Apache HBase, Apache HCatalog, Apache Hive, and Apache Pig.
3. Supporting Components: A set of components that allow you to monitor your Hadoop installation and to connect Hadoop with your larger compute environment.

C. Microsoft Azure
Azure is a public cloud computing platform with solutions including Infrastructure as a Service (IaaS), Platform as a Service (PaaS), and Software as a Service (SaaS) that can be used for services such as analytics, virtual computing, storage, networking, and much more.

1. Highlights of Azure[10], [11]
• Azure supports the IaaS, PaaS, and SaaS domains.
• Global – Data is housed in geo-synchronous data centers.
• Open – Supports almost any OS, language, tool, or framework.
• Flexible – Move compute resources up and down as needed.
• Azure facilitates easy mobility and a reliable, consistent platform between on-premises and public cloud.
• Azure has hybrid capabilities that make it unique.

2. Components of Microsoft Azure [12]

I. Features of AWS, Azure and Hortonworks

Feature | Azure | AWS | Hortonworks
Storage services | Blob Storage, Containers, Azure Drive, Table Storage, Tables, Storage Stats | S3, Buckets, EBS, SDB, SQS, CloudFront, AWS Import/Export | Hortonworks Data Platform (HDP) is an open-source framework for distributed storage and processing of large, multi-source data sets. Easy to use
Database services | MS SQL, SQL Sync | MySQL, Oracle | PostgreSQL, MySQL