Case Studies and Advancements: Unit: 5
Unit: 5
CLOUD COMPUTING (TCS 074)
Mr. Saurabh Gupta
Department of CSE
Course Details
(B.Tech 7th Sem)
• OpenStack
• Framework
• Massive Storage
• Processing Power
• Big data is a term for the very large amounts of unstructured and semi-structured
data that a company creates.
• Loading that much data into a relational database for analysis would take too much
time and cost too much.
• Data read from many disks has to be combined, which is difficult to coordinate.
• Hadoop ties many small, reasonably priced machines together into a single
cost-effective compute cluster.
• If a node goes down, jobs are automatically redirected to other nodes so that the
distributed computation does not fail.
• It provides a simplified programming model that lets users quickly read from and
write to the distributed system.
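The redirection of jobs away from a failed node can be illustrated with a minimal sketch. This is not Hadoop's scheduler; the functions `run_on_node` and `schedule`, the node names, and the job name are all hypothetical and exist only to show the idea of retrying a task on another node.

```python
# Hypothetical sketch of job redirection; not Hadoop's actual scheduling code.

def run_on_node(node, task, failed_nodes):
    """Pretend to run a task on a node; raise if the node is down."""
    if node in failed_nodes:
        raise ConnectionError(f"{node} is down")
    return f"{task} done on {node}"


def schedule(task, nodes, failed_nodes):
    """Try each node in turn and redirect the task when a node fails."""
    for node in nodes:
        try:
            return run_on_node(node, task, failed_nodes)
        except ConnectionError:
            continue  # redirect the task to the next available node
    raise RuntimeError(f"no healthy node available for {task}")


nodes = ["node-1", "node-2", "node-3"]  # assumed cluster members
result = schedule("job-42", nodes, failed_nodes={"node-1"})
print(result)
```

Here node-1 is marked as failed, so the task is picked up by the next node in the list instead of causing the whole job to fail.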
• MapReduce is a programming model for processing and generating large data
sets with a parallel, distributed algorithm on a cluster.
• The MAP function processes a key/value pair to generate a set of intermediate
key/value pairs.
• The REDUCE function merges all intermediate values associated with the same
intermediate key.
• Used for text processing on massively scalable web data stored using Bigtable and
the GFS distributed file system.
• Designed for processing and generating large volumes of data via massively
parallel computations, utilizing tens of thousands of processors at a time.
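The MAP and REDUCE functions described above can be sketched with the classic word-count example. This is a single-process illustration only, assuming small in-memory documents; the names `map_fn`, `reduce_fn`, and `map_reduce` are chosen for this sketch and are not part of any framework API.

```python
from collections import defaultdict


def map_fn(doc_id, text):
    """MAP: take one (key, value) pair and emit intermediate (word, 1) pairs."""
    return [(word.lower(), 1) for word in text.split()]


def reduce_fn(word, counts):
    """REDUCE: merge all intermediate values that share the same key."""
    return word, sum(counts)


def map_reduce(documents):
    """Run map over every document, group by key, then reduce each group."""
    grouped = defaultdict(list)
    for doc_id, text in documents.items():
        for key, value in map_fn(doc_id, text):
            grouped[key].append(value)
    return dict(reduce_fn(key, values) for key, values in grouped.items())


# Assumed sample input, for illustration only.
docs = {"d1": "the cloud stores data", "d2": "the cluster processes the data"}
word_counts = map_reduce(docs)
print(word_counts)
```

In a real cluster the map calls and the reduce calls would run in parallel on different machines; the grouping step in the middle is what the framework performs between the two phases.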
• Computing power
• Flexibility
• Fault Tolerance
• Low Cost
• Scalability
• Integration with existing systems
• Hadoop is not optimized for ease of use.
• Map phase:
– Each mapper reads approximately 1/M of the input from the global file system.
• Reduce phase:
– The master informs the reducers where the partial computations have been stored
on the local files of the respective mappers.
– Reducers make remote procedure call requests to the mappers to fetch the files.
– Each reducer groups the results of the map step by key and applies a function f
to the list of values that correspond to each key.
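The two phases above can be simulated in one process to make the data flow concrete. This is a sketch under stated assumptions: M mappers and R reducers are modeled as plain loops, the "local files" are in-memory lists, and keys are assigned to reducers by a simple character-sum partition chosen for this example. The name `run_phases` and its parameters are hypothetical.

```python
from collections import defaultdict


def run_phases(records, M, R, map_fn, f):
    """Simulate the map and reduce phases with M mappers and R reducers."""
    # Map phase: mapper i reads roughly 1/M of the input.
    splits = [records[i::M] for i in range(M)]

    # Each mapper writes its output into R local "files", one per reducer.
    local_files = []
    for split in splits:
        files = [[] for _ in range(R)]
        for record in split:
            for key, value in map_fn(record):
                partition = sum(map(ord, key)) % R  # assumed partitioning rule
                files[partition].append((key, value))
        local_files.append(files)

    # Reduce phase: reducer r fetches file r from every mapper,
    # groups the pairs by key, and applies f to each list of values.
    results = {}
    for r in range(R):
        grouped = defaultdict(list)
        for files in local_files:
            for key, value in files[r]:
                grouped[key].append(value)
        for key, values in grouped.items():
            results[key] = f(values)
    return results


records = ["a b a", "b c", "a c c"]  # assumed sample input
totals = run_phases(
    records, M=2, R=2,
    map_fn=lambda line: [(word, 1) for word in line.split()],
    f=sum,
)
print(totals)
```

Because every occurrence of a key lands in the same partition, each reducer sees the complete list of values for its keys, which is why f can be applied independently per reducer.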
• VirtualBox is open-source software for virtualizing the x86 computing architecture.
• It acts as a hypervisor, creating a VM (virtual machine) in which the user can run
another OS (operating system).
• The operating system in which VirtualBox runs is called the "host" OS.
• On January 27, 2010, Oracle Corporation purchased Sun Microsystems and took over
development of VirtualBox.
• Dynamic web server with full support for common web technologies
• Java 5, Java 6
• Go
Advantages
• Infrastructure for Security
• Scalability
• Performance and Reliability
• Cost Savings
• Platform Independence
Disadvantages
• You Are At Google’s Mercy
• Violation of Policies
• Forget Porting
• It isn’t Free
• OpenStack is a cloud operating system that controls large pools of compute, storage,
and networking resources throughout a datacenter, all managed through a dashboard
that gives administrators control while empowering their users to provision
resources through a web interface.
▪ On top of IaaS, e.g. Cloud Foundry
▪ Storage for VMs and arbitrary files
The logical and operational level of a federated cloud identifies and addresses the
challenges in devising a framework that aggregates providers belonging to different
administrative domains within the context of a single overlay infrastructure, which
is the cloud federation.
• The federation of cloud resources allows a client to choose the best cloud services
provider, in terms of flexibility,
• Federation across different cloud resource pools allows applications to run in the most
appropriate infrastructure environments.
• Federation also makes it possible to move data between disparate networks and to
implement innovative security models for user access to cloud resources.
• The federated cloud model is a force for real democratization in the cloud market.
• It’s how businesses will be able to use local cloud providers to connect with customers,
partners and employees anywhere in the world.
• It’s how end users will finally get to realize the promise of the cloud.
• And, it’s how data center operators and other service providers will finally be able to
compete with, and beat, today’s so-called global cloud providers
QUESTIONS
1. A ________ serves as the master, and there is only one NameNode per cluster.
a) DataNode
b) NameNode
c) Data block
d) Replication
2. Swift is OpenStack's object storage system, while Cinder deals with block storage.
a) True
b) False
3. Above the file systems comes the ________ engine, which consists of one JobTracker, to
which client applications submit MapReduce jobs.
a) MapReduce
b) Google
c) Functional programming
d) Facebook