Professional Documents
Culture Documents
HDFS and Oracle
HDFS and Oracle
Introduction
Johan Louwers Lead Architect Oracle Technology
The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity
hardware. It has many similarities with existing distributed file systems. However, the differences from
other distributed file systems are significant. HDFS is highly fault-tolerant and is designed to be deployed
on low-cost hardware. HDFS provides high throughput access to application data and is suitable for
applications that have large data sets. HDFS relaxes a few POSIX requirements to enable streaming
access to file system data. HDFS was originally built as infrastructure for the Apache Nutch web search
engine project. HDFS is now an Apache Hadoop subproject. The project URL
is http://hadoop.apache.org/hdfs/.
HDFS introduction
HDFS Name Node
HDFS introduction
HDFS Storage
HDFS introduction
HDFS Storage
HDFS introduction
HDFS Storage
10
11
12
13
14
15
Contact me
Johan Louwers
Capgemini Lead Architect Oracle Technology
Mail
Twitter
Blog 1
Blog 2
: Johan.Louwers@capgemini.com
: @johanlouwers
: http://www.capgemini.com/blog/capgemini-oracle-blog
: http://johanlouwers.blogspot.com
16
About Capgemini
With almost 140,000 people in over 40 countries, Capgemini is
one of the world's foremost providers of consulting, technology
and outsourcing services. The Group reported 2013 global
revenues of EUR 10.1 billion.
Together with its clients, Capgemini creates and delivers
business and technology solutions that fit their needs and drive
the results they want. A deeply multicultural organization,
Capgemini has developed its own way of working, the
Collaborative Business Experience, and draws on
Rightshore, its worldwide delivery model.
Learn more about us at www.capgemini.com.
www.capgemini.com
The information contained in this presentation is proprietary.
2014 Capgemini. All rights reserved.
Rightshore is a trademark belonging to Capgemini.