Welcome to Scribd!

Skip carousel

0% found this document useful (0 votes)

4 views

2 DE +Installing+Apache+Spark+on+CDH+EC2

Uploaded by

Junaid Sheikh

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Apache Upgrade For Content Server
Document6 pages
Apache Upgrade For Content Server
KotiEswar
No ratings yet
The Little Book of Sitecore® Tips: Volume 1
From Everand
The Little Book of Sitecore® Tips: Volume 1
Neil P Shack
No ratings yet
NoSQL Injection for Elasticsearch
From Everand
NoSQL Injection for Elasticsearch
Gary Drocella
No ratings yet
DevOps. How to build pipelines with Jenkins, Docker container, AWS ECS, JDK 11, git and maven 3?
From Everand
DevOps. How to build pipelines with Jenkins, Docker container, AWS ECS, JDK 11, git and maven 3?
John Edward Cooper Berg
No ratings yet
APKTOOLS 1.5.2 & Apk Tool Install Windows R05-Ibot
Document2 pages
APKTOOLS 1.5.2 & Apk Tool Install Windows R05-Ibot
Heernaan Rodriiguezz
67% (3)
Install Guide D-Link TR-069: Alpha Version
Document7 pages
Install Guide D-Link TR-069: Alpha Version
hacksystem
No ratings yet
Experiment No. 5 Step 1: Install Apache2
Document36 pages
Experiment No. 5 Step 1: Install Apache2
Sagar Padhy
No ratings yet
Install Oracle Enterprise Manager
Document15 pages
Install Oracle Enterprise Manager
JENIFA JEYAKUMAR
100% (1)
Project 5 Icinga
Document25 pages
Project 5 Icinga
vinay kumar Singh
No ratings yet
Install Cloudera Manager Using AMI On Amazon EC2
Document39 pages
Install Cloudera Manager Using AMI On Amazon EC2
Ram Guggul
No ratings yet
Running A Pig Program On The CDH Single Node Cluster On An Aws Ec2 Instance
Document21 pages
Running A Pig Program On The CDH Single Node Cluster On An Aws Ec2 Instance
Ram Guggul
No ratings yet
Lab-Kafka Administration VI
Document197 pages
Lab-Kafka Administration VI
Vaibhav Marathe
No ratings yet
Java Tomcat Setup
Document4 pages
Java Tomcat Setup
shawnqiang
No ratings yet
End To End Deploy in Ubuntu
Document31 pages
End To End Deploy in Ubuntu
Raashid Shahab
No ratings yet
Apex 20.2 With ORDS & Apache Tomcat
Document27 pages
Apex 20.2 With ORDS & Apache Tomcat
Md Shaiduzzaman Shuvo
100% (1)
Hosting Static Website With Docker Container in Aws Ec-2: Smit Dharaiya
Document5 pages
Hosting Static Website With Docker Container in Aws Ec-2: Smit Dharaiya
Dharaiya Text-tiles
No ratings yet
Springboot Docker MysqlRDS
Document11 pages
Springboot Docker MysqlRDS
Rabbani Shaikh
No ratings yet
v2 3 Running+PySpark+on+Jupyter+NoteBook
Document8 pages
v2 3 Running+PySpark+on+Jupyter+NoteBook
Junaid Sheikh
No ratings yet
Multiple VPC Networks
Document20 pages
Multiple VPC Networks
kanedakodama
No ratings yet
Apache Java Tomcat Mod JK
Document30 pages
Apache Java Tomcat Mod JK
viren0307
No ratings yet
HowToInstallEM12cOnODA asPDF
Document21 pages
HowToInstallEM12cOnODA asPDF
astn98
No ratings yet
Deploy, Scale, and Update Your Website On Google Kubernetes Engine
Document19 pages
Deploy, Scale, and Update Your Website On Google Kubernetes Engine
subodh
No ratings yet
Solr Configuration: Guide To Installing Open Source Search Solution Solr On Windows and Linux
Document13 pages
Solr Configuration: Guide To Installing Open Source Search Solution Solr On Windows and Linux
sandeep amilineni
No ratings yet
Oracle 12cR2 Installations
Document48 pages
Oracle 12cR2 Installations
Nasir Mahmood
No ratings yet
Cloud Computing Lab 2
Document4 pages
Cloud Computing Lab 2
Yen-Kai Cheng
No ratings yet
Installation OpenMeetings 5.0.0 On Ubuntu 20.04 Lts
Document17 pages
Installation OpenMeetings 5.0.0 On Ubuntu 20.04 Lts
Luis Agustin Suaña Jala
No ratings yet
Deploying Openstack Lab On GCP-v3
Document10 pages
Deploying Openstack Lab On GCP-v3
Mainak Chakraborty
No ratings yet
Opennebula and Amazon Ec2 PDF
Document12 pages
Opennebula and Amazon Ec2 PDF
rafik03
No ratings yet
Build A Basic CRUD App With Angular 5
Document15 pages
Build A Basic CRUD App With Angular 5
nagarajuvcc123
No ratings yet
Content Server Installation and Configuration
Document35 pages
Content Server Installation and Configuration
Santosh Sarkale
No ratings yet
Cruz Freddy Cloud Based System
Document37 pages
Cruz Freddy Cloud Based System
api-683209730
No ratings yet
Active Directory Backup and Restore1
Document20 pages
Active Directory Backup and Restore1
NagarajuRb
No ratings yet
Squid Squidguard On Centos
Document7 pages
Squid Squidguard On Centos
Zeeshan Muhammad
No ratings yet
Minikube Instalation
Document15 pages
Minikube Instalation
darveshchauhan0001
No ratings yet
Step by Step Oracle 12c Grid Infrastructure - Installation
Document70 pages
Step by Step Oracle 12c Grid Infrastructure - Installation
8A8 - 28 - Huỳnh Ngọc Anh Thư
100% (1)
RHEL Cluster Suite
Document31 pages
RHEL Cluster Suite
Rohit Khurana
No ratings yet
Basic Apache Server Configuration Step by Step
Document3 pages
Basic Apache Server Configuration Step by Step
Jebin Jacob Luke
No ratings yet
Installing Docker On Amazon Linux 2
Document6 pages
Installing Docker On Amazon Linux 2
brayan segundo
No ratings yet
Install Squid Windows
Document19 pages
Install Squid Windows
Nguyễn Quốc Huy
No ratings yet
Oppstartsmanual KV-Multiprog 2
Document12 pages
Oppstartsmanual KV-Multiprog 2
thang doan
No ratings yet
R12 Installation 64 Bit On OEL 5 Update 5
Document18 pages
R12 Installation 64 Bit On OEL 5 Update 5
kkpareek
No ratings yet
Install Wamp SSL PDF
Document9 pages
Install Wamp SSL PDF
Ionel Gherasim
No ratings yet
Install The Guacamole Client
Document5 pages
Install The Guacamole Client
Samuel Zodingliana
No ratings yet
Step by Step Install of Grid Control 10g R2 On Linux: 1. Select Complete Installation
Document7 pages
Step by Step Install of Grid Control 10g R2 On Linux: 1. Select Complete Installation
smart_aix
No ratings yet
Oam Upgrade 10g To 11gr2 Steps
Document69 pages
Oam Upgrade 10g To 11gr2 Steps
api-219602070
No ratings yet
Tools and Sky130 Installation With WSL2
Document37 pages
Tools and Sky130 Installation With WSL2
Darwin Villamizar
No ratings yet
Installing PHP For Dynamic Web Pages
Document11 pages
Installing PHP For Dynamic Web Pages
Asif Iqbal
No ratings yet
Deploy Agent in Oracle Enterprise Manager 12c: (Root@em12c ) # VI /etc/sudoers
Document11 pages
Deploy Agent in Oracle Enterprise Manager 12c: (Root@em12c ) # VI /etc/sudoers
mohit.oracledba
No ratings yet
Oracle 19C MultiTenant Database 1704253431
Document18 pages
Oracle 19C MultiTenant Database 1704253431
deepak23aug
No ratings yet
LFD259 Kubernetes For Developers Version
Document96 pages
LFD259 Kubernetes For Developers Version
ahhung77
No ratings yet
Xrdocs Io CNBNG Tutorials Inception Server Deployment Guide
Document8 pages
Xrdocs Io CNBNG Tutorials Inception Server Deployment Guide
Konstantinos Dimitriou
No ratings yet
ABC
Document40 pages
ABC
Database 1
No ratings yet
Configuration of VVS Development Environment On Oracle Forms / Reports 11gR2
Document14 pages
Configuration of VVS Development Environment On Oracle Forms / Reports 11gR2
Madallin Oprea
No ratings yet
Creating A VM in Google Cloud
Document7 pages
Creating A VM in Google Cloud
Rhugved Takalkar
No ratings yet
Práctica de Laboratorio 26.1.7
Document10 pages
Práctica de Laboratorio 26.1.7
rojas.saldana.armando.sptm
No ratings yet
Capstone Project Final Steps
Document6 pages
Capstone Project Final Steps
Sweety Sweeti
No ratings yet
Hadoop 2.x Installation Guide
Document25 pages
Hadoop 2.x Installation Guide
ahmed_sft
100% (1)
WebLogic Server 11g
Document16 pages
WebLogic Server 11g
moisendiaye245
No ratings yet
Running Cassandra On Eclipse
Document4 pages
Running Cassandra On Eclipse
Sainareshmut
No ratings yet
BigData Challenge - L2 (Spark)
Document3 pages
BigData Challenge - L2 (Spark)
harshapandey112233
No ratings yet
Deploy Applications On Kubernetes
Document12 pages
Deploy Applications On Kubernetes
Miro
No ratings yet
Creating Databases and Data Placement
Document17 pages
Creating Databases and Data Placement
alfiatuz
No ratings yet
Custom MK-SS808 Image
Document4 pages
Custom MK-SS808 Image
gejib
No ratings yet
Chapter02-Accessing The Command Line
Document4 pages
Chapter02-Accessing The Command Line
Shahabuddin Mohammed Ahmed
No ratings yet
Log
Document97 pages
Log
Morteza Rafael
No ratings yet
TCP Keepalive HOWTO: Fabio Busatto
Document16 pages
TCP Keepalive HOWTO: Fabio Busatto
Ali N Chouman
No ratings yet
Q) Explain Process Control Block. Draw The Block Diagram: of Process Transition States
Document23 pages
Q) Explain Process Control Block. Draw The Block Diagram: of Process Transition States
ganeshshimpi125
No ratings yet
Bugreport
Document13 pages
Bugreport
Daniela
No ratings yet
FAQ OSY Updated
Document5 pages
FAQ OSY Updated
Satejsingh Shiledar
No ratings yet
Mapi 32
Document4 pages
Mapi 32
FLORES TORRES KATHERINE MARIA
No ratings yet
TSV Tnew Page Alloc Failed Analysis
Document5 pages
TSV Tnew Page Alloc Failed Analysis
contactaps
No ratings yet
How To Reset Administrator Password Offline by Using Hiren Boot CD
Document5 pages
How To Reset Administrator Password Offline by Using Hiren Boot CD
Umno Putera
No ratings yet
Anr 7.0.1 (70016001) 20210122 190132
Document8 pages
Anr 7.0.1 (70016001) 20210122 190132
Rodrigo Perlaza
No ratings yet
x86 Stderr
Document35 pages
x86 Stderr
Dar
No ratings yet
Short Guide To Install Oracle 10 On Linux
Document14 pages
Short Guide To Install Oracle 10 On Linux
sudhir_kumar009351
No ratings yet
Chapter 5 Exercises 51 Answer 52 Answer 53 Lottery Scheduling
Document7 pages
Chapter 5 Exercises 51 Answer 52 Answer 53 Lottery Scheduling
Quang Nguyễn Minh
No ratings yet
Citra Log
Document4 pages
Citra Log
Rachman Agung
No ratings yet
Orbtrc 04122016 0923 59
Document1 page
Orbtrc 04122016 0923 59
Nico Baihaqi
No ratings yet
Key Code For MAC
Document1 page
Key Code For MAC
Cyber
No ratings yet
Changes CoDeSys SP RTE
Document41 pages
Changes CoDeSys SP RTE
alinup
No ratings yet
Case Study On Windows PDF
Document15 pages
Case Study On Windows PDF
JPC RTK
100% (1)
U S D C L: SB Torage Evice Ontrol IN Inux
Document5 pages
U S D C L: SB Torage Evice Ontrol IN Inux
Christian Aguas Núñez
No ratings yet
Dump Reading: - Prasanna Rathi
Document23 pages
Dump Reading: - Prasanna Rathi
Geethapriya Gowrishankar
No ratings yet
Virtualization in Cloud Computing Unit IV
Document11 pages
Virtualization in Cloud Computing Unit IV
arpitsharmaafs3
No ratings yet
LT Mahout Exercises
Document4 pages
LT Mahout Exercises
Mypost
No ratings yet
Steps For Applying Digital Signature: Click Here To Download Firefox 48.0.2 Click Here To Download Jre 1.8 Update 121
Document12 pages
Steps For Applying Digital Signature: Click Here To Download Firefox 48.0.2 Click Here To Download Jre 1.8 Update 121
Suresh Kumar
No ratings yet
Module 1 Unix-Ise
Document23 pages
Module 1 Unix-Ise
Atul Jha
No ratings yet
Log
Document38 pages
Log
Ekis Putra
No ratings yet
An Improved Round Robin CPU Scheduling Algorithm B
Document4 pages
An Improved Round Robin CPU Scheduling Algorithm B
Abduallah Mustafa
No ratings yet
Understanding Real-World Concurrency Bugs in Go: Tengfei Tu Xiaoyu Liu
Document14 pages
Understanding Real-World Concurrency Bugs in Go: Tengfei Tu Xiaoyu Liu
Akshay
No ratings yet

2 DE +Installing+Apache+Spark+on+CDH+EC2

Uploaded by

Junaid Sheikh

0% found this document useful (0 votes)

4 views19 pages

Original Title

2_DE_+Installing+Apache+Spark+on+CDH+EC2

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

4 views19 pages

2 DE +Installing+Apache+Spark+on+CDH+EC2

Uploaded by

Junaid Sheikh

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 19

Search inside document

Installing Apache Spark 2 on AWS EC2

Prerequisite: JDK 1.8 on your EC2 instance.

1. Download the Spark 2 CSD file from below link.

wget http://archive.cloudera.com/spark2/csd/SPARK2_ON_YARN-2.3.0.cloudera2.jar

2. move this spark jar to /opt/cloudera/csd/

cp SPARK2_ON_YARN-2.3.0.cloudera2.jar /opt/cloudera/csd/

3. Go to the CSD jar location

cd /opt/cloudera/csd/

ls -ltrh
4. Change the owner using the following command

chown cloudera-scm:cloudera-scm SPARK2_ON_YARN-2.3.0.cloudera2.jar

Verify using the following command.

ls -ltrh

5. Now change the permissions using the chmod command.

chmod 644 SPARK2_ON_YARN-2.3.0.cloudera2.jar

6. Now restart the cloudera-scm-server and agent using the
systemctl command.

systemctl restart cloudera-scm-server

systemctl restart cloudera-scm-agent
systemctl status cloudera-scm-server
7. Now Refresh the Cloudera manager using web UI and log in. (it
take some time)

● Click on the Parcels tab in right side.corner.

● Scroll down and verify a Spark2 parcel is present.
● Click on the Download button in front of Spark2.

● Now click on Distribute.

● Now click on Activate and then click on OK.

● Finally it is done and distributed.

8. Now Scroll up and go back to the Cloudera Manager Home

page.
● Click on the Stale Configuration under the Cloudera management services tab.

● Click on Restart Cloudera Management Services

● It takes some time, Once done click on Finish.

9. Now go to Cloudera manager home and click on add
service.
● Select Spark2 and then click on Continue.

● Integrate with services like. S3, HBase, HDFS, Hive, YARN(MR2 Included), Zookeeper

Or you may get the screen something like below.

● Select History server by clicking the text box provided just below History Server text.

● You will the pop-up window as shown below. Select the checkbox. Click Ok to get back.

● Click on Continue
● Click on Continue

● Click on Continue
● Click on Finish.
10. Now restart YARN by clicking the Stale configuration icon, as shown below

● Then click on Restart Slale Services

● Click on Restart Now

● Once all steps are complete, click on Finish.

10. Back to the Cloudera manager homepage, In case if you are
getting a Health issue as shown below, then follow the steps to
resolve it.

● Click on the Issue ICON then suppress.

11. Create the Spark Lineage directory

Open putty/terminal login to your ec2 instance, go to the root user and run the following
command, as shown below in the screenshot

mkdir /var/log/spark2/lineage
chown spark:spark /var/log/spark2/lineage
chmod 777 /var/log/spark2/lineage
12. The Spark2 job by default reads data from HDFS, so now let’s
create an ec2-user directory in HDFS(Skip if already done)
In putty run the following commands. (run from root user)
sudo su hdfs
hadoop fs -mkdir -p /user/ec2-user
hadoop fs -chown -R ec2-user:ec2-user /user/ec2-user
hadoop fs -chmod -R 777 /user/ec2-user
Now, for your spark jobs, keep your data file in /user/ec2-user/ directory.(you can further create
a directory here and store your data files)

Now, return to ec2-user by running the command: exit

13. To verify the Spark version, run the following command

spark2-submit --version

It should show Version 2.3.0 as shown in the image below.

14. Open Cloudera Manager, Click on YARN from the cluster
services, then Click in Configuration from the Upper Menu bar.
NOTE: These two parameters value should be matched to our configuration.
1. Serach “yarn.nodemanager.resource.memory-mb” and set it to 10 GB > Click Save
changes

2. Search ”yarn.scheduler.maximum-allocation-mb” and set it to 8 GB > Click Save Changes

15. How to open the history server web UI:

Note: Before open History server Web UI. First, you need to configure the Security group of
your instance:

i. Go to the EC2 dashboard and select your instance.

II. Scroll down the Description tab and under your security group click on avdmap.
III. Then go to Inbound > Edit > Add Rule

IV. A new will get added in the list, configure the rule as below
A. Type section - “All TCP”
B. Protocol - TCP.
C. Port range - 0-65535
D. Source - My IP
E. Click on Save.
V. Copy public IP address from the ec2 dashboard.

VII. Now Paste the public IP address into the browser as shown below. The format is

<your EC2 public ip>:18089

Apache Upgrade For Content Server
Document6 pages
Apache Upgrade For Content Server
KotiEswar
No ratings yet
The Little Book of Sitecore® Tips: Volume 1
From Everand
The Little Book of Sitecore® Tips: Volume 1
Neil P Shack
No ratings yet
NoSQL Injection for Elasticsearch
From Everand
NoSQL Injection for Elasticsearch
Gary Drocella
No ratings yet
DevOps. How to build pipelines with Jenkins, Docker container, AWS ECS, JDK 11, git and maven 3?
From Everand
DevOps. How to build pipelines with Jenkins, Docker container, AWS ECS, JDK 11, git and maven 3?
John Edward Cooper Berg
No ratings yet
APKTOOLS 1.5.2 & Apk Tool Install Windows R05-Ibot
Document2 pages
APKTOOLS 1.5.2 & Apk Tool Install Windows R05-Ibot
Heernaan Rodriiguezz
67% (3)
Install Guide D-Link TR-069: Alpha Version
Document7 pages
Install Guide D-Link TR-069: Alpha Version
hacksystem
No ratings yet
Experiment No. 5 Step 1: Install Apache2
Document36 pages
Experiment No. 5 Step 1: Install Apache2
Sagar Padhy
No ratings yet
Install Oracle Enterprise Manager
Document15 pages
Install Oracle Enterprise Manager
JENIFA JEYAKUMAR
100% (1)
Project 5 Icinga
Document25 pages
Project 5 Icinga
vinay kumar Singh
No ratings yet
Install Cloudera Manager Using AMI On Amazon EC2
Document39 pages
Install Cloudera Manager Using AMI On Amazon EC2
Ram Guggul
No ratings yet
Running A Pig Program On The CDH Single Node Cluster On An Aws Ec2 Instance
Document21 pages
Running A Pig Program On The CDH Single Node Cluster On An Aws Ec2 Instance
Ram Guggul
No ratings yet
Lab-Kafka Administration VI
Document197 pages
Lab-Kafka Administration VI
Vaibhav Marathe
No ratings yet
Java Tomcat Setup
Document4 pages
Java Tomcat Setup
shawnqiang
No ratings yet
End To End Deploy in Ubuntu
Document31 pages
End To End Deploy in Ubuntu
Raashid Shahab
No ratings yet
Apex 20.2 With ORDS & Apache Tomcat
Document27 pages
Apex 20.2 With ORDS & Apache Tomcat
Md Shaiduzzaman Shuvo
100% (1)
Hosting Static Website With Docker Container in Aws Ec-2: Smit Dharaiya
Document5 pages
Hosting Static Website With Docker Container in Aws Ec-2: Smit Dharaiya
Dharaiya Text-tiles
No ratings yet
Springboot Docker MysqlRDS
Document11 pages
Springboot Docker MysqlRDS
Rabbani Shaikh
No ratings yet
v2 3 Running+PySpark+on+Jupyter+NoteBook
Document8 pages
v2 3 Running+PySpark+on+Jupyter+NoteBook
Junaid Sheikh
No ratings yet
Multiple VPC Networks
Document20 pages
Multiple VPC Networks
kanedakodama
No ratings yet
Apache Java Tomcat Mod JK
Document30 pages
Apache Java Tomcat Mod JK
viren0307
No ratings yet
HowToInstallEM12cOnODA asPDF
Document21 pages
HowToInstallEM12cOnODA asPDF
astn98
No ratings yet
Deploy, Scale, and Update Your Website On Google Kubernetes Engine
Document19 pages
Deploy, Scale, and Update Your Website On Google Kubernetes Engine
subodh
No ratings yet
Solr Configuration: Guide To Installing Open Source Search Solution Solr On Windows and Linux
Document13 pages
Solr Configuration: Guide To Installing Open Source Search Solution Solr On Windows and Linux
sandeep amilineni
No ratings yet
Oracle 12cR2 Installations
Document48 pages
Oracle 12cR2 Installations
Nasir Mahmood
No ratings yet
Cloud Computing Lab 2
Document4 pages
Cloud Computing Lab 2
Yen-Kai Cheng
No ratings yet
Installation OpenMeetings 5.0.0 On Ubuntu 20.04 Lts
Document17 pages
Installation OpenMeetings 5.0.0 On Ubuntu 20.04 Lts
Luis Agustin Suaña Jala
No ratings yet
Deploying Openstack Lab On GCP-v3
Document10 pages
Deploying Openstack Lab On GCP-v3
Mainak Chakraborty
No ratings yet
Opennebula and Amazon Ec2 PDF
Document12 pages
Opennebula and Amazon Ec2 PDF
rafik03
No ratings yet
Build A Basic CRUD App With Angular 5
Document15 pages
Build A Basic CRUD App With Angular 5
nagarajuvcc123
No ratings yet
Content Server Installation and Configuration
Document35 pages
Content Server Installation and Configuration
Santosh Sarkale
No ratings yet
Cruz Freddy Cloud Based System
Document37 pages
Cruz Freddy Cloud Based System
api-683209730
No ratings yet
Active Directory Backup and Restore1
Document20 pages
Active Directory Backup and Restore1
NagarajuRb
No ratings yet
Squid Squidguard On Centos
Document7 pages
Squid Squidguard On Centos
Zeeshan Muhammad
No ratings yet
Minikube Instalation
Document15 pages
Minikube Instalation
darveshchauhan0001
No ratings yet
Step by Step Oracle 12c Grid Infrastructure - Installation
Document70 pages
Step by Step Oracle 12c Grid Infrastructure - Installation
8A8 - 28 - Huỳnh Ngọc Anh Thư
100% (1)
RHEL Cluster Suite
Document31 pages
RHEL Cluster Suite
Rohit Khurana
No ratings yet
Basic Apache Server Configuration Step by Step
Document3 pages
Basic Apache Server Configuration Step by Step
Jebin Jacob Luke
No ratings yet
Installing Docker On Amazon Linux 2
Document6 pages
Installing Docker On Amazon Linux 2
brayan segundo
No ratings yet
Install Squid Windows
Document19 pages
Install Squid Windows
Nguyễn Quốc Huy
No ratings yet
Oppstartsmanual KV-Multiprog 2
Document12 pages
Oppstartsmanual KV-Multiprog 2
thang doan
No ratings yet
R12 Installation 64 Bit On OEL 5 Update 5
Document18 pages
R12 Installation 64 Bit On OEL 5 Update 5
kkpareek
No ratings yet
Install Wamp SSL PDF
Document9 pages
Install Wamp SSL PDF
Ionel Gherasim
No ratings yet
Install The Guacamole Client
Document5 pages
Install The Guacamole Client
Samuel Zodingliana
No ratings yet
Step by Step Install of Grid Control 10g R2 On Linux: 1. Select Complete Installation
Document7 pages
Step by Step Install of Grid Control 10g R2 On Linux: 1. Select Complete Installation
smart_aix
No ratings yet
Oam Upgrade 10g To 11gr2 Steps
Document69 pages
Oam Upgrade 10g To 11gr2 Steps
api-219602070
No ratings yet
Tools and Sky130 Installation With WSL2
Document37 pages
Tools and Sky130 Installation With WSL2
Darwin Villamizar
No ratings yet
Installing PHP For Dynamic Web Pages
Document11 pages
Installing PHP For Dynamic Web Pages
Asif Iqbal
No ratings yet
Deploy Agent in Oracle Enterprise Manager 12c: (Root@em12c ) # VI /etc/sudoers
Document11 pages
Deploy Agent in Oracle Enterprise Manager 12c: (Root@em12c ) # VI /etc/sudoers
mohit.oracledba
No ratings yet
Oracle 19C MultiTenant Database 1704253431
Document18 pages
Oracle 19C MultiTenant Database 1704253431
deepak23aug
No ratings yet
LFD259 Kubernetes For Developers Version
Document96 pages
LFD259 Kubernetes For Developers Version
ahhung77
No ratings yet
Xrdocs Io CNBNG Tutorials Inception Server Deployment Guide
Document8 pages
Xrdocs Io CNBNG Tutorials Inception Server Deployment Guide
Konstantinos Dimitriou
No ratings yet
ABC
Document40 pages
ABC
Database 1
No ratings yet
Configuration of VVS Development Environment On Oracle Forms / Reports 11gR2
Document14 pages
Configuration of VVS Development Environment On Oracle Forms / Reports 11gR2
Madallin Oprea
No ratings yet
Creating A VM in Google Cloud
Document7 pages
Creating A VM in Google Cloud
Rhugved Takalkar
No ratings yet
Práctica de Laboratorio 26.1.7
Document10 pages
Práctica de Laboratorio 26.1.7
rojas.saldana.armando.sptm
No ratings yet
Capstone Project Final Steps
Document6 pages
Capstone Project Final Steps
Sweety Sweeti
No ratings yet
Hadoop 2.x Installation Guide
Document25 pages
Hadoop 2.x Installation Guide
ahmed_sft
100% (1)
WebLogic Server 11g
Document16 pages
WebLogic Server 11g
moisendiaye245
No ratings yet
Running Cassandra On Eclipse
Document4 pages
Running Cassandra On Eclipse
Sainareshmut
No ratings yet
BigData Challenge - L2 (Spark)
Document3 pages
BigData Challenge - L2 (Spark)
harshapandey112233
No ratings yet
Deploy Applications On Kubernetes
Document12 pages
Deploy Applications On Kubernetes
Miro
No ratings yet
Creating Databases and Data Placement
Document17 pages
Creating Databases and Data Placement
alfiatuz
No ratings yet
Custom MK-SS808 Image
Document4 pages
Custom MK-SS808 Image
gejib
No ratings yet
Chapter02-Accessing The Command Line
Document4 pages
Chapter02-Accessing The Command Line
Shahabuddin Mohammed Ahmed
No ratings yet
Log
Document97 pages
Log
Morteza Rafael
No ratings yet
TCP Keepalive HOWTO: Fabio Busatto
Document16 pages
TCP Keepalive HOWTO: Fabio Busatto
Ali N Chouman
No ratings yet
Q) Explain Process Control Block. Draw The Block Diagram: of Process Transition States
Document23 pages
Q) Explain Process Control Block. Draw The Block Diagram: of Process Transition States
ganeshshimpi125
No ratings yet
Bugreport
Document13 pages
Bugreport
Daniela
No ratings yet
FAQ OSY Updated
Document5 pages
FAQ OSY Updated
Satejsingh Shiledar
No ratings yet
Mapi 32
Document4 pages
Mapi 32
FLORES TORRES KATHERINE MARIA
No ratings yet
TSV Tnew Page Alloc Failed Analysis
Document5 pages
TSV Tnew Page Alloc Failed Analysis
contactaps
No ratings yet
How To Reset Administrator Password Offline by Using Hiren Boot CD
Document5 pages
How To Reset Administrator Password Offline by Using Hiren Boot CD
Umno Putera
No ratings yet
Anr 7.0.1 (70016001) 20210122 190132
Document8 pages
Anr 7.0.1 (70016001) 20210122 190132
Rodrigo Perlaza
No ratings yet
x86 Stderr
Document35 pages
x86 Stderr
Dar
No ratings yet
Short Guide To Install Oracle 10 On Linux
Document14 pages
Short Guide To Install Oracle 10 On Linux
sudhir_kumar009351
No ratings yet
Chapter 5 Exercises 51 Answer 52 Answer 53 Lottery Scheduling
Document7 pages
Chapter 5 Exercises 51 Answer 52 Answer 53 Lottery Scheduling
Quang Nguyễn Minh
No ratings yet
Citra Log
Document4 pages
Citra Log
Rachman Agung
No ratings yet
Orbtrc 04122016 0923 59
Document1 page
Orbtrc 04122016 0923 59
Nico Baihaqi
No ratings yet
Key Code For MAC
Document1 page
Key Code For MAC
Cyber
No ratings yet
Changes CoDeSys SP RTE
Document41 pages
Changes CoDeSys SP RTE
alinup
No ratings yet
Case Study On Windows PDF
Document15 pages
Case Study On Windows PDF
JPC RTK
100% (1)
U S D C L: SB Torage Evice Ontrol IN Inux
Document5 pages
U S D C L: SB Torage Evice Ontrol IN Inux
Christian Aguas Núñez
No ratings yet
Dump Reading: - Prasanna Rathi
Document23 pages
Dump Reading: - Prasanna Rathi
Geethapriya Gowrishankar
No ratings yet
Virtualization in Cloud Computing Unit IV
Document11 pages
Virtualization in Cloud Computing Unit IV
arpitsharmaafs3
No ratings yet
LT Mahout Exercises
Document4 pages
LT Mahout Exercises
Mypost
No ratings yet
Steps For Applying Digital Signature: Click Here To Download Firefox 48.0.2 Click Here To Download Jre 1.8 Update 121
Document12 pages
Steps For Applying Digital Signature: Click Here To Download Firefox 48.0.2 Click Here To Download Jre 1.8 Update 121
Suresh Kumar
No ratings yet
Module 1 Unix-Ise
Document23 pages
Module 1 Unix-Ise
Atul Jha
No ratings yet
Log
Document38 pages
Log
Ekis Putra
No ratings yet
An Improved Round Robin CPU Scheduling Algorithm B
Document4 pages
An Improved Round Robin CPU Scheduling Algorithm B
Abduallah Mustafa
No ratings yet
Understanding Real-World Concurrency Bugs in Go: Tengfei Tu Xiaoyu Liu
Document14 pages
Understanding Real-World Concurrency Bugs in Go: Tengfei Tu Xiaoyu Liu
Akshay
No ratings yet

2 DE +Installing+Apache+Spark+on+CDH+EC2

Uploaded by

Copyright:

Available Formats

You might also like

2 DE +Installing+Apache+Spark+on+CDH+EC2

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

2 DE +Installing+Apache+Spark+on+CDH+EC2

Uploaded by

Copyright:

Available Formats

Installing Apache Spark 2 on AWS EC2

Prerequisite: JDK 1.8 on your EC2 instance.

1. Download the Spark 2 CSD file from below link.

2. move this spark jar to /opt/cloudera/csd/

3. Go to the CSD jar location

chown cloudera-scm:cloudera-scm SPARK2_ON_YARN-2.3.0.cloudera2.jar

Verify using the following command.

5. Now change the permissions using the chmod command.

chmod 644 SPARK2_ON_YARN-2.3.0.cloudera2.jar

systemctl restart cloudera-scm-server

● Click on the ​Parcels ​tab in right side.corner.

● Now click on ​Distribute​.

● Now click on ​Activate ​and then click on ​OK​.

8. Now Scroll up and go back to the Cloudera Manager Home

● Click on ​Restart Cloudera Management Services

● It takes some time, Once done click on ​Finish​.

Or you may get the screen something like below.

● Then click on ​Restart Slale Services

● Once all steps are complete, click on ​Finish​.

● Click on the ​Issue ​ICON then ​suppress​.

11. Create the Spark Lineage directory

Now, return to ec2-user by running the command: ​exit

13. To verify the Spark version, run the following command

It should show Version 2.3.0 as shown in the image below.

2. Search​ ”yarn.scheduler.maximum-allocation-mb” ​and set it to​ 8 GB > ​Click​ Save Changes

i. Go to the EC2 dashboard and select your instance.

<your EC2 public ip>:18089

You might also like

● Click on the Parcels tab in right side.corner.

● Now click on Distribute.

● Now click on Activate and then click on OK.

● Click on Restart Cloudera Management Services

● It takes some time, Once done click on Finish.

● Then click on Restart Slale Services

● Once all steps are complete, click on Finish.

● Click on the Issue ICON then suppress.

Now, return to ec2-user by running the command: exit

2. Search ”yarn.scheduler.maximum-allocation-mb” and set it to 8 GB > Click Save Changes