University of Queensland, School of Information Technology and Electrical Engineering

Programming with Mahout

Outline:
1, Target and introduction
2, How to install Mahout under Linux
3, Using Mahout to show KMeans display
4, Reference

Target and Introduction

For students:

Write a simple example of KMeans using Mahout
Review the result

Write a simple example of KMeans using Mahout

1, Set up the environment

To install Mahout, please read the previous tutorial. This tutorial assumes that Hadoop is up and running and that Eclipse is installed.

1.1, Install Maven

Step1. Update the package lists, then upgrade the installed packages

sudo apt-get update
sudo apt-get upgrade

Step2. Search for the Maven package in Ubuntu with the apt-cache command

sudo apt-cache search maven

Step3. Install maven

sudo apt-get install maven2

Step4. Check installation

mvn -version

Step5. Add the Maven repository path to the Eclipse workspace

mvn -Declipse.workspace=<path-to-eclipse-workspace> eclipse:add-maven-repo

1.2. Install SVN

sudo apt-get install subversion

1.2.1 Check out from subversion (the files are stored in a folder named trunk under the current directory, by default the home folder):
svn co http://svn.apache.org/repos/asf/mahout/trunk
1.2.2 Change the checked-out project folder name to MAHOUT (or any name you like):
mv trunk MAHOUT

1.2.3 Go to the MAHOUT directory and run a clean install (this will take a while):

mvn clean install -DskipTests=true

1.3. Setting up in Eclipse

1.3.1 Install the m2eclipse plugin (Maven integration for Eclipse)
Eclipse: Help->Install New Software->Add http://m2eclipse.sonatype.org/sites/m2e
1.3.2 Import the Maven project (that is, the MAHOUT folder you just checked out)
File->Import->Maven->Existing Maven Projects->Next
Select the MAHOUT directory as the root directory, see Figure 1:

Figure 1

1.4, Mahout Display

Mahout allows us to run iterative MapReduce jobs during processing. This is especially useful for data mining problems such as KMeans, which repeats the same step until the clusters settle. Here we use some built-in examples to show how Mahout displays clusters of random sample data.
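To make the iterative nature of KMeans concrete, here is a minimal sketch in plain Java. It does not use the Mahout API or MapReduce; the class name KMeansSketch and all of its method names are hypothetical and chosen only for illustration. Each call to iterate plays the role of one pass over the data: every point is assigned to its nearest centre, then each centre moves to the mean of its points.

```java
import java.util.Arrays;

/** Hypothetical plain-Java sketch of KMeans; it does not use the Mahout API. */
public class KMeansSketch {

    /** Squared Euclidean distance between two points of equal dimension. */
    public static double squaredDistance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return sum;
    }

    /** Index of the centre closest to the point (the assignment step). */
    public static int nearest(double[] point, double[][] centres) {
        int best = 0;
        for (int c = 1; c < centres.length; c++) {
            if (squaredDistance(point, centres[c]) < squaredDistance(point, centres[best])) {
                best = c;
            }
        }
        return best;
    }

    /** One iteration: assign every point, then move each centre to the mean of its points. */
    public static double[][] iterate(double[][] points, double[][] centres) {
        int k = centres.length;
        int dim = centres[0].length;
        double[][] sums = new double[k][dim];
        int[] counts = new int[k];
        for (double[] p : points) {
            int c = nearest(p, centres);
            counts[c]++;
            for (int i = 0; i < dim; i++) {
                sums[c][i] += p[i];
            }
        }
        double[][] next = new double[k][];
        for (int c = 0; c < k; c++) {
            if (counts[c] == 0) {
                next[c] = centres[c].clone(); // an empty cluster keeps its old centre
            } else {
                next[c] = new double[dim];
                for (int i = 0; i < dim; i++) {
                    next[c][i] = sums[c][i] / counts[c];
                }
            }
        }
        return next;
    }

    /** Repeat the iteration until the centres stop changing or maxIterations is reached. */
    public static double[][] cluster(double[][] points, double[][] initial, int maxIterations) {
        double[][] centres = initial;
        for (int it = 0; it < maxIterations; it++) {
            double[][] next = iterate(points, centres);
            if (Arrays.deepEquals(next, centres)) {
                break; // converged: no centre moved
            }
            centres = next;
        }
        return centres;
    }

    public static void main(String[] args) {
        double[][] points = {{1, 1}, {1, 2}, {9, 9}, {10, 9}};
        double[][] centres = cluster(points, new double[][] {{0, 0}, {10, 10}}, 10);
        System.out.println(Arrays.deepToString(centres));
    }
}
```

The loop in cluster is the part that an iterative job repeats: the output centres of one pass become the input of the next. The exact-equality convergence test is a simplification for this sketch; a distance threshold is the more usual choice.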

In this tutorial, we will find the example under mahout-examples:

Now that the Mahout project is in our Eclipse workspace, we normally have the latest version of Mahout, which is convenient for studying and developing Mahout applications. The source code for this tutorial can be downloaded from our website.

File Name              Description
ClustersFilter.java    This Java file implements the PathFilter class of Mahout 0.6
Display.java           Show the results
Graphic.java           Initialize all the methods needed in Display.java
ReadData.java          Read CSV file
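The tutorial's ReadData.java is described only as reading a CSV file, and its source is not reproduced here. As a rough idea of what such a step might involve, the following sketch parses comma-separated lines into numeric points. The class name CsvPointsSketch and the method parse are hypothetical, and the assumption that every field is a number is ours, not the tutorial's.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical sketch of a CSV-to-points step; not the tutorial's ReadData.java. */
public class CsvPointsSketch {

    /**
     * Turns lines such as "1.0,2.0" into double arrays.
     * Blank lines are skipped; every field is assumed to be numeric.
     */
    public static List<double[]> parse(List<String> lines) {
        List<double[]> points = new ArrayList<>();
        for (String raw : lines) {
            String line = raw.trim();
            if (line.isEmpty()) {
                continue;
            }
            String[] fields = line.split(",");
            double[] point = new double[fields.length];
            for (int i = 0; i < fields.length; i++) {
                point[i] = Double.parseDouble(fields[i].trim());
            }
            points.add(point);
        }
        return points;
    }

    public static void main(String[] args) {
        // In-memory lines stand in for a file here; for a real file, the lines
        // could come from java.nio.file.Files.readAllLines(path).
        List<String> lines = Arrays.asList("1.0,2.0", "", "3.5, 4.5");
        for (double[] p : parse(lines)) {
            System.out.println(Arrays.toString(p));
        }
    }
}
```

Keeping the parsing separate from the file access makes the step easy to check without any input file on disk.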

To write your own code, add all the Mahout 0.6 libraries to your project. You can find them under the folder

Reference:

File Description

ClustersCanopy.java:

File Name                    Description
DisplayCanopy.java           https://cwiki.apache.org/confluence/display/MAHOUT/Canopy+Clustering
DisplayDirichlet.java        https://cwiki.apache.org/confluence/display/MAHOUT/Dirichlet+Process+Clustering
DisplayFuzzyKMeans.java      https://cwiki.apache.org/confluence/display/MAHOUT/Fuzzy+KMeans
DisplayKMeans.java           https://cwiki.apache.org/confluence/display/MAHOUT/K-Means+Clustering
DisplayMeansShift.java       https://cwiki.apache.org/confluence/display/MAHOUT/Mean+Shift+Clustering
DisplaySpectralKMeans.java   https://cwiki.apache.org/confluence/display/MAHOUT/Spectral+Clustering
