ML 1 Project

This document provides instructions for Lab 5, which involves implementing and analyzing multiple regression methods on a machine learning benchmark dataset. Students must use nearest neighbor regression, linear regression, regression forest, and Gaussian process regression models. Results should be presented in a 3000-word PDF report including code. Models will be evaluated on accuracy, complexity, and sensitivity to hyperparameters. A toy problem must also be designed and used to validate the implementations. The SARCOS robotics dataset on Moodle will be used, with the task being to predict motor torque from other variables.

Uploaded by

Chaitanya Sanga

Lab 5

(Deadline 18:00 10/01/2020)

Task
You will be provided with a machine learning benchmark data set (details below). The task
focuses on the implementation and critical analysis of multiple regression methods. You should
implement, evaluate, and analyse each of the following algorithms:
● Nearest neighbour
● Linear regression
● Regression forest
● Gaussian process

The results should be presented as a report with a 3000-word limit, submitted as a PDF document.
Please also include your code (Jupyter notebook and/or plain .py files).

Mark scheme
This project is worth 60% of your marks for the unit:
● 10 marks for the method,
● 20 marks for designing and validating on a toy problem (see below),
● 20 marks for the experiments,
● 10 marks for the analysis,
for a total of 60 marks.

If a piece of work is submitted after the submission date (and no extension has been explicitly
granted by the Director of Studies), the maximum possible mark will be 40% of the full mark. If
work is submitted more than five working days after the submission date, the student will receive
zero marks.

Data set
Included on Moodle is the SARCOS data set, a regression problem where the task is to predict
the torque of one motor of a robotic arm given the state of its joints: the position,
velocity and acceleration for each of 7 degrees of freedom.

The data set is provided as a single CSV file; you are responsible for splitting it into training
and test sets and for hyperparameter learning. Each row is an exemplar, and each column a feature.
Your task is to predict the last column (#22) given the first 21 columns. Be aware that you will not want to use
all of the exemplars when training a Gaussian process as it will get too slow.
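
One possible way to organise the split is sketched below. It is only an illustration, not a required approach: the function name, the 20% validation and test fractions, the cap of 2000 exemplars for the Gaussian process, and the seed are all assumptions chosen for the example. The function works on row indices, so it can be applied to the data however the file is loaded.

```python
import random


def split_indices(n_rows, val_frac=0.2, test_frac=0.2, gp_max=2000, seed=0):
    """Shuffle row indices once and cut them into disjoint sets.

    Every default value here is an illustrative assumption, not a
    recommendation from the lab brief.
    """
    rng = random.Random(seed)          # fixed seed so the split is repeatable
    idx = list(range(n_rows))
    rng.shuffle(idx)

    n_test = int(n_rows * test_frac)
    n_val = int(n_rows * val_frac)

    test = idx[:n_test]                    # held out until the final evaluation
    val = idx[n_test:n_test + n_val]       # used for hyperparameter learning
    train = idx[n_test + n_val:]           # used to fit the models

    # Exact Gaussian process training scales cubically with the number of
    # exemplars, so keep only a random subset of the training rows for it.
    # `train` is already shuffled, so its first gp_max entries are random.
    gp_train = train[:gp_max]
    return train, val, test, gp_train
```

For example, `split_indices(1000, gp_max=100)` returns 600 training indices, 200 validation indices, 200 test indices, and a 100-row subset of the training indices for the Gaussian process.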

Suggested document structure
Please structure your report sensibly. We strongly recommend that you follow the structure
suggested below.

Section 1: Introduction
Explain the problem.

Section 2: Methods
Explain regression forests and Gaussian process regression. Include a discussion of
why (or why not) you would use these algorithms. Note that you do not need to discuss
nearest neighbours or linear regression.

Section 3: Validation on a toy problem

Design a toy problem and provide a discussion on validating your implementations on
this problem. Please explain the rationale of your toy problem design and how it does, or
does not, demonstrate that you have implemented your algorithm correctly.

A toy problem is used to debug, test and validate a machine learning algorithm; it will
usually have the following properties:
● Low dimensional (2 or 3 dimensions), so it can be visualised.
● Synthetic, so the correct answer is known and there is no noise.
● Designed so that the behaviour of the algorithms being tested is predictable, so
you will know if something is wrong.
● Very simple, e.g. you could type it into a calculator in a few seconds.

Note that a single toy problem can cover all four algorithms.
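
As an illustration of the properties listed above, here is a minimal sketch of one possible toy problem together with a check for a nearest neighbour implementation. The target function y = 2*x1 - x2 + 1, the 5 x 5 grid, and the function names are assumptions made for this example; your own design and rationale may differ.

```python
from itertools import product


def make_toy_problem(grid=5):
    """Noise-free 2-D data on a grid x grid lattice with y = 2*x1 - x2 + 1.

    The target function and grid size are illustrative assumptions.
    """
    X = [(float(a), float(b)) for a, b in product(range(grid), repeat=2)]
    y = [2.0 * a - b + 1.0 for a, b in X]
    return X, y


def nearest_neighbour_predict(X_train, y_train, query):
    """1-nearest-neighbour regression using squared Euclidean distance."""
    best = min(
        range(len(X_train)),
        key=lambda i: sum((p - q) ** 2 for p, q in zip(X_train[i], query)),
    )
    return y_train[best]


X, y = make_toy_problem()

# Check 1: querying a training point must return its own target exactly.
assert all(nearest_neighbour_predict(X, y, x) == t for x, t in zip(X, y))

# Check 2: a query close to (1, 2) must return the target stored at (1, 2),
# which can be worked out by hand: 2*1 - 2 + 1 = 1.
assert nearest_neighbour_predict(X, y, (1.1, 1.9)) == 1.0
```

Because this target is exactly linear and noise-free, a correct linear regression should recover the weights 2 and -1 and the intercept 1 up to rounding error, which gives a second hand-checkable test on the same data.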

Section 4: Experiments and analysis

Please report your experimental results. This could include:
1) Your hyperparameter selection strategies.
2) Performance comparison of different algorithms in terms of accuracy,
computational complexity, etc.
3) Analysis: How are the results influenced by different hyperparameter choices?
Why does one algorithm perform better than the others on the given dataset?
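
One common hyperparameter selection strategy is k-fold cross-validation over a list of candidate values. The sketch below shows the idea under stated assumptions: `fit_predict` is a hypothetical callable that you would supply for each algorithm, mean squared error is assumed as the score, and the rows are assumed to have been shuffled already. It is meant to be run on the training portion only, so that the test set stays untouched.

```python
def k_fold_splits(n_rows, n_folds=5):
    """Yield (train_idx, val_idx) pairs; each row is validated exactly once.

    Folds are interleaved, so the rows are assumed to be shuffled already.
    """
    folds = [list(range(i, n_rows, n_folds)) for i in range(n_folds)]
    for i, val in enumerate(folds):
        train = [j for k, fold in enumerate(folds) if k != i for j in fold]
        yield train, val


def mean_squared_error(y_true, y_pred):
    """Average squared difference between targets and predictions."""
    return sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true)


def select_hyperparameter(X, y, candidates, fit_predict, n_folds=5):
    """Return (best_candidate, scores); scores maps candidate -> mean MSE.

    fit_predict(X_train, y_train, X_val, value) is a hypothetical callable
    that trains a model with hyperparameter `value` and returns predictions
    for X_val.
    """
    scores = {}
    for value in candidates:
        fold_errors = []
        for train, val in k_fold_splits(len(X), n_folds):
            preds = fit_predict(
                [X[i] for i in train], [y[i] for i in train],
                [X[i] for i in val], value,
            )
            fold_errors.append(mean_squared_error([y[i] for i in val], preds))
        scores[value] = sum(fold_errors) / len(fold_errors)
    best = min(scores, key=scores.get)   # lowest mean validation error wins
    return best, scores
```

The returned `scores` dictionary can also be reported directly, since it shows how sensitive each algorithm is to the hyperparameter being varied.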

Source
The data set was originally obtained from http://gaussianprocess.org/gpml/data/
