Intro To Linear Regression

This document provides an introduction to linear regression through examples. It discusses how Francis Galton discovered the concept of regression in the 1800s when studying the relationship between parents' and children's heights. Linear regression aims to draw a line that minimizes the vertical distance between the data points and the line. Even with just two data points, regression finds the line closest to both points. The goal with multiple data points is to minimize the total distance between all points and the regression line. R uses formulas of the form (y ~ x) to perform linear regression, allowing additional predictors with '+'. An example in R reinforces these core concepts.

Uploaded by

HITESH PONNOJU

Introduction to Linear Regression

Reading Assignment

Chapters 2 & 3 of Introduction to Statistical Learning by Gareth James, et al.

History

This all started in the 1800s with a guy named Francis Galton. Galton was studying the relationship between parents and their children. In particular, he investigated the relationship between the heights of fathers and their sons.

What he discovered was that a man's son tended to be roughly as tall as his father. However, Galton's breakthrough was noticing that the son's height tended to be closer to the overall average height of all people than his father's height was.

Example

Let's take Shaquille O'Neal as an example. Shaq is really tall: 7 ft 1 in (about 2.16 meters).

If Shaq has a son, chances are he'll be pretty tall too. However, Shaq is such an anomaly that there is also a very good chance that his son will not be as tall as Shaq.

Turns out this is the case: Shaq's son is pretty tall (6 ft 7 in), but not nearly as tall as his dad.

Galton called this phenomenon regression, as in "A father's son's height tends to regress (or drift towards) the mean (average) height."
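
To make the idea of drifting towards the mean concrete, here is a minimal simulation sketch. Everything in it is an assumption made for illustration: the heights are simulated rather than Galton's data, and the model (a son inherits half of his father's deviation from the average, plus random variation) is simply one convenient way to build the effect in.

```r
# Simulated heights in inches; every number here is an assumption, not Galton's data.
set.seed(1)
n <- 1000
mean_height <- 69
father <- rnorm(n, mean = mean_height, sd = 2.5)

# Assumed model: a son inherits half of his father's deviation from the mean,
# plus independent random variation.
son <- mean_height + 0.5 * (father - mean_height) + rnorm(n, sd = 2)

tall <- father > 72                  # pick out the unusually tall fathers
avg_tall_father <- mean(father[tall])
avg_son_of_tall <- mean(son[tall])

# Compare the tall fathers, their sons, and the overall average.
print(c(fathers = avg_tall_father, sons = avg_son_of_tall, overall = mean_height))
```

Under these assumptions, the sons of tall fathers are still taller than average on the whole, but their average sits between their fathers' average and the overall mean.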

Example

Let's take the simplest possible example: calculating a regression with only 2 data points.

All we're trying to do when we calculate our regression line is draw a line that's as close to every dot as possible.

For classic linear regression, or the "Least Squares Method", you only measure the closeness in the "up and down" (vertical) direction.
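
The two-point case can be sketched in R as follows. The two points are made up for illustration. With only two points, the least squares line passes through both of them, so the vertical gaps (residuals) are zero up to rounding.

```r
# Two made-up data points: (1, 3) and (3, 7)
x <- c(1, 3)
y <- c(3, 7)

fit <- lm(y ~ x)      # ordinary least squares fit
coef(fit)             # intercept and slope of the fitted line
residuals(fit)        # vertical gaps between each point and the line
```

Working it out by hand, the slope is (7 - 3) / (3 - 1) = 2 and the intercept is 3 - 2 * 1 = 1, which is what `coef(fit)` should report.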

Now wouldn't it be great if we could apply this same concept to a graph with more than just two data points?

By doing this, we could take multiple men and their sons' heights and do things like tell a man how tall we expect his son to be... before he even has a son!
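
As a rough sketch of that kind of prediction, the snippet below fits a line to a handful of father/son heights and asks for the expected height of a new father's son. The data frame `heights` and all of its values are invented for illustration, not real measurements.

```r
# Made-up father/son heights in inches, for illustration only
heights <- data.frame(
  father = c(65, 67, 68, 70, 72, 74),
  son    = c(67, 67, 69, 70, 71, 72)
)

fit <- lm(son ~ father, data = heights)

# Expected height of a son whose father is 71 inches tall
new_father <- data.frame(father = 71)
predicted <- predict(fit, newdata = new_father)
predicted
```

`predict()` needs the new data in a data frame whose column name matches the predictor used in the formula, which is why `new_father` has a `father` column.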

Our goal with linear regression is to minimize the vertical distance between all the data points and our line.

So in determining the best line, we are attempting to minimize the total vertical distance between the points and our line.
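
One way to see what "minimize" means here is to measure the total squared vertical distance for the fitted line and for some other candidate line. This is a minimal sketch with made-up data. The helper `sse()` and the alternative line (intercept 0, slope 2.1) are hypothetical choices for illustration.

```r
x <- c(1, 2, 3, 4, 5)
y <- c(2.1, 3.9, 6.2, 7.8, 10.1)   # made-up values

# Sum of squared vertical distances between the points and the line a + b * x
sse <- function(a, b) sum((y - (a + b * x))^2)

fit <- lm(y ~ x)
a_hat <- unname(coef(fit)[1])
b_hat <- unname(coef(fit)[2])

sse_fit   <- sse(a_hat, b_hat)     # the least squares line
sse_other <- sse(0, 2.1)           # a hand-picked alternative line

c(fit = sse_fit, other = sse_other)
```

No other straight line can have a smaller sum of squared vertical distances than the one `lm()` returns, so `sse_fit` is never larger than `sse_other`.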

There are lots of different ways to measure this distance (sum of squared errors, sum of absolute errors, etc.), but all these methods share the general goal of minimizing it.
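
The two criteria named above can be compared directly. The sketch below uses made-up data with one outlier, fits the least squares line with `lm()`, and then minimizes the sum of absolute errors numerically with the general-purpose optimizer `optim()`. Using `optim()` for this is an assumption made to keep the example in base R, and it only gives an approximate minimizer.

```r
x <- c(1, 2, 3, 4, 5, 6)
y <- c(2, 4, 6, 8, 10, 30)         # made-up values; the last point is an outlier

sse <- function(p) sum((y - (p[1] + p[2] * x))^2)   # sum of squared errors
sae <- function(p) sum(abs(y - (p[1] + p[2] * x)))  # sum of absolute errors

ols <- unname(coef(lm(y ~ x)))     # minimizes the sum of squared errors

# Numerically minimize the sum of absolute errors, starting from the OLS line
lad <- optim(par = ols, fn = sae)$par

rbind(ols = ols, lad = lad)
```

The two rows will generally differ: each line is the better one under its own criterion, and squaring the errors gives a large miss like the outlier more influence on the fitted line.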

Using R for Linear Regression

Formulas in R take the form (y ~ x). To add more predictor variables, just use the + sign, e.g. (y ~ x + z).
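
A short sketch of both formula shapes, using the `mtcars` data set that ships with base R. The choice of data set and of the columns `mpg`, `wt`, and `hp` is an assumption made for illustration and does not come from the slides.

```r
# mtcars ships with base R; the choice of columns is only for illustration
fit_one <- lm(mpg ~ wt, data = mtcars)        # one predictor:  y ~ x
fit_two <- lm(mpg ~ wt + hp, data = mtcars)   # two predictors: y ~ x + z

coef(fit_one)   # intercept plus one slope
coef(fit_two)   # intercept plus one slope per predictor
```

The left side of `~` is the response and the right side lists the predictors. Each term added with `+` contributes one more coefficient to the fitted model.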

Example with R

Let's go to RStudio and begin to explore an example, then you'll have a project to test your understanding!
