Introduction to Data Science with R Programming

Dr. D. Vimal Kumar

Associate Professor
Department of Computer Science
Nehru Arts and Science College
Coimbatore
TABLE OF CONTENTS
Standard deviation
Variance
Linear Regression
Standard deviation
• Standard deviation (SD) is a measure of how varied
the data in a data set are.
• Mathematically, it measures how far, on average, the
values lie from the mean of the data set.
• A standard deviation close to 0 indicates that
the data points tend to be very close to the mean of
the data set.
• A high standard deviation indicates that the data
points are spread out over a wider range of values.
Procedure
• To calculate the standard deviation of the numbers:
1. Work out the mean (the simple average of the
numbers).
2. Then for each number: subtract the mean and
square the result.
3. Then work out the mean of those squared
differences.
4. Take the square root of that.
# Sample data
Vec <- c(4, 6, 8, 4, 10)
# sd() returns the sample standard deviation (it divides by n - 1)
S <- sd(Vec)
print(S)
Steps
• Calculate the mean.
• Subtract the mean from each observation.
• Square each of the resulting differences.
• Add these squared results together.
• Divide this total by the number of observations
(variance, S²).
• Take the positive square root (standard deviation, S).
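The steps above can be sketched in R as follows. This is a minimal illustration that reuses the vector from the earlier snippet; the variable names are chosen here and are not from the slides. Dividing by the number of observations gives the population form, whereas R's built-in sd() divides by n - 1, so the sample form is computed as well for comparison.

```r
# Data reused from the earlier sd() snippet
vec <- c(4, 6, 8, 4, 10)
n <- length(vec)

mean_value   <- sum(vec) / n        # 1. calculate the mean
deviations   <- vec - mean_value    # 2. subtract the mean from each observation
squared      <- deviations^2        # 3. square each difference
total        <- sum(squared)        # 4. add the squared results together
variance_pop <- total / n           # 5. divide by the number of observations
sd_pop       <- sqrt(variance_pop)  # 6. take the positive square root

# Sample form: divide by n - 1 instead of n (this is what sd() computes)
sd_sample <- sqrt(total / (n - 1))

print(sd_pop)
print(sd_sample)
print(sd(vec))
```

The last two printed values agree, which shows that sd() implements the n - 1 variant rather than the divide-by-n steps listed on the slide.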
Formulae
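Written out under the assumption that the steps listed earlier describe the population form (dividing by the number of observations n), with x_i for the observations and x̄ for their mean (notation assumed here), the formulae are:

```latex
\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i
\qquad
S^2 = \frac{1}{n}\sum_{i=1}^{n} \left(x_i - \bar{x}\right)^2
\qquad
S = \sqrt{S^2}
```

The sample form replaces the divisor n with n - 1; that is the version R's var() and sd() compute.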
SD Example
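As an illustrative worked example, assuming the population form of the steps and using the values 4, 6, 8, 4, 10 from the R snippet earlier:

```latex
\begin{align*}
\bar{x} &= \frac{4 + 6 + 8 + 4 + 10}{5} = 6.4 \\
\sum_{i=1}^{5} (x_i - \bar{x})^2 &= 5.76 + 0.16 + 2.56 + 5.76 + 12.96 = 27.2 \\
S^2 &= \frac{27.2}{5} = 5.44 \\
S &= \sqrt{5.44} \approx 2.33
\end{align*}
```

Dividing by n - 1 = 4 instead gives a variance of 6.8 and a standard deviation of about 2.61, which matches the result of sd() in R.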
Variate
• A variate is a quantity that may take any of the values of a
specified set with a specified relative frequency or probability.
The variate is therefore often known as a random variable.
• Univariate data consists of only one variable.
The analysis of univariate data is thus the simplest form of
analysis, since it deals with only one quantity that
changes.
• Bivariate data involves two variables per observation, analysed
simultaneously. Its analysis is slightly more complex than that
of univariate data.
• Multivariate data involves more than two variables per
observation. Multivariate data is usually used for
explanatory purposes.
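The three kinds of data can be sketched in R as follows. The heights and weights are the first five values from the regression example in this deck; the age column and all variable names are hypothetical, added only to illustrate a third variable.

```r
height_cm <- c(151, 174, 138, 186, 128)
weight_kg <- c(63, 81, 56, 91, 47)
age_years <- c(21, 34, 19, 45, 17)  # hypothetical values for illustration

# Univariate: a single variable per observation
univariate <- height_cm

# Bivariate: two variables per observation
bivariate <- data.frame(height_cm, weight_kg)

# Multivariate: more than two variables per observation
multivariate <- data.frame(height_cm, weight_kg, age_years)

print(summary(univariate))                               # one-variable summary
print(cor(bivariate$height_cm, bivariate$weight_kg))     # relationship between two variables
print(cor(multivariate))                                 # pairwise correlations across all variables
```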
ggplot2
The ggplot2 package, created by Hadley Wickham,
offers a powerful graphics language for creating
elegant and complex plots. Its popularity in
the R community has exploded in recent years. ...
There is a helper function called qplot() (for quick plot)
that can hide much of this complexity when creating
standard graphs.
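A minimal sketch of a ggplot2 scatter plot, assuming the package is installed. It reuses the height and weight values from the regression example in this deck; the data frame and column names are chosen here for illustration. It calls ggplot() directly rather than qplot(), since qplot() is deprecated as of ggplot2 3.4.0.

```r
library(ggplot2)

# Heights (cm) and weights (kg) from the regression example
people <- data.frame(
  height_cm = c(151, 174, 138, 186, 128, 136, 179, 163, 152, 131),
  weight_kg = c(63, 81, 56, 91, 47, 57, 76, 72, 62, 48)
)

# Build the plot layer by layer: points first, then a fitted straight line
p <- ggplot(people, aes(x = height_cm, y = weight_kg)) +
  geom_point(colour = "blue", size = 3) +
  geom_smooth(method = "lm", formula = y ~ x, se = FALSE) +
  labs(title = "Height & Weight", x = "Height in cm", y = "Weight in Kg")

print(p)
```

Each `+` adds a layer or a setting to the plot object, which is what lets the same grammar describe both simple and complex graphics.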
Linear Regression
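# Heights in cm (predictor) and weights in kg (response)
x <- c(151, 174, 138, 186, 128, 136, 179, 163, 152, 131)
y <- c(63, 81, 56, 91, 47, 57, 76, 72, 62, 48)
# Fit the model weight ~ height and print its summary
relation <- lm(y ~ x)
print(summary(relation))
# Predict the weight of a person who is 170 cm tall
a <- data.frame(x = 170)
result <- predict(relation, a)
print(result)
# Give the chart file a name.
png(file = "linearregression.png")
# Plot weight (horizontal axis) against height (vertical axis)
plot(y, x, col = "blue", main = "Height & Weight Regression",
     cex = 1.3, pch = 16, xlab = "Weight in Kg", ylab = "Height in cm")
# Add the fitted line; abline() is called after plot(), not passed as an argument
abline(lm(x ~ y))
# Save the file.
dev.off()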
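The fitted model is a straight line, weight = intercept + slope × height. As a small sketch of how the prediction arises from the regression example's data, the coefficients can be pulled out with coef() and the prediction for 170 cm reproduced by hand; the names intercept, slope, manual and predicted are chosen here for illustration.

```r
x <- c(151, 174, 138, 186, 128, 136, 179, 163, 152, 131)  # heights in cm
y <- c(63, 81, 56, 91, 47, 57, 76, 72, 62, 48)            # weights in kg
relation <- lm(y ~ x)

# Extract the two coefficients of the fitted line
coefs     <- coef(relation)
intercept <- coefs[["(Intercept)"]]
slope     <- coefs[["x"]]

# Prediction by hand versus predict()
manual    <- intercept + slope * 170
predicted <- unname(predict(relation, data.frame(x = 170)))

print(c(intercept = intercept, slope = slope))
print(manual)
print(predicted)
```

The two printed predictions are identical, which confirms that predict() simply evaluates the fitted line at the new height.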
Thank You
Any Queries
