
SOMYA RATHI

(20BCE2323)

PROBLEM 4

AIM:
The aim of this problem is to calculate the measures of central tendency and
dispersion of the given data.

FORMULAE:
1. $\text{mean} = \dfrac{\sum f x}{\sum f}$

x=class mark ; f=frequency
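
As a minimal sketch of the mean formula, the class marks and frequencies from this report's frequency table can be plugged in directly. The lowercase names `x`, `f` and `grouped_mean` are illustrative choices, not the names used in the session; `weighted.mean()` from base R is used only as a cross-check.

```r
# Class marks and frequencies taken from the frequency table in this report
x <- c(5, 15, 25, 35, 45, 55, 65)
f <- c(0, 9, 22, 85, 95, 34, 5)

# Grouped mean: sum(f * x) / sum(f)
grouped_mean <- sum(f * x) / sum(f)
print(grouped_mean)         # 40.52

# Cross-check with base R's weighted.mean()
print(weighted.mean(x, f))  # 40.52
```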


2. $\text{median} = l + \left[\dfrac{\frac{N}{2} - Cf}{f}\right] \times c$

l = lower limit of median class

N=total number of observations

Cf=cumulative frequency of class preceding the median class

f=frequency of median class

c=class length
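
The median formula can be sketched step by step as follows, assuming equal class widths and the frequencies from this report. The names `lower`, `c_len`, `k` and `cf_prev` are illustrative; `c_len` is used instead of `c` to avoid masking R's `c()` function.

```r
# Lower class limits, frequencies and class length from this report's table
lower <- seq(0, 60, by = 10)
f     <- c(0, 9, 22, 85, 95, 34, 5)
c_len <- 10

N  <- sum(f)      # total number of observations
cf <- cumsum(f)   # cumulative frequencies

# Median class: first class whose cumulative frequency reaches N/2
k <- which(cf >= N / 2)[1]

# Cumulative frequency of the class preceding the median class
cf_prev <- if (k > 1) cf[k - 1] else 0

grouped_median <- lower[k] + ((N / 2 - cf_prev) / f[k]) * c_len
print(grouped_median)  # 40.94737
```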
3. $\text{mode} = l + \left[\dfrac{f_1 - f_0}{2f_1 - f_0 - f_2}\right] \times c$

l = lower limit of modal class

f0=frequency preceding the modal class

f1=frequency of modal class

f2=frequency succeeding the modal class ; c=class length
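
A short sketch of the mode formula, again using this report's frequencies and assuming equal class widths. Taking the modal class as the class with the largest frequency via `which.max()` is an assumption of this sketch, and the variable names are illustrative.

```r
# Lower class limits, frequencies and class length from this report's table
lower <- seq(0, 60, by = 10)
f     <- c(0, 9, 22, 85, 95, 34, 5)
c_len <- 10

# Modal class: the class with the highest frequency
k <- which.max(f)

f1 <- f[k]                                   # frequency of the modal class
f0 <- if (k > 1) f[k - 1] else 0             # frequency of the preceding class
f2 <- if (k < length(f)) f[k + 1] else 0     # frequency of the succeeding class

grouped_mode <- lower[k] + ((f1 - f0) / (2 * f1 - f0 - f2)) * c_len
print(grouped_mode)  # 41.40845
```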


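$\text{Standard deviation} = \sqrt{\dfrac{\sum_i f_i (x_i - \text{mean})^2}{\sum_i f_i}}$

$\text{Mean deviation (mean)} = \dfrac{\sum_i f_i \, |x_i - \text{mean}|}{\sum_i f_i}$

$\text{Mean deviation (median)} = \dfrac{\sum_i f_i \, |x_i - \text{median}|}{\sum_i f_i}$

$\text{Quartile deviation} = \dfrac{Q_3 - Q_1}{2}$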
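
The four dispersion measures can be sketched together as below, assuming the same grouped data and equal class widths. The helper `grouped_quantile()` is a hypothetical function written for this sketch (it interpolates inside the class that contains the requested position); it is not a base R function.

```r
# Class marks, frequencies and class width from this report's table
x     <- c(5, 15, 25, 35, 45, 55, 65)
f     <- c(0, 9, 22, 85, 95, 34, 5)
width <- 10
lower <- x - width / 2   # lower class limits
N     <- sum(f)
cf    <- cumsum(f)

# Hypothetical helper: interpolated quantile for grouped data (p in (0, 1))
grouped_quantile <- function(p) {
  k <- which(cf >= p * N)[1]               # class containing position p * N
  cf_prev <- if (k > 1) cf[k - 1] else 0   # cumulative frequency before it
  lower[k] + ((p * N - cf_prev) / f[k]) * width
}

m   <- sum(f * x) / N          # grouped mean
med <- grouped_quantile(0.5)   # grouped median

sd_grouped <- sqrt(sum(f * (x - m)^2) / N)   # standard deviation
md_mean    <- sum(f * abs(x - m)) / N        # mean deviation about the mean
md_median  <- sum(f * abs(x - med)) / N      # mean deviation about the median
qd         <- (grouped_quantile(0.75) - grouped_quantile(0.25)) / 2

print(c(SD = sd_grouped, MD_mean = md_mean, MD_median = md_median, QD = qd))
```

Each deviation is weighted by its class frequency `f`, which is what the formulae above require; dropping the weight inside the sum gives a much smaller and incorrect value.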

R-SYNTAX:
mean<-sum(F*X)/sum(F): syntax for calculating the mean

CF<-cumsum(F): syntax for computing the cumulative frequencies

median<-l+(((N/2)-Cf)/f)*c: syntax for determining the median

mode<-l+((f1-f0)/(2*f1-f0-f2))*c: syntax for finding the mode

QD<-(Q3-Q1)/2: syntax for the quartile deviation, once Q1 and Q3 have been found

SD<-sqrt((sum(F*(X^2))/sum(F))-(mean^2)): syntax for finding the standard deviation

MD<-sum(F*abs(X-mean))/sum(F): syntax for finding the mean deviation about the mean

data1<-read.csv(file.choose()): syntax for importing data from a CSV file (for example, one exported from Excel)
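
As a hedged illustration of the import step, the sketch below writes the first ten marks from this report's data listing to a temporary CSV file, reads it back with `read.csv()`, and builds the frequency table with `cut()` and `table()` rather than counting by hand. The temporary file path and the names `breaks`, `classes` and `freq_table` are assumptions made for this example; in the actual session the file was picked interactively with `file.choose()`.

```r
# First ten marks from this report's data listing, saved to a temporary CSV
marks    <- c(32, 54, 38, 44, 68, 41, 30, 43, 46, 41)
csv_path <- tempfile(fileext = ".csv")   # hypothetical file location
write.csv(data.frame(Marks = marks), csv_path, row.names = FALSE)

# Import the data (non-interactive stand-in for file.choose())
data1 <- read.csv(csv_path)

# Classes 0-10, 10-20, ..., 60-70; right = FALSE makes each class [lower, upper)
breaks  <- seq(0, 70, by = 10)
classes <- cut(data1$Marks, breaks = breaks, right = FALSE)

f  <- as.vector(table(classes))   # class frequencies (empty classes count as 0)
cf <- cumsum(f)                   # cumulative frequencies
x  <- head(breaks, -1) + 5        # class marks

freq_table <- data.frame(limit = levels(classes), F = f, CF = cf, X = x)
print(freq_table)
```

With `right = FALSE`, a mark of exactly 40 falls in the 40-50 class, which matches the `data1<40` style of counting used in the session.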

PROBLEM STATEMENT:
R-CODE:
> #20BCE2323

> #Somya Rathi

> date()

[1] "Sat Apr 24 16:31:06 2021"

> #challenging task 1

> data1<-read.csv(file.choose())

> data1

Marks

1 32

2 54
3 38

4 44

5 68

6 41

7 30

8 43

9 46

10 41

11 40

12 31

13 40

14 40

15 36

16 46

17 48

18 32

19 40

20 17

21 48

22 47

23 37

24 52

25 48

26 47

27 32

28 26

29 21
30 41

31 53

32 33

33 32

34 50

35 38

36 33

37 51

38 43

39 45

40 32

41 40

42 50

43 31

44 50

45 42

46 50

47 55

48 52

49 45

50 44

51 41

52 31

53 50

54 45

55 30

56 48
57 37

58 24

59 26

60 40

61 42

62 45

63 40

64 19

65 61

66 32

67 43

68 42

69 27

70 57

71 31

72 57

73 47

74 23

75 60

76 51

77 46

78 40

79 31

80 52

81 21

82 35

83 38
84 15

85 37

86 36

87 41

88 34

89 24

90 30

91 34

92 55

93 34

94 47

95 35

96 58

97 37

98 46

99 41

100 38

101 41

102 15

103 38

104 37

105 52

106 28

107 29

108 38

109 23

110 40
111 51

112 50

113 34

114 34

115 44

116 44

117 43

118 34

119 34

120 38

121 33

122 41

123 44

124 47

125 38

126 30

127 37

128 42

129 41

130 60

131 49

132 37

133 22

134 42

135 48

136 42

137 53
138 44

139 47

140 43

141 54

142 39

143 32

144 44

145 17

146 44

147 54

148 50

149 33

150 44

151 39

152 32

153 35

154 44

155 42

156 42

157 38

158 41

159 25

160 45

161 56

162 50

163 38

164 37
165 50

166 35

167 41

168 33

169 34

170 33

171 26

172 42

173 44

174 42

175 38

176 18

177 56

178 22

179 18

180 38

181 36

182 40

183 50

184 52

185 30

186 44

187 32

188 58

189 33

190 31

191 39
192 48

193 24

194 33

195 46

196 29

197 45

198 38

199 24

200 43

201 48

202 42

203 62

204 37

205 38

206 41

207 32

208 17

209 38

210 28

211 35

212 45

213 49

214 37

215 38

216 31

217 53

218 43
219 47

220 36

221 31

222 47

223 42

224 48

225 40

226 53

227 48

228 51

229 47

230 34

231 29

232 49

233 46

234 46

235 31

236 38

237 48

238 28

239 36

240 45

241 48

242 34

243 39

244 42

245 23
246 37

247 43

248 19

249 39

250 48

> length(which(data1<10))

[1] 0

> length(which(data1<20))

[1] 9

> length(which(data1<30))

[1] 31

> length(which(data1<40))

[1] 116

> length(which(data1<50))

[1] 211

> length(which(data1<60))

[1] 245

> length(which(data1<70))

[1] 250

> length(which(data1<80))

[1] 250

> limit<-c("0-10","10-20","20-30","30-40","40-50","50-60","60-70")

> CF<-c(0,9,31,116,211,245,250)
> F<-c(0,9,22,85,95,34,5)

> cumsum(F)

[1] 0 9 31 116 211 245 250

> X<-c(5,15,25,35,45,55,65)

> t<-data.frame(limit,F,CF,X)

> t

  limit  F  CF  X
1  0-10  0   0  5
2 10-20  9   9 15
3 20-30 22  31 25
4 30-40 85 116 35
5 40-50 95 211 45
6 50-60 34 245 55
7 60-70  5 250 65

> mean=sum(F*X)/sum(F)

> mean

[1] 40.52

> #median

> medianclass=min(which(CF>250/2))

> medianclass

[1] 5

> median=X[5]-5+(250/2-CF[4])*(10/95)

> median

[1] 40.94737

> mode=X[5]-5+(F[5]-F[4])*(10/(2*F[5]-F[4]-F[6]))

> mode

[1] 41.40845
> SD=sqrt((sum(F*(X^2))/250)-(mean^2))

> SD

[1] 10.15527

> MD=sum(F*abs(X-mean))/sum(F)

> MD

[1] 8.32256

> #quartile deviation

> Q1=X[4]-5+(250/4-CF[3])*(10/85)

> Q1

[1] 33.70588

> Q3=X[5]-5+((250*3)/4-CF[4])*(10/95)

> Q3

[1] 47.52632

> #quartile deviation

> QD=(Q3-Q1)/2

> QD

[1] 6.910217

> #OBSERVATION

> mean

[1] 40.52

> median

[1] 40.94737

> mode

[1] 41.40845

> SD

[1] 10.15527

> MD

[1] 8.32256

> QD

[1] 6.910217

> #20BCE2323

> #Somya Rathi

> date()

[1] "Sat Apr 24 17:19:48 2021"

>

SCREEN SHOT OF R CODE:


OUTPUT:
Mean= 40.52
Median= 40.94737
Mode= 41.40845
Standard deviation= 10.15527
Mean deviation (about the mean)= 8.32256
Quartile deviation= 6.910217
