Professional Documents
Culture Documents
Stats Math Da1 Problem 4
Stats Math Da1 Problem 4
(20BCE2323)
PROBLEM 4
AIM:
Aim of this problem is to Calculate the measures of central tendency and
dispersion of the given data
FORMULAE:
∑ 𝑓𝑥
1. mean = ∑𝑓
c=class length
𝑓1−𝑓0
3. mode = 𝑙 + [ ]∗𝑐
2𝑓1−𝑓0−𝑓2
∑𝑖 𝑓𝑖|𝑥𝑖−𝑚𝑒𝑎𝑛|
Mean deviation (mean) =
∑𝑖 𝑓𝑖
∑𝑖 𝑓𝑖|𝑥𝑖−𝑚𝑒𝑑𝑖𝑎𝑛|
Mean deviation (median) =
∑𝑖 𝑓𝑖
𝑄3−𝑄1
Quartile deviation =
2
R-SYNTAX:
mean<-(sum(f*X))/sum(f): syntax for mean calculation
PROBLEM STATEMENT:
R-CODE:
> #20BCE2323
> date()
> data1<-read.csv(file.choose())
> data1
Marks
1 32
2 54
3 38
4 44
5 68
6 41
7 30
8 43
9 46
10 41
11 40
12 31
13 40
14 40
15 36
16 46
17 48
18 32
19 40
20 17
21 48
22 47
23 37
24 52
25 48
26 47
27 32
28 26
29 21
30 41
31 53
32 33
33 32
34 50
35 38
36 33
37 51
38 43
39 45
40 32
41 40
42 50
43 31
44 50
45 42
46 50
47 55
48 52
49 45
50 44
51 41
52 31
53 50
54 45
55 30
56 48
57 37
58 24
59 26
60 40
61 42
62 45
63 40
64 19
65 61
66 32
67 43
68 42
69 27
70 57
71 31
72 57
73 47
74 23
75 60
76 51
77 46
78 40
79 31
80 52
81 21
82 35
83 38
84 15
85 37
86 36
87 41
88 34
89 24
90 30
91 34
92 55
93 34
94 47
95 35
96 58
97 37
98 46
99 41
100 38
101 41
102 15
103 38
104 37
105 52
106 28
107 29
108 38
109 23
110 40
111 51
112 50
113 34
114 34
115 44
116 44
117 43
118 34
119 34
120 38
121 33
122 41
123 44
124 47
125 38
126 30
127 37
128 42
129 41
130 60
131 49
132 37
133 22
134 42
135 48
136 42
137 53
138 44
139 47
140 43
141 54
142 39
143 32
144 44
145 17
146 44
147 54
148 50
149 33
150 44
151 39
152 32
153 35
154 44
155 42
156 42
157 38
158 41
159 25
160 45
161 56
162 50
163 38
164 37
165 50
166 35
167 41
168 33
169 34
170 33
171 26
172 42
173 44
174 42
175 38
176 18
177 56
178 22
179 18
180 38
181 36
182 40
183 50
184 52
185 30
186 44
187 32
188 58
189 33
190 31
191 39
192 48
193 24
194 33
195 46
196 29
197 45
198 38
199 24
200 43
201 48
202 42
203 62
204 37
205 38
206 41
207 32
208 17
209 38
210 28
211 35
212 45
213 49
214 37
215 38
216 31
217 53
218 43
219 47
220 36
221 31
222 47
223 42
224 48
225 40
226 53
227 48
228 51
229 47
230 34
231 29
232 49
233 46
234 46
235 31
236 38
237 48
238 28
239 36
240 45
241 48
242 34
243 39
244 42
245 23
246 37
247 43
248 19
249 39
250 48
> length(which(data1<10))
[1] 0
> length(which(data1<20))
[1] 9
> length(which(data1<30))
[1] 31
> length(which(data1<340))
[1] 250
> length(which(data1<40))
[1] 116
> length(which(data1<50))
[1] 211
> length(which(data1<60))
[1] 245
> length(which(data1<70))
[1] 250
> length(which(data1<80))
[1] 250
> limit-<c("0-10","10-20","20-30","30-40","40-50","50-60","60-70")
> limit<-c("0-10","10-20","20-30","30-40","40-50","50-60","60-70")
> CF<-c(0,9,31,116,211,245,250)
> F<-c(0,9,22,85,95,34,5)
> cumsum(F)
> X<-c(5,15,25,35,45,55,65)
> t<-data.frame(limit,F,CF,X)
>t
limit F CF X
1 0-10 0 0 5
2 10-20 9 9 15
3 20-30 22 31 25
4 30-40 85 116 35
5 40-50 95 211 45
6 50-60 34 245 55
7 60-70 5 250 65
> mean=sum(F*X)/sum(F)
> mean
[1] 40.52
> #median
> medianclass=min(which(CF>250/2))
> medianclass
[1] 5
> median=X[5]-5+(250/2-CF[4])*(10/95)
> median
[1] 40.94737
> mode=X[5]-5+(F[5]-F[4])*(10/(2*F[5]-F[4]-F[6]))
> mode
[1] 41.40845
> SD=sqrt(sum((F*(X-mean)^2)/sum(F))
+ SD=sqrt((sum(F*(X^2))/250)-(mean^2))
"SD=sqrt(sum((F*(X-mean)^2)/sum(F))
SD"
> SD=sqrt((sum(F*(X^2))/250)-(mean^2))
> SD
[1] 10.15527
> MD=sum(abs(X-mean))/sum(F)
> MD
[1] 0.50208
> Q1=X[4]-5+(250/4-CF[3])*(10/85)
> Q1
[1] 33.70588
> Q3=X[5]-5+((250*3)/4-CF[4])*(10/95)
> Q3
[1] 47.52632
> QD=(Q3-Q1)/2
> QD
[1] 6.910217
> #OBSERVATION
> mean
[1] 40.52
> median
[1] 40.94737
> mode
[1] 41.40845
> SD
[1] 10.15527
> MD
[1] 0.50208
> QD
[1] 6.910217
> #20BCE2323
> date()
>