Assignment Stat

You might also like

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 19

Assignment 2 – STAT 20029

Question 1:

a. The value of Orica Ltd at the beginning of every Quarter between 2001 and 2014.

https://au.finance.yahoo.com/q/hp?
s=ORI.AX&a=00&b=1&c=2001&d=11&e=31&f=2014&g=m&z=66&y=132

January April July October


2001 5.61 4.84 4.27 4.21
2002 7.36 7.88 9.17 9.22
2003 10.72 8.87 10.49 11.74
2004 14.25 13.86 14.65 16.84
2005 29.79 17.76 17.29 20.48
2006 20.82 22.90 24.00 22.60
2007 24.80 25.10 29.95 30.15
2008 32.40 28.98 28.52 21.77
2009 13.98 14.64 21.20 23.71
2010 25.50 26.94 24.90 25.98
2011 24.90 26.45 26.94 22.61
2012 24.24 28.26 24.70 25.00
2013 25.07 24.45 20.31 20.06
2014 23.86 21.60 19.66 18.69

Round up the whole number, so the value of Orica Ltd will be as follows:

January April July October


2001 6 5 4 4
2002 7 8 9 9
2003 11 9 10 12
2004 14 14 15 17
2005 30 18 17 20
2006 21 23 24 23
2007 25 25 30 30
2008 32 29 29 22
2009 14 15 21 24
2010 26 27 25 26
2011 25 26 27 23
2012 24 28 25 25
2013 25 24 20 20
2014 24 22 20 19
Stem-and Leaf display:
Stem Leaf

0 445678999

1 012444557789

2 0000112233344444555555566677899

3 0002

b. The relative frequency

With the sample size of 56 (n=56) monthly prices, the relative frequency of each class
equals the frequency of monthly prices divided by 56. So that, the result will be as
follows:

Class Interval Frequency Relative Frequency


$0 to less than $5 2 2
=0.0357
56

$5 to less than $10 7 7


=0.125
56
$10 to less than $15 6 6
=0.1071
56
$15 to less than $20 6 6
=0.1071
56
$20 to less than $25 16 16
=0.2857
56
$25 to less than $30 14 14
=0.25
56
$30 to less than $35 5 4
=0.0714
56
The relative frequency histogram:

Histogram
18
16
14
Frequency

12
10
8
6
4
2
0
$0 to less $5 to less $10 to less $15 to less $20 to less $25 to less $30 to less
than $5 than $10 than $15 than $20 than $25 than $30 than $35

Orica Ltd Monthly Price from 2001 to 2014

Figure 1

c.
 Describe the data from the histogram and the relative frequency table:

A glance at the histogram and stem-leaf display provided reveal the significant change
in monthly prices of Orica Limited over the period 13 years from 2001 to 2014. To
start with, the monthly price of $20 to less than $25 took the lead with the frequency
of 16 times, followed by the monthly price of $25 to less than $30 with the frequency
of 15 times. In stark contrast, the smallest belongs to the monthly price of $0 to less
than $5 with 2 times of occurring. Finally, the monthly price of $5 to less than $10,
$10 to less than $15, $15 to less than $20 and $30 to less than $35 occupied the fairly
similar frequency with 7, 6, 6 and 4 times occurring respectively between 2001 and
2014.
 Assume that the interval size is doubled, the result will be as follows:

Class Interval Frequency Relative Frequency


$0 to less than $10 9 9
=0.1607
56

$10 to less than $20 12 12


=0.2142
56
$20 to less than $30 31 31
=0.5536
56
$30 to less than $40 4 4
=0.0714
56

The relative frequency histogram:

Histogram
35
30
25
Frequency

20
15
10
5
0
$0 to less than $10 $10 to less than $20 $20 to less than $30 $30 to less than $40

Orica Ltd Monthly Price from 2001 to 2014

Figure 2

In a nutshell, as can be seen in two histograms, there are two effects when the interval
size is doubled. In the first place, the frequency of class interval from Figure 2 was the
sum of 2 class intervals in Figure 1. For instance, the frequency of the class interval $0
to less than $10 equal to the class interval $0 to less than $5 plus $5 to less than $10 ( 2
+ 7 =9). In the second place, the original data points have been accurate. That means for
the Figure 2 we cannot know exactly the frequency of the monthly less than $5 or equal
$5.
d. The proportion of stock prices were above $25was:
Question 2:

The data after sorted:

States
ACT NSW NT QLD VIC WA TAS SA
$107.3
$96.00 $96.60 0 $86.60 $94.20 $93.20 $104.50 $97.50
$134.9
$111.00 $108.60 0 $104.30 $107.40 $109.60 $109.60 $105.70
$135.3
$115.00 $109.00 0 $107.90 $109.40 $113.40 $114.80 $109.30
$149.7
$124.10 $112.50 0 $112.50 $113.20 $120.10 $124.60 $113.20
$154.8
$125.70 $119.90 0 $112.90 $120.30 $127.00 $128.20 $120.50
$157.6
$131.40 $125.90 0 $127.70 $126.30 $131.10 $130.30 $124.50
$133.90 $133.40 $127.70 $131.20 $140.40 $132.40 $130.40
$137.00 $137.30 $132.30 $136.40 $143.80 $140.60 $132.10
$145.00 $142.20 $144.60 $139.20 $151.30 $149.90 $138.90
$151.40 $145.30 $145.70 $143.00 $152.50 $139.60
$155.30 $151.80 $153.80 $148.90 $156.70 $148.90
Table 1

a.
n

 Mean: ∑ xi
x= i=1
n

∑ xi 96.00+111.00+115.00+124.10+125.70+131.40+133.90+145.00+ 137.00+ 151.40+ 155.30 1425


i =1
x ACT = = =
n 11 1
n

∑ xi 96.60+108.60+109.00+112.50+ 119.90+ 125.90+133.40+137.30+142.20+145.30+151.80


x NSW = i=1 =
n 11

1382.50
¿ =125.70
11
n

∑ xi 107.30+134.90+ 135.30+ 149.70+154.80+157.60 839.60


x NT = i=1 = = =140.00
n 6 6
n

∑ xi 86.60+104.30+107.90+112.50+112.90+127.70+127.70+132.30+144.60+145.70+153.80
i=1
x QLD = =
n 11

1356.00
¿ =123.27
11
n

∑ xi 94.20+107.40+ 109.40+ 113.20+ 120.30+126.30+131.20+136.40+139.20+143.00+148.90


i=1
x VIC = =
n 11

1369.50
¿ =124.50
11
n

∑ xi 93.20+109.60+113.40+120.10+127.00+131.10+140.40+ 143.80+ 151.0


i=1
x WA = =
n 9

1129.90
¿ =125.54
9
n

∑ xi 104.50+109.60+114.80+124.60+128.20+130.30+132.40+140.60+149.90+152.50+156.70
i=1
x TAS = =
n 11

1444.10
¿ =131.28
11
n

∑ xi 97.50+105.70+109.30+113.20+120.50+124.50+130.40+132.10+138.90+149.60+ 148.90
i=1
x SA = =
n 11

1360.60
¿ =123.69
11
n+1
 Median: The position of the median is 2

Positions of medians for each make:


Position of median ACT =Position of median NSW =Position of medianQLD

( n+ 1 )
¿ Position of medianVIC =Position of medianTAS =Position of medianSA =
2

11+1 12
¿ = =6 th
2 2

( n+1 ) 6 +1 7
Position of median NT = = = =3.5th
2 2 2

( n+1 ) 9+1 10
Position of medianWA = = = =5 th
2 2 2

According to Table 1 the median value will be:


Median ACT =$ 131.40; MedianNSW =$ 125.90; MedianWA =$ 127.00;

$ 135.30+ $ 149.7 285


MedianVIC=$ 126.30; MedianNT ¿
2
=
2
= $ 142.50;

MedianTAS =$ 130.30; MedianSA =$ 130.40 ; MedianQLD =$ 127.70

n+1
 Frist Quartile: The position of first quartile ¿ 4

Positions of first quartile:


Position of first quartile ACT =Position of first quartile NSW

¿ Position of first quartileQLD =Position of first quartile VIC

( n+1 ) 11+1 12
¿ Position of first quartileTAS =Position of first quartile SA= = =
4 4 4

¿ 3 nd
n+1 ( 6 +1 ) 7
Position of first quartile NT = = = =1.75 nd
4 4 4

( n+1 ) 9+1 10
Position of first quartile WA = = = =2.5 nd
4 4 4

Hence, the first quartile values are:


First quartile ACT =115; First quartileNSW =109.00 ; First quartile QLD =107.90

First quartile VIC =109.40; First quartile TAS =114.80; First quartileSA =109.30

First quartile NT =107.30+ 0.75 × ( 134.90−107.30 )=128.00;

First quartile WA =109.60+0.5 × ( 113.4−109.60 ) =111.5

3×(n+ 1)
 Third Quartile: The position of third quartile ¿ 4

Positions of third quartile are:


Position of third quartile ACT =Position of third quartile NSW

¿ Position of third quartileQLD =Position of third quartileVIC

3 × ( n+1 )
¿ Position of third quartileTAS =Position of third quartileSA =
4

3×(11+1) 36
¿ = =9 th
4 4

3 × ( n+1 ) 3 ×(6+1) 21
Position of third quartile NT = = = =5.25th
4 4 4

3 × ( n+1 ) 3 ×( 9+1) 30
Position of third quartile WA = = = =7.5 th
4 4 4

Hence, the third quartile values are:


Third quartile ACT =145.00;

Third quartile NSW =142.20Third quartileQLD=144.60Third quartile VIC =139.20

Third quartile TAS =149.90


Third quartile SA=138.90

Third quartile NT =149.70+ 0.25× ( 154.80−149.70 )=150.98

Third quartile WA =140.40+0.5 × ( 143.80−140.40 )=142

b.

Standard deviation formula: S= √∑ X −¿ ¿ ¿ ¿ ¿ ¿


2

NT’s standard deviation

∑ X 2=¿ 107.302 +134.902 +135.302 +149.702+ 154.802+157.60 2=119,228.28 ¿


¿¿¿

√ √
704,928.16 704,928.16
119,228.28− 119,228.28−
S NT = √∑ X 2−¿ ¿ ¿ ¿ ¿ ¿ = 6
=
6 =$18.66
6−1 5

All in all, the standard deviation of data of NT make is $18.66

NSW’s standard deviation

∑ X 2=¿ 96.602 +108.602 +109.002 +112.502 +119.902 +125.902 +133.402+ 137.302+142.20 2+145.30 2+151.802 =176,9
¿¿¿

√ √
1,911,306.25 1,911,306.25
176,912.61− 176,912.61−
S NSW = √ ∑ X −¿ ¿ ¿ ¿ ¿ ¿ =
2
11
=
11 =$17.77
11−1 10

So, the standard deviation of data of NSW is $17.77

Following the formula like above, the results can be obtained:

Standard deviation of ACT is 17,84


Standard deviation of QLD is $20.43
Standard deviation of VIC is $17.11
Standard deviation of WA is $18.47
Standard deviation of TAS is $17.45
Standard deviation of SA is $16.04

Range = largest value – smallest value = X largest −X smallest


Range ACT =155.30−96.00=59.30

Range NSW =151.80−96.60=55.2

Range NT =157.60−107.30=50.3

RangeQLD =153.80−94.60=59.20

RangeVIC =148.90−94.20=54.70

RangeWA =151.50−93.20=58.30

RangeTAS =156.70−104.50=52.20

RangeSA =124.50−97.50=27.00

Coefficient of variation: CV = ( XS ) ×100 %


CV ACT = ( 129.62
18.74
) ×100 %=13.76 %
CV NSW = ( 125.70
17.77
)×100 %=14.14 %
CV QLD = ( 123.37
20.43
) ×100 %=16.56 %
CV NT = ( 140.00
18.66
) ×100 %=13.33 %
CV VIC = ( 124.50
17.11
)× 100 %=13.74 %
CV WA =( 125.54
18.47
)× 100 %=14.71%
CV TAS = ( 131.28
17.45
) ×100 %=13.30 %
CV SA =( 123.69
16.04
) ×100 %=12.97 %
Question 3:

a.

The probability that a person, randomly selected, has mood (affected) problems and belongs
to the age group 35 to 44 will be:
401.7
P(A) = = 0.134
2,996.2

b. probability that a person, randomly selected, belongs to the age group of 55 or over:
460.8+228.1+170.4
P(B) = = 0.287
2,996.2

c. Given that the person belongs to the age group of 45 to 54, probability that he or she
is suffering from alcohol and drug problems:
35.9
P(C)= =0.067
532.9

d. Are different types of mental and behavioural problems independent of the gender?

Gender
Mental and Males Females Persons
behavioural
problems
Alcohol and drug 98.7 53.4 152.1
problems
Mood (affective) 822.8 1,320.3 2,143.1
problems
Anxiety related 360.1 490.0 850.1
problems
Problems of 100.6 53.6 154.1
psychological
development
Behavioural and 100.8 34.9 135.7
emotional problems
with usual onset in
childhood/adolescence
Other mental and 128.3 67.6 196.0
behavioural problems
(c)
Symptoms and signs 54.3 54.0 108.3
involving cognition,
perceptions, emotional
state and behaviour
Total mental and 1,323.6 1,672.7 2,996.2
behavioural problems

Set up H 0∧H a including: { H 0 :Gender is independent of mental∧behavioural problems


H a :Gender is not independent of metal∧behavioural problems }
The chi-square test of independence is appropriate.

Let α =.05. Here, there are three rows (r=3) and seven columns (c=7). The degree of freedom
is equal to: DF=(r - 1) × (c - 1) = ( 3−1 ) × (7−1 )=12. The critical of chi-square for α =.05 from
Table A.9 page 782 in Textbook is 5.9915.

If the chi-square value calculated from the data is greater than 5.9915, we reject the null
hypothesis and conclude that gender is not independent of mental and behavioural problems.

After that, using the formula of the expected frequencies:

( row i total ) (column jtotal )


e ij =
n

Where: e ij : t h e expected frequency ∈row i , column j


n: the total of all frequencies

Applying the formula we have:

( row 1 total )( column 1total ) 152.1 ×1323.6


e 11= = =67.20
n 2996.2

( row 1 total )(column 2total ) 152.1 ×1672.7


e 12= = =84.91
n 2996.2

Following the formula like above, the consequences will be:


e 21 =946.73 e 51=59.95
e 22=1196.43 e 52=75.75
e 31 =375.53 e 61=86.60
e 32=474.60 e 62=109.42
e 41 =68.08 e 71=47.84
e 42=86.03 e 72=60.50

Then, we have:

Gender
Mental and Males Females Persons
behavioural
problems
Alcohol and drug 98.7 53.4 152.1
problems
67.20 84.91
Mood (affective) 822.8 1,320.3 2,143.1
problems
946.73 1196.43
Anxiety related 360.1 490.0 850.1
problems
375.53 474.60
Problems of 100.6 53.6 154.1
psychological
development
68.08 86.03
Behavioural and 100.8 34.9 135.7
emotional problems
with usual onset in
childhood/adolescence
59.95 75.75
Other mental and 128.3 67.6 196.0
behavioural problems
(c)
86.60 109.42
Symptoms and signs 54.3 54.0 108.3
involving cognition,
perceptions, emotional
state and behaviour
47.84 60.50
Total mental and 1,323.6 1,672.7 2,996.2
behavioural problems

( f ¿ ¿ 0−f e )2
Next, we compute the chi-square value by summing ¿ for all
fe
cell.

2 ( 98.7−67.20 )2 ( 53.4−84.91 )2 ( 822.8−946.73 )2


x= + + +¿
67.20 84.91 946.73
( 1320.3−1196.43 )2 ( 360.1−375.53 )2 ( 490−474.60 )2
+ + +¿
1196.43 375.53 474.60
( 100.6−68.08 )2 (53.60−86.03 )2 ( 100.8−59.95 )2
+ + +¿
68.08 86.03 59.95
( 34.9−75.75 )2 ( 128.3−86.60 )2 ( 67.6−109.42 )2
+ + +¿
75.75 86.60 109.42
2 2
(54.3−47.84) (54−60.50)
+ =166.568
47.84 60.50

Question 4:

201 Ja Ma Ju Au No
Feb Mar Apr Jun Sep Oct Dec
4 n y l g v
1st 0 0 8.8 0 0 23. 0 0 0 0 0 2
8
2nd 0 0 4.2 0 0 0.8 0 0 0 0 1.2 5.2
3rd 0.4 0 0.2 0 2.6 0 0 0 5.2 0 0 1
4th 0 0 4.8 0.2 4.6 0 0 0 0 0 0 9
28. 21.
5th 0 0.8 0 4.6 0 0 0 4.2 0 0
4 4
38.
6th 0 0.4 12 2.2 0 0 0 6.8 0 3.8 0.6
2
18. 10.
7th 0 0 0.2 0.2 1.4 0 0 0 0 17
6 4
27.
8th 0.2 0 0 2.2 0 0 0 1.6 2.6 4.2 0
6
9th 0.2 0 0.2 0 1 1.6 0 0 0 0 0 1.4
1.
10th 0.6 0.2 0 0 0 9.2 0 5.8 0 0 0
2
11th 0 0 0 5 0 2.6 0 0.8 0 0 0 7.8
28.
12th 0 0 0 8 0.2 0 0 0 0 0 9.8
2
30. 18.
13th 0 1.2 1.4 8.4 0 0 0 0 0 0
6 2
14th 0 0 0 2.4 0 3.8 0 0 1.2 11.4 0 0
1. 106.
15th 0 2.6 0 1.6 0 0.2 0 0 0 0
2 8
15. 1.
16th 0 4.6 9.6 0 0.4 0 0.4 0.4 5.8 0
2 6
17. 0. 35.
17th 0 3.8 0 0 0 0 0 0 0
4 2 6
35.
18th 0 0 0 0 0 0 0 0 0.2 0 0
4
36.
19th 0 0 0 0 0 0 0 0 0 0 0
8
24.
20th 0.2 0 0 0 0 0 9.2 0 0 0 0
6
21st 2.2 0 4.2 0 0 0 0 13 0.4 0 0 0
22n
3.4 0 0 0 0 0 0 3.2 0 0 0 0
d
0.
23rd 0.4 0 1.4 0 0 0 6.2 0 0 0 0
2
24th 0 0 0 0 0 0 0 3.8 0 0.2 0 2.6
25th 3.4 0 40 0 0 0 0 0 0 0.2 2.8 0
26th 0.8 0 0.2 4.8 0 0 5. 17. 4.6 0.8 0 13.
6 8 8
22. 28.
27th 0 6.2 0.4 0 0 2 0.2 0 0 0
8 8
12.
28th 0 0 6.4 2.6 0 0 0 0 0 0 0
4
29th 0   1.2 0 0 0 0 1.8 0 0 0 0
30th 0   0 3.4 0 0 0 2.4 0 0 0 0
31st 0   2.8   2.6   0 1.2   0   0

Week no. 1 2 3 4 5 6 7 8 9 10 11 12 13
Rainfall
0 4 2 3 5 2 3 5
days 1 3 0 6 2
Rain 0. 10. 1. 19. 40. 9. 70.
0
amount 4 1 0 4 2 8.6 42 2 17.4 2 4 6
 
14 15 16 17 18 19 20 21 22 23 24 25 26
4 5 3 2 4 2 2 0 2 4 5 1 0
9. 35. 13. 1. 26. 17. 0.
8 2 19.2 5.2 2 2 8.6 0 4 68.8 4 4 0

27 28 29 30 31 32 33 34 35 36 37 38 39
0 1 3 3 0 1 4 7 6 4 3 2 2
1. 82. 107. 64. 0.
0 1.2 3 6 0 6 8 6 4 26.6 9.6 8 4.8

40 41 42 43 44 45 46 47 48 49 50 51 52
0 1 4 3 1 1 1 0 1 7 5 0 2
118. 3. 46. 16.
0 4.2 8 1.2 1.2 8 5.8 0 2.8 56.2 6 0 4

a) Assuming that Rainfall or No-Rainfall event follows a Poisson distribution:

(i) What is the probability that on any given week in a year there would be no rainfall?

λ=
∑ x = 132 = 2.538
n 52
0 −2538
2.538 ∗e
P(No rainfall in a given week ) = P(X=0| λ=2.538) = =
0!

(ii) What is the probability that there will be 2 or more days of rainfall in a week? (There are
52 weeks in a year and a week is assumed to start from Monday).

P(X = 2 or more) = 1- P(X=1) – P(X=0) =

You might also like