Professional Documents
Culture Documents
Apply Stati For Editing 2014
Apply Stati For Editing 2014
មេម ៀនទី១
ស្ថត
ិ ិអត្ថថធិបាយ
(DESCRIPTIVE STATISTICS)
1.1-អ្វីជាស្ថត
ិ ិ?
1.2- របរភទស្ថត
ិ ិ
ស្ថិតិរតូវបានប្ចករចញជាពី រប្ផនក៖
1.2.1- ស្ថត
ិ ិពណ៌នា
ឧទាហរណ៍៖
- According to the microfinance survey shows that 49% of the respondents who used the
credit service met the problem of loan repayment. The statistic 49 describes the number out of
every 100 persons who met the problem.
- According to Consumer Reports, Whirlpool washing machine owners reported 9
problems per 100 machines during 2009. The statistic 9 describes the number of problems out of
every 100 machines.
1.2.2-ស្ថត
ិ ិស្ននិោឋន
ទូ រៅកមា។
Population
Sample
1.3- របរភទអ្រថរ
អ្រថរជានិមិត
ា ស្ញ្ញាមួ យប្ដលរគយកមករយការណ៍ជាតួ រលខ ឬតនមល (មិនានរលខ)រៅនឹ ង
នឹ ងការបង្ហាញទិ នន
ន ័ យអ្រថរអាចជាអ្រថរគុ ណភាព (qualitative) និ ងបរ ិាណ(quanlitative)៖
អ្រថរបរ ិាណជាអ្រថរប្ដលរគយកមករយការណ៍ជារលខ។
Example:បរ ិមាណផលិតផលដែលបានលក់,កំ ឡុងពពលរងចំរបស់អតិ ថិជនសរមាប់ ពសវាកមម, ចណូ
លររួសារ, ចំ នួនពមា៉ ងពលើសកនុងមួ យដែ, ទំ ហំការលក់ (បរ ិមាណលក់ ឬជាែុល្លល), និងមានពរចើនពទៀ
ត។
អ្រថរបរ ិាណអាចជាអ្រថរោច់ (Discrete) និ ងជាអ្រថរជាប់(Continuous)៖
ចរនាលោះរវងតនមលនិមួយៗរបស្់វ។
Example:
ចំ នួនបនទប់ពែកកនុងផទះមួ យ, ចំនួនសមាជិកររួសារ, ចំនួនបរ ិកាាពផេងៗ (1,2,3,...)។
- អ្រថរជាប់ ជាអ្រថរអាចស្នាតពី តនមលរបស្់វស្ថិតរៅកនុងចរនាលោះជាក់ លក់មួយ។
Example:
រយៈពពលពធវើែំពណើរពីភំ ពន ពញពៅពសៀមរាប, ចំ ណូល, ទមងន់វតថុ ។ល។
សពងាបរបពភទអពថរិទិនននយ
DATA
discrete continuous
(number of children) (time taken for an exam)
1.4របភពទិនន
ន ័យស្ថត
ិ ិ.
1.4.1- ទិនន
ន ័យរបាោះពុមព Secondary data
ជាធមមតាបញ្ហ
ា ការស្រសាវរជាវតរមូវឲ្យពបាះពុ មផ
ភ េពវផាយ។ ទិនននយសថិតិពលើ បញ្ហ
ា ទំងពនះអាច
រតូវបានរកព ើញកនុងអតថបទពបាះពុម,ភ journals, សារណាបទ, របាយការណ៍រដ្ឋាភិ បាល, និ ងទសេនាែតី
ជាពែើ ម។
1.4.2- ទិនន
ន ័យបឋម Primary data
ទិ នន
ន យពបាះពុ មរ
ភ ឺ មិនអាចផតល់ពត៌ មានររប់ រាន់ សរមាប់ ការសិកាស្រសាវរជាវ
បានពទ។ កនុងករណីពនះពត៌ មាននឹ ងរតូវបានរបមូ ល,វ ិភារ។ វ ិធី មួយននការរបមូលទិ នន
ន យបឋមរឺ
ពរៀបចំករមងសំណួរ។
1.5 ករមិតរង្ហវស្់ទិនន
ន ័យ .Levels of Measurement
ជាការពិ តមានរបពភទរង្វវស់ទិនន
ន យបួនសរមាប់ អពថរមួ យដែលបានរបមូ ល៖
រង្វវស់ Nominal,
រង្វវស់ Ordinal,
រង្វវស់ Interval, and
រង្វវស់ Ratio.
រង្វវស់ Nominal level (scaled):
ជារង្វវស់សរមាប់ របពភទទិនននយរកុម(categories)ិរុណភាព(qualitative)ពដ្ឋយពុំ រិតលំដ្ឋប់
ឬទី តាំងននការពរបៀបពធៀបពទបើយ។ កនុងរង្វវស់ nominal តនមលពលែ រតូវបានពរបើជាព្មះ
សរមាប់ តាងឲ្យតនមលរបស់អពថរមួ យ។
Example:
អពថរ Variable: ពសវាកមមទំនាក់ទំនង Communication services
តនមលValues (ទិ នន
ន យ data): 1 = Mobitel, 2 = Metfone, 3 = Mfone,
4 = Smart mobile, 5 = Beeline, 6=qb.
Ratio level:
Ratio ជារង្វវស់រតូវបានពរពរបើ សរមាប់វាស់ពលើទិនន
ន យពលែរបស់អពថរដ្ឋច់។
Example: ែូចជាទិនន
ន យចំ ណូលរបចំដែ,ទមងន់វតថុ, ចមាងយ, សីតុណាភាព,ទំ ហំនផទមុែកាត់ , ចំនួន
ផលិតផលមួ យ,ទំ ហំននការលក់ ជាពែើ ម។
ការអ្រងេតកនុងរកុមនិ មួយៗ
Example2:
ស្ំណួរ៖ រតើ អ្ក
ន រកចំណូលបានប៉ាុ នាានកនុងមួ យប្ខ?
1.7- រកាហវបង្ហាញទិនន
ន ័យស្ថត
ិ ិ Graphical presentation of data
តនមលពិតរបស្់វ។
ព័ ត៌ានកនុងតារងររបកង់អាចបង្ហាញជារកាហវស្ស្របាន។ រកាហវស្ស្រ
អាចរតូវបានរគររបើរដើមបីបង្ហាញទិ នន
ន ័ យរង្ហវស្់ nominal និ ងordinalផងប្ដរ។
Bar Chart
រកាហវផិ ត
ល Pie Chart
លំហាត់
g> ePT
c> kmµsiT§rbs;m:asIunftkaEstvIedGU
q> cMnnY eRkDItEdl)ancuHbBa¢Ikgñú qmasfµeI nH
C> manRbePTexaGavmYycMnYnEdleK)anTijkñúgeBlfµI²enHenAhaglk;esovePA
Q>viFIsaRsþénkarTUrTat;ly u kñgú haglk;esovePA.EbgEckGefrnImYy² kñgú cMeNam
GefrTaMgenHCa GefrminEmnelx(Categorical) rWGefrelx (Numerical). RbsinebIGefrelx cUrkMNt;fa
etIGefrNaCaGefrdac; rWGefrCab;.
1.6 cMeBaHGefrécdnüxageRkam kMNt;nUvGefrnImYy² faetIGefrNaCaGefrminEmnelx rWGefrelx.
faetIGefrNaCaGefrdac; rWGefrCab;.
k> cMnYnTwkR)ak;EdlcMNaykñúgkaredIrTijexaGavenAkñgExmun
x> cMnYnGavrgarEdlman
K> hagEdleBjcitþ
X> eBlevlacMNaykñgú karedIrTijexaGavenAkñgú Exmun
g> kMLgu eBledIrTijexaGaveRcInCageKbMput ¬ éf¶Fmµta eBlyb; nigcugs)þah_ ¦
c> cMnYnKUrrbs;eRsamédrgarEdlman
q> RbePTyanþCMniHcMbgEdleRbIkgñú eBledIrTijexaGav
មេម ៀនទី២
2.1- ស្ថិតិស្កល
2.1.1- មធយមស្កល
For ungrouped numerical data, the population mean is the sum of all the population
values divided by the number of population values:
x
N
ប្ដល៖
បានពី រថយនតនិមួយៗ៖
56,000, 23,000, 42,000, and 73,000.
ចូ ររកអាយុមធយមរបស្់រថយនតនិមួយៗ។
ដំ រណាោះស្រសាយ
អាយុមធយមរបស្់រថយនតទាំងបួនរនោះគឺ ៖
x 56,000 23,000 42,000 73,000
48,500
N 4
The population variance for ungrouped numerical data is the arithmetic mean of the squared
deviations from the population mean.
( x ) 2
2
N
( x :) 2
ជាផលបូកគាលតចាាយការរពីមធយមស្កល is the sum of square
deviations from the population mean.
Example:
A class of statistics consists of 5 students with the scores:
30, 17, 23, 18, 15. What is the variance?
x 30 17 23 18 15 130
20.6
- The mean: N 5 5
(x ) 2
2
-The Variance: N
(30 20.6)2 (17 20.6)2 (23 20.6) 2 (18 20.6) 2 (15 20.6)2
5
145.2
2 29.04
5
2.1.3- គាលតស្តង់ោស្កល
The population standard deviation ( ) is the square root of the population variance.
( x ) 2
N
For previous example, the population standard deviation is 5.39 (square root of 29.04).
29.04 5.3888
ស្រងេបរូបមនតស្ត
ិ ថ ិស្កល
x
N
( x ) 2
N
2.2- ស្ថត
ិ ិស្ណា
ំ ក (Sample Statistics)
2.2.1- មធយមស្ថត
ិ ិស្ណា
ំ ក Sample Mean
The sample mean is the sum of all the sample values (observations) divided by the number of x
sample values (observations):
x
x
n
ដែល៖
x តាងមធយមស្ំណាក stands for the sample mean
n ជាចំ នួនននការអ្រងេត (x) កនុងស្ថិតិស្ំណាកមួ យ
Example: សថិតិស្ំណាកននអ្នករបតិ បតតិការ៥រូប បានទទួ លចំ នួនរបាក់ បប្នថមរបចំឆ្នំ កាលពី ឆ្នំមុន៖
$14,000, $15,000, $17,000, $16,000, and $15,000.
x x
n
14,000 15,000 17,000 16,000 15,000
x 15,400
5
s2
( x x )2
n 1
ដែល៖
s 2 : is the sample variance
x : is the value of each observation (value of a variable)
x : is the sample mean
n : is the sample size or number of observations in the sample
(x x) 2
: is the sum of square deviations from the sample mean
Example: ធានាារ ANZ Royal កំ ពុងសិកាពីចំនួនពពលដែលរតូវពរបើ កុងមួ
ន យនថង
ពដ្ឋយមា៉ សុីនែករបាក់ សយរបវតត
វ ិ សិត
ថ ពៅកនុងផារទំពនើបសូរ ិយា។ ខាងពរកាមជាចំ នួន ពពលដែលរតូវ
ពរបើកុងនថង
ន និមួយៗពដ្ឋយមា៉ សុីនពនះកាលពី ១០នថងមុន។
95 34 78 61 59 84 95 47 87 60
1. ចូ រកំណត់មធយមសថិតិសំណាកននចំ នួនពពលដែលរតូវពរបើ ពដ្ឋយមា៉ សុីនពនះ
x
x 95 34 78 61 59 84 95 47 87 60 700 70
n 10 10
s2
(x x) 2
n 1
(95 70) 2 (34 70) 2 (78 70) 2 (61 70) 2 (59 70) 2 (84 70) 2 (95 70) 2 (47 70) 2 (87 70) 2 (60 70) 2
10 1
625 1296 64 81 121 196 625 529 289 100 3926
436.22
9 9
2.2.3- គាលតស្តង់ោស្ថត
ិ ិស្ណា
ំ ក Sample Standard Deviation
s (x x) 2
n 1
ដែល៖
s : is the sample standard deviation
x : is the value of each observation (value of a variable)
x : is the sample mean
n : is the sample size or number of observations in the sample
(x x) 2
: is the sum of square deviations from the sample mean
Example: ទិនន
ន ័ យននរស្វកមាា៉ាស្ុីន ATM របស្់ធនាោរ ANZ Royal Bank រយើងាន៖
s (x x) 2
436.22 20.88
n 1
ស្រងេបរូបមនតស្ត
ិ ថ ិស្ណា
ំ ក
x
x
n
( x x ) 2
s
n 1
2.3- មធយមមួយចំនួនរទៀត Other Means
2.3.1-មធយមននទិនន
ន ័យរកុម The Mean of Grouped Data
មធយមសថិតិសំណាកននទិនន
ន យដែលពរៀបចំ កុងតារាងបំ
ន ដណងដចកពរបកង់ រតូវបាន
រណនាពដ្ឋយរូបមនតខាងពរកាម៖
x xf xf
f n
ដែល៖
x : ជាតនមលផិត
ច ននថ្ននក់ the mid-point of the class
f : ជាពរបកង់ ននថ្ននក់ the frequency of the class
Example:
សថិតិសំណាកនិ សេិតចំ នួន150នាក់ រតូវបានសមាភសពីការចំ ណាយទិ ញកាតទូ រសពទ របចំដែបាន
បង្វាញកនុងតារាងខាងពរកាម៖
Expense category($) Frequency
5 to 10 70
10 to 15 30
15 to 20 20
20 to 25 15
25 to 30 10
30 to 35 5
Total 150
ដំ រណាោះស្រសាយ
xw xf
2025
13.5
f 150
លទធផលអពងេតបង្វាញថ្ននិ សេិតចំណាយសរមាប់ការពៅទូរសពទពចញជាមធយមរឺ
$13.5 កនុងមួ យដែ។
2.3.2- មធយមធរណីារត Geometric Mean
មធយមធរណីមារតមានសារៈសំខាន់ កុងការរកតនមល
ន មធយមភាររយ, ផលពធៀប,
សនទសេន៍, និ ងអរតាកំ ពណើន។ វាក៍ មានការអនុ វតតទូលំទូល្លយកនុងអាជី វកមម, ពសែាកិច,ច
និ ងកសិកមមផងដែរ ពររះពយើងដតងដតរតូវបានសិកាកនុងការរកបដរមបរមួលភាររយ កនុងទិ នផ
ន លស្រសូវ,
ចំ ណូលររួសារ, ការលក់ផលិតផល ឬតួ ពលែពសែាកិចែ
ច ូ ចជា
the Gross National Product (GNP)។ មធយមធរណីមារតននសំណុំចំ នួន n ករណី រឺ ជាឫសទីn ននផល
រុ ណតនមល x និ មួយៗ ននចំនួនn ករណី។
The geometric mean (GM) ននសំណុំចំនួន n ករណីរតូវបានកំ ណត់ពដ្ឋយរូបមនត
ែូ ចខាងពរកាម៖
(1) GM n ( x )( x )( x )...( x )
1 2 3 n
2.5- រង្ហវស្់របាយទិនន
ន ័យ
ឧទហរណ៍៖ ពិ និតយសំណុំទិ នន
ន យពីរខាងពរកាម៖
Set I : 1, 2, 3, 4, 5, 6, 6, 7, 8, 9, 10, 11
Set II: 4, 5, 5, 5,6, 6, 6, 6, 7, 7, 7, 8
1 2 3 4 5 6 7 8 9 10 11 x
Skew = 0
4 5 6 7 8 x
- ភាពពរទតវ ិជជមាន: Mean and Median are to the right of the Mode.
Skew > 0
Mode<Median<Mean
- ភាពររទតអ្វ ិជាាន: Mean and Median are to the left of the Mode.
Skew < 0
Mean<Median<Mode
2.5.2- រង្ហវស្់ននរបាយទិនន
ន ័យ Measures of Disperson
Data set I 1 2 3 4 5 6 6 7 8 9 10 11
Data set II 4 5 5 5 6 6 6 6 7 7 7 8
ដំ រណាោះស្រសាយ
s 20.98
(ទំ ហំនផទដី) CV (100) (100) 35 percent
X 59.17
s 1,298.66
(ទិននផលស្រស្ូវ) CV (100) (100) 73 percent
X 1,774.75
បកស្រសាយ៖ ានទំ នាក់ ទំនងរបាយទិននន័យខាលំងជាងរៅកនុងបំ ប្ណងប្ចកននទិននផលស្រស្ូវ
ស្រងេបរូបមនតរបាយទិនន
ន ័យ
គាលតចាាយrange:
( x x ) 2
s2
n 1
គាលតស្តង់ោស្ថិតិស្ំណាក( s ):
( x x ) 2
s
n 1
រមគុ ណបប្រមបរមួល(CV)
s
CV (100)
x
លំហាត់
muiksickU 8.9
Cb:un 5.5
cRkPBGg;eKøs 3.3
GaLWmg: ; 2.1
)araMg 1.1
eRbsIlu 1
kUer:xageCIg 1
GIutalI 0.6
GURsþalI 0.5
eTAelIEpñkdwkCBa¢ÚnenAkñúgqña2M 006
Gakascr cMnYnt¥ÚjEt¥r(Complain)
British Airways 98
Air France 48
KLM 39
Alitalia 36
Aer Lingus 26
GWur:ubenAeBlGnaKt. xageRkamenHKwCasnÞsSn_éføGWur:Uey:nenAkñúgry³eBl30éf¶enA
cenøaH0eTA100°.
99.24 99.37 98.33 98.91 98.51 99.38 99.71 99.21 98.63 99.10
KNna , , nigemdüan.
2
2.5 xageRkamenHCatémøeRbgkateqARbcaMéf¶KitCaduløarkñúg1)aEr:lenAkñg
ú rdUvekþAqña2M 007³
17.5 17.6 18.3 17.9 17.4 16.9 17.1 18
17.2 18.3 17.8 17.1 18.3 17.5 17.4. 17.7
9-mifuna 760
8-mifuna 719
7-mifuna 715
4-mifuna 730
3-mifnu a 721
2.7 KNnamFümrbs;sMNMusakl³ 6 3 5 7 6
Compute the mean of the following population values: 6 3 5 7 6
2.8 KNnamFümrbs;sMNMusakl³ 7 5 7 3 7 4
Compute the mean of the following population values: 7 5 7 3 7 4
2.9 k> KNnamFümrbs;):an;KRmÚxageRkam³ 5, 9, 4, 10
x> bgðajfa X X 0
a. Compute the mean of the following sample values: 5, 9, 4, 10
b. Show that X X 0 .
x>bgðajfa X X 0
a. Compute the mean of the following sample values: 1.3, 7.0, 3.6, 4.1, 5.0.
b. Show that X X 0 .
cMeBaHlMhat;TI2>13-2>16
k> KNnamFümnBVnþ nig
x> bgðajfaetIvaKWCa Sample statistic rW Population parameter
2.13 Rkúmh‘un Midtown Ford )anCYlbuKÁlikEpñklk;cMnn Y 10nak;. cMnYnLanfµIEdl
)anlk;kal BIExmunedaybuKlÁ ikEpñklk;mandUcteTA³
15, 23, 4, 19, 18, 10, 10, 8, 28, 19
EpñkKNenyüenAkñúgRkúmh‘un)anrab;cMnnY énkarehATUrs½BÞkñúgmYyéf¶eTAkan;
2.14
The Median
2.21 etIGñkraykarN_tMélelxNaCam:Utrbs;sMNuMTinñn½yRbsinebImancMnnY
Tinnñ ½ysrub³
k> Tinnñ ½ycMnYn10 ehIytémønImYy²mindUcKñaeT
x> Tinnñ ½ycMnYn6 ehIyelxTaMgGs;dcU Kña
K> Tinnñ ½ycMnYn5 ehIymantémø 1/ 2/ 3/ 4 nig4?
cMeBaHlMhat; 2.18 – 2.19
k> emdüan nig
x> m:Ut
2.22 xageRkamenHKWCabrimaNeRbg\nnÞ³ERbRbÜlry³eBl7éf¶cugeRkayenA Jiffy Lube enAkac;RCúg Elm
2.23xageRkamenHKWCabERmbRmÜlPaKryénR)ak;cMeNjBIqña1M 997dl;1998sRmab;):an;KMrURkúm
h‘unsMNg;cMnnY 12enA Denver.
5 1 -10 -6 5 12 7 8 2 5 -1 11
Shopping MallenAevlaem:ag10RBwkenH.
12 8 17 6 11 14 8 17 10 8
cugeRkayenH. kMNt;mFümFrNImaRténGRtakMenInRbcaMqñaM;kúñgry³eBlenH.
9.4 13.8 11.7 11.9 14.7
Gñksþab;cMnnY 50nak;RtÚv)aneRCIserIsCa)a:n;KMrUnigmanbgðajenAkñúgbMEngEck
GayudUcxageRkam. BüakrN_mFümGayurbs;Gñksþab;.
2.32 bc©úb,nñmanburs nigRsþIRbmaN1/2lancUlrYmkñúgkatBVkicb © eRmIkgT½B
បកប្រប និងររៀបររៀងរោយៈ រលក យឹង ហាក់
Page 32
សាកលវ ិទាល័យបាត់ដំបង Business Statistics
83 64 84 76 84 54 75 59 70 61
63 80 84 73 68 52 65 90 52 77
95 36 78 61 59 84 95 47 87 60
2.37 cMnYnvIedGUkaemr:a (Temban Camcorders) plitkñúgry³eBl 8em:agkñúg1ev:n ehIyxageRkam
enHCacMnYnev:n 50EdleRCIserIsedayécdnü. kMnt;mFüméncMnYnvIedGUkaemr:aEdl)anplitkñúgev:n
¬ry³eBl8em:ag¦nImYy².
348 371 360 369 376 397 368 361 374
410 374 377 335 356 322 344 399 362
384 365 380 349 358 343 432 376 347
385 399 400 359 329 370 398 352 396
366 392 375 379 389 390 386 341 351
354 395 338 390 333
RtÜtBinitüBItémøeRbg\nÞn³muneBl
2.38 American Automobile Association
មេម ៀនទី៣
របូបាបននបាំណែងណែកន័ ម៉ាល់
(The Normal Probability Distribution)
3.1- លកេណៈននបំប្ណងប្ចកន័រា៉ា ល់
• ប្ខសររកាងានកំ ពូលរៅចំចំណុចកណា
ត លននបំ ប្ណងប្ចក។
(Mean)រស្ាើោន(ឬឆលុោះោន)។
• បំ ប្ណងប្ចករនោះឆលុោះោនរធៀបនឹងមធយម(Mean)។
• ប្ខសរកាងខិ តជិតអ្័កសរដកប្តមិនកាត់។
3.2- របូបាបននបំប្ណងប្ចកស្តង់ោន័រា៉ា ល់
x
z
Example:
នឹ ង$150 និ ងគាលតស្តង់ោ$41។
ដំរណាោះស្រសាយ
x 70 150
- For x=$70 Z 1.95
41
A Z-value = 1.22 indicates that the salary of $200 is 1.22 standard deviation above the mean of
$150, and a Z-value = –1.95 indicates that the salary of $ 70 is 1.95 standard deviation below
Example1:
ប៉ាុ នាាន?
ប៉ាុ នាាន?
ប៉ាុ នាាន?
អ្តិថិជនរកុមរនោះរស្ាើនឹងប៉ាុនាាន?
ដំរណាោះស្រសាយ
Example 2:
សាស្រសាតចរយMannបានកំណត់ ថ្នមធយមភាគពិនុបញ្
ទ ា ប់ កុងមុ
ន ខវ ិជាាស្ថិតិរបស្់ោត់ ជាបំ ប្ណង ប្ចក
ភាគពិនុទាបបំ
ទ ផុតប្ដលនិស្សិតអាចទទួ លបាននិ រទទស្Aរស្ាើនឹងប៉ាុ នាាន?
ដំរណាោះស្រសាយ
បំ ប្ណងប្ចកស្ំណាកននមធយមស្ំណាកជាបំ ប្ណងប្ចករបូបាបននមធយមស្ំណាកទាំងអ្ស្់ននទំ ហំ
មធយមស្ំណាកនិ មួយៗ។
Example:
ដំរណាោះស្រសាយ
ចំនួនស្ំណាកប្ដអាចររជើស្ររ ើស្រោយនចដនយ
បំប្ណងប្ចកររបកង់ននមធយមស្ំណាក៖
មធយមននមធយមស្ថត
ិ ិស្ណា
ំ ក និងមធយមស្ថត
ិ ិស្កល៖
3.3.1- រទឹស្ល
ីត ម
ី ីតកណា
ត ល )The Central Limit Theorem(
• If all samples of a particular size are selected from any population, the sampling
• The central limit theorem is that when we take more samples, we will find the distribution
3.3.2- កំហស្
ុ គំរនូ នមធយមស្ំណាក (Standard Error of Sample Mean)
3.4- ស្ង់ចរនាលោះរជឿជាក់
3.4.1- ស្ង់ចរនាលោះរជឿជាក់ស្រាប់មធយមស្កល µ
confidence interval.
s
• In general, a confidence interval ( CI )for the mean is computed by: CI : X Z
n
Example:
រលករពឹទធបុរស្ននមហាវ ិទាល័យរណិជាកមាចង់បា៉ាន់ សាានចំនួនរា៉ា ងស្ិកាជាមធយមរបចំស្បាតហ៍
ចំ ណុចវយតនមលគឺ24រា៉ា ង(មធយមស្ថិតិស្ំណាក)។
ក. ចូ ររកលរមែៀងស្តង់ោននស្ថិតិស្ំណាករនោះ។
ស្កល)របស្់និស្សិតរៅកនុងមហាវ ិទាល័យរបស្់ោត់ ។
ដំ រណាោះស្រសាយ
3.4.2- ស្ង់ចរនាលោះរជឿជាក់ស្រាប់ផលរធៀបស្ាារតស្កល
Example:
Matt Williams ជាអ្នកររៀបចំ គររាងហិរញ្ា វតថុបានរធវើការវ ិភាគគររាងចូ លនិ វតតន៍ របស្់អ្ក
ន រធវើការវ ័យ
ផលរធៀបស្ាារតរបស្់អ្នករធវើការប្ដលានគររាងលក់ផទោះ និ ងបតូររៅរស្់រៅArizona។
ដំ រណាោះស្រសាយ
លំហាត់
c) etImanb:n
u µanéf¶eRkABI 300éf¶ énqñaEM dlkarlk;nwgTabCag 330Ékta?
3>
4> tMrUvkarcMeBaHplitplmYyKWCabMENgEckFmµta EdlmanmFüménkarlk;RbcaMéf¶ 300Ékta ehIy
nwg Standard deviation 30Ékta.
a) KNnaPaKryéncMnYnéf¶ Edlkarlk;nwgx<s;Cag 350Ékta?
c) etImanb:n
u µanéf¶ eRkABI 300éf¶énqñaM Edlkarlk;nwgTabCag 330Ékta?
d) KNnakMriténkarlk; EdlkMNt;farvl;CageK 20% énqñaM?
មេម ៀនទី៤
មតស្តស្េមតិកេម
(Tests of Hypothesis)
4.1- និយមន័យ
4.1.1- អ្វីជាស្មាតិកមា?
Example:
-The service quality is the main factor leading to the customer loyalty.
- Microfinance is the first priority in contributing to financial help for the poor to improve the
income generation.
4.1.2- រតស្តស្មាតិកមា
គររាងរតស្តស្មាតិកមា
2. កំ ណត់តំបន់ របាោះបង់H0។
3. គណនារតស្តស្ិត
ថ ិ
4. រធវើការស្ររមចចិតតរលើH0 និងស្ននិោឋនរលើលទធផលរតស្ត។
4.1.3- ស្មាតិកមាស្ន
ូ យ
តនមលពិតរបស្់ស្ិត
ថ ិ ស្កលមួ យ ឬពី រ រនាោះកនុងស្មាតិកមាH0ចំបាច់ រតូវររបើ ស្ញ្ញា ។
4.1.4- ស្មាតិកមាជររមើស្
រតស្តដូចខាងររកាម៖
ក.រតើកុងរខតត
ន ប.បឆ្នំរនោះានទិននផលស្រស្ូវជាមធយមតិ ចជាង3.5T/haឬរទ?
ខ.រតើកុងរខតត
ន ប.បឆ្នំរនោះានទិននផលស្រស្ូវជាមធយមររចើនជាង3.5T/haឬរទ?
គ.រតើកុងរខតត
ន ប.បឆ្នំរនោះានទិននផលស្រស្ូវជាមធយមមិ នរស្ាើនឹង3.5T/haឬរទ?
ដំរណាោះស្រសាយ
ក.រតើកុងរខតត
ន ប.បឆ្នំរនោះានទិននផលស្រស្ូវជាមធយមតិ ចជាង3.5T/haឬរទ?
ស្មាតិកមា៖ ឆ្នំរនោះទិននផលស្រស្ូវជាមធយមតិចជាង3.5T/ha។រគបាន
H1:ានមធយមតិចជាង3.5T/ha ( 3.5 )
H1:ានមធយមររចើនជាង3.5T/ha ( 3.5 )
របស្់ស្ិត
ថ ិ ស្កលរនាោះ។ ករមិតរធវើរតស្តប្ចកជាពីរករមិ តគឺ កំហុស្របរភទI និងកំ ហុស្របរភទII តារង
ខាងររកាមបង្ហាញពី លទធផលរតស្តស្មាតិកមាប្ដលានលកេណៈស្ថិតិ៖
Event
State of nature
Decision H0 True H0 False
Researcher
Type II Error
Do not reject H0 Correct
(b)
Type I Error
Reject H0 Correct
(a)
កំហស្
ុ របរភទI(a): របាោះបង់ ស្មាតិកមាស្ូនយរពលស្មាតិកមាស្ូនយវជាតនមលពិតននបា៉ា រ៉ា ប្ម៉ា រតសាកល
រនាោះ។
កំហស្
ុ របរភទII( b ): មិ នរបាោះបង់ ស្មាតិកមាស្ូនយរពលស្មាតិកមាស្ូនយវមិនប្មនជាតនមលពិតននបា៉ា រ៉ា
ប្ម៉ា រតសាកលរនាោះ។
4.1.6- រតស្តស្ត
ិថ ិ
រតស្តស្ិត
ថ ិ ជាតនមលរលខប្ដលគណនារចញពី ស្ិត
ថ ិ ស្ំណាកររបើវកនុងការស្ររមចចិតតរបាោះបង់ ស្មាតិកមា
4.1.8- តនមលរបូបាប
តនមលរបូបាបជារបូបាបននតនមលរលខប្ដលគណនាបានពី រតស្តស្ិត
ថ ិ ររបើវកនុងការស្ររមចចិតត និ ង
4.2.1- ផលរធៀបស្ាារតសាកលមួយ
ស្ាគល់រោយ p ប្ដល៖
p
z
(1 )
n
: population proportion
p : sample proportion
n : number of observations
Example:
កាលពី មុនានអ្នកចូ លរួមបង់វ ិភាគទានគឺ15%ននអ្នកប្ដលបានរផញើmailទាំអ្ស្់។លិខិតនរ ៉ាអ្ង្ហគស្់របាក់
ស្ននិោឋនថ្នលិខិតថាីរនោះានរបស្ិទិ ភា
ឋ ពខុ ស្ពី 15%បានឬរទ? ចូ រររបើករមិតរធវើ រតស្ a=0.05។
ដំ រណាោះស្រសាយ
4.2.1- រតស្តផលរធៀបស្ាារតសាកលពីរ
រតស្តស្ិត
ថ ិ ស្រាប់ផលរធៀបសាកលពី រគឺ ៖
p1 p2
z
pc (1 pc ) pc (1 pc )
n1 n2
ដំ រណាោះស្រសាយ
លំហាត់
1.The following hypotheses are given.
H o: 0.70
H 1: 0.70
A sample of 120 observations revealed that p =0.80
a. Using the 0.01 level of significance, state the decision rule.
b. Compute the value of the test statistic.
c. What is your decision regarding the null hypothesis?
2.The following hypotheses are given.
H o: .40
H 1: .40
A sample of 120 observations revealed that p =0.30. At the 0.05 significance level, can the null
hypothesis be rejected?
age of 6, and 46 out of 55 children age 6 and over recognized Joe Camel, the brand symbol of
Camel Cigarette (Journal of Advertising, Winter 1995).
a. Use a 95% confidence interval to estimate the proportion of all children that recognize
Joe Camel. Interpret the interval.
b. Do the data indicate that recognition of Joe Camel increases with age? Test using
a 0.05 .
6. Independent random samples were selected from two binominal populations. Size and
number of observed successes for each sample are shown in the table below.
Sample 1 Sample 2
n1 = 200 n2 = 200
x1 = 110 x2 = 130
មេម ៀនទី៥
មតស្តស្េមតិកេមអាំពីតនេលេធយេ
(t-Test)
ទំ ហំស្ំណាកធំ ជាង។
5.2- រតស្តស្មាតិកមាអ្ំពីតនមលមធយមសាកលមួយ
រតស្តស្ិត
ថ ិ ស្រាប់តនមលមធយមសាកលមួ យ៖
X
t
s/ n
Example:
ប្ផនកឯកសាររៅរកុមហុនធនារ៉ា ប់ រង McFarland រយការណ៍ថ្នការចំណាយមធយមរលើឯកសាររបើក
រធវើរតស្ត 0.05។
ដំ រណាោះស្រសាយ
5.3- រតស្តស្មាតិកមាអ្ំពីតនមលមធយមសាកលពីរឯករជយ
រតស្តស្ិត
ថ ិ ស្រាប់តនមលមធយមស្ំណាកពី រ៖
x1 x2
t 1 1
S 2p
n1 n2
Example:
ការស្ិកាថាីៗមួ យរបស្់EPAបានររបៀបរធៀបចំ ង្ហយផលូវជាមធយមកនុងសាំងមួយលីរត(គិតជាkm/l) របស្់
រថយនតកុងស្រស្ុ
ន ក និងរថយនតនាំចូល។ស្ំណាករថយនតកុងស្រស្ុ
ន កចំនួន15ររគឿងបានបង្ហាញថ្នចំ ង្ហយ
ដំ រណាោះស្រសាយ
រតស្តស្ិត
ថ ិ ស្រាប់តនមលមធយមស្ំណាកគូ រ៖
d
t
Sd / n
Example:
ភានក់ ង្ហរានក់ចង់ រធវើរតស្តររបៀបរធៀបតនមលស្រាប់ ការជួ លរថយនតរបចំនថាពីរកុមហុនពី រ Hertz និ ង
ររកាម៖
ខុ ស្ោនបានឬរទ? ចូ រររបើករមិតរតស្ត0.05។
ដំ រណាោះស្រសាយ
មុ នរពលរធវើរតស្តស្មាតិកមារតូវប្តគណនា
Company
City Hertz (x1) Avis (x2) d = (x1 - x2) (d d ) 2
Atlanta 42 40 2 1
Chicago 57 52 5 4
Cleveland 45 43 2 1
Denver 48 42 6 9
Honolulu 39 41 -2 25
Kansas City 48 45 3 0
Miami 41 39 2 1
Seattle 50 44 6 9
Total = 24 50
បកប្រប និងររៀបររៀងរោយៈ រលក យឹង ហាក់
Page 61
សាកលវ ិទាល័យបាត់ដំបង Business Statistics
d
d 24
3, Sd
(d d ) 2
50
50
2.67
n 8 n 1 8 1 7
លំហាត់
1. The following hypothesises are given:
H 0 : 10
H1 : 10
For a random sample of 10 observations the sample mean was 12 and the sample standard deviation
2. Using the 0.05 significance level:
a. State the decision rule
b. Compute the value of the test statistic
c. What is your decision regarding the null hypothesis?
មេម ៀនទី៦
កា វិភាគវ៉ា យង់(ANOVA)
6.1- លកេណៈននបំប្ណងប្ចក F
• Each member of the family is determined by two parameters: - the numerator degrees of
6.2- រតស្តតនមលវ៉ារយង់រស្ាោ
ើ ន ននសាកលពីរ
គររាងរតស្តវ៉ារយង់រស្ាោ
ើ ន៖
H 0 : 12 22
Step 1. Hypothesis:
H1 : 12 22
Step 2. Decision rule
- Significant level a
s12 and s22 are the sample variance for the two samples.
មធយមរលើស្ិត
ថ ិ ស្ំណាកននភាគហុនSoftwareចំ នួន10គឺ12% ជាមួ យគាលតស្តង់ោស្ថិតិស្ំណាក3.9%
។អ្រតាននផលចំ រណញមធយមរលើស្ិត
ថ ិ ស្ំណាកននភាគហុនUtilities ចំ នួន8គឺ
ដំ រណាោះស្រសាយ
The F distribution is also used for testing the equality of more than two means using a
ស្កលkរតូវប្ត Normal
ស្ំណាកនិ មួយៗឯករជយ
គររាងរតស្តវ៉ារយង់មួយកតាត៖
• Decision rule: For a given significance level a , reject the null hypothesis if F
freedom.
EXAMPLE:
អាចស្ននិោធនថ្នានការខុស្ោ
ន ន កនុងចំ នួន មធយមនននំបុ័ងសាច់ ប្ដលបានលក់កុងមួ
ន យនថារៅហាងទាំង
បី បានឬរទ?
Treatment
X1 X2 X3
13 10 18
12 12 16
14 13 17
12 11 17
13 11 18
ដំ រណាោះស្រសាយ
• When we reject the null hypothesis that the means are equal, we may want to know
• If the confident interval includes zero, there is not a difference between the treatment
means.
From EXAMPLE develop a 95% confidence interval for the difference in the mean
number of meat loaf dinners sold in Point Place (pop #3) and Sylvania (pop #1). Can
SUMMARY STATISTICS
Groups n Sum Meam
Sylvania 5 64 12.8
Perrys burg 5 57 11.4
Point place 5 86 17.2
1 1 1 1
x3 x1 t MSE 17.2 12.8 2.179 0.9
n1 n2 5 5
4.4 1.31 3.09 up to 5.71
The 95% confident interval ranges from 3.09 up to 5.71. Both end points are positive; hence, we
can conclude that these treatment means differ significantly. That is the mean number of meat
From EXAMPLE develop a 95% confidence interval for the difference in the mean
number of meat loaf dinners sold in Point Place (pop #3) and Sylvania (pop #1). Can
SUMMARY STATISTICS
Groups n Sum Meam
Sylvania 5 64 12.8
Perrys burg 5 57 11.4
Point place 5 86 17.2
1 1 1 1
x3 x2 t MSE 17.2 11.4 2.179 0.9
n1 n2 5 5
5.8 1.31 4.49 up to 7.11
• The 95% confident interval ranges from 3.09 up to 5.71. Both end points are positive;
hence, we can conclude that these treatment means differ significantly. That is the mean
number of meat loaf dinners sold is a difference between the two restaurants.
លំហាត់
6.1- Use Table F-critical values to find each of the following F-value:
6.2- For each of the following cases, identify the rejection region that should be used to test
a. a .10
b. a .05
c. a .025
d. a .01
6.3- For each of the following cases, identify the rejection region that should be use to test
a. a .20
b. a .10
c. a .05
d. a .02
H 0: 1 2
2 2
6.4- Specify the appropriate rejection region for testing in each of the following
situations:
H 1 2 ;a .01, n1 31, n2 41
2 2
d. a
H a 1 2 ;a .05, n1 7, n2 16
2 2
e.
6.5- Independent random samples were selected from each of two normally distributed
populations, n1=12 from population 1 and n2=27 from population 2. The means and variances
Sample 1 Sample 2
n1= 12 n2= 27
x1 31.7 x 2 37.4
s1 2 3.87 s 2 2 8.75
6.6- Independent random samples were selected from each of two normally distributed
populations,
n1=6 from population 1 and n2=5 from population 2. The data are shown in the next table.
Sample 1 Sample 2
3.1 2.3
4.4 1.4
1.2 3.7
1.7 8.9
.7 5.5
3.4
6.7- Two independent ransom samples were selected from normally distributions with means
and variance (µ1, 21) and ( µ2, , 22), respectively. The sample size, means, and variance are
Sample 1 Sample 2
n1 = 20 n2 = 15
x1 123 x1 123
s21 = 31.3 s22 = 120.1
a. Test H0: 21= 22 against Ha: 21 ≠22. Use α = 0.05.
b. Would you be willing to use a t- test the null hypothesis H0: µ1 = µ2 against the alternative
H 0 : 1 2
2 2
H1 : 1 2
2 2
A random sample of eight observations from the first sample resulted in a standard deviation of
10. A random sample of six observations from the second sample gave a standard deviation of
7. At the 0.02 significance level, is there a difference in the variation of the two populations?
H0 : 1 2
2 2
H1 : 1 2
2 2
A random sample of five observations from the first sample resulted in a standard deviation of
12. A random sample of seven observations from the second sample showed a standard
deviation of 7. At the .01 significance level, is there more variation in the first population?
6.10- The following is dample information. Test the hypothesis that the treatment means are
8 3 3
6 2 4
10 4 5
9 3 4
6.11- The following is sample information. Test the hypothesis that the treatment means are
Treatment
X1 X2 X3
9 13 10
7 20 9
11 14 15
9 13 14
12 15
10
6.12- The manager of a computer software company is studying the number of hours top
executives spend at their computer terminals by type of induatry. A sample of five executives
from each of three industries is obtained. At the .05 significance level, can the manager
conclude there is a difference in the mean number of hours spent at a terminal per week by
industry?
12 8 10
10 8 8
10 6 6
12 8 8
10 10 10
6.13- The following sample information is given. Test the hypothesis that the treatment means
Treatment Treatment
1 Treatment 2 3
8 3 3
11 2 4
10 1 5
3 4
f. If
H 0 is rejected, can we conclude that tretment 1 and treatment 2 differ? Use the 95
6.14 The following is sample information. Test the hypothesis that the treatment means are
Treatment Treatment
1 Treatment 2 3
3 9 6
2 6 3
5 5 5
1 8 5
3 5 5
1 4 4
7 1
6 5
មេម ៀនទី៧
វិធីអបា៉ារ៉ាណេ៉ារត
(Chi-Square Test)
7.1-លកេណៈននបំប្ណងប្ចក Chi-Square
- It is positively skewed
- It is non-negative
Example:
ទិ នន
ន ័ យខាងររកាមជាចំនួនអ្វតតានរតូវបានរបមូ លពី ររងចរកឧស្ាហ៍កមា
មួ យ។ រតើរគអាចស្ននិោឋនថ្នានការខុ ស្ោនរវងអ្រតាអ្វតតានតាមនថារធវើការរបស្់កមាករកនុងររងចរក
ដំរណាោះស្រសាយ
Example:
ថ ិ ស្ំណាកយុវវ ័យ
63.9% ,រមា៉ា យ 7.7%, ប្លងលោះ 6.9% និ ងរៅលីវ 21.5%។ របើ ស្ិត
ដំ រណាោះស្រសាយ
Example:
ចូ រររបើករមិ តរធវើរតស្ត0.05។
ដំរណាោះស្រសាយ
លំហាត់
7.1- In a particular chi-square goodness-of-fit test there are four categories and 200 observations.
Use the .05 significance level.
a. How many degree of freedom are there?
b. What is the critical value of chi-square?
7.2- In a particular chi-square goodness-of-fit test there are six categories and 500 obsevations.
Use the .01 significance level.
7.6- The director of human resources collected the following data on absenteeism by day of the
week. At the .05 significance level, can she conclude that there is a difference in the absence rate by
day of the week?
Day Frequency
Monday 124
Tuesday 74
Wednesday 104
Thursday 98
Friday 120
7.7- A group of department store buyers viewed a new line of dresses and their opinions of them.
The results were:
Number of Number of
Opinion Buyers Opinion Buyers
Outstanding 47 Good 39
Excellent 45 Fair 35
Very good 40 Undesirable 34
Because the largest number (47) indicated the new line is outstanding, the head designer thinks that
this is a mandate to go into mass production of the dresses. The head sweeper ( who somehow
became involved in this) believes that there is not a clear mandate and claims that the opinions are
evenly distributed among the six categories. He further states that the slight differences among the
various counts are probably due to chance. Test the null hypothesis that there is no significance
among the opinions of the buyers. Test at the .01 level of risk. Follow a formal approach; that is,
state the null hypothesis, the alternate hypothesis, and so on.
H 0: Forty percent of the observaions are in category A, 40 percent are in B, and 20 percent are in C
H 1: The observations are not as described in H 0 .
We took a sample of 60, with the following results.
Category f0
A 30
B 20
C 10
7.9- The chief of security of a large shopping mall was directed to study the problem of missing
goods. He selected a sample of 100 boxes that had been tampered with and ascertained that for 60
of boxes, the missing pants, shoes, and so on were attriuted to shoplifting. For 30 other boxes
employees had stolen the goods, and for remaining 10 boxes he blamed poor inventory control.
In his report to the mall management, can he say that shoplifting is twice as likely to be the
cause of the loss as compared with employee theft or poor inventory control? Use the .02 level.
7.10- The bank credit card department of Carolina Bank knows from long experience that 5 percent
of the card holders have had some high school, 15 percent have completed high school, 25 percent
have had some college, and 55 percent have completed college. Of the 500 card holders whose cards
have been called in for failure to pay their charges this month, 50 had some high school, 100 had
completed high school, 190 had some college, and 160 had completed college. Can we conclude that
the distribution of card holders who do not pay their charges is different from all others? Use the .01
significance level.
7.11- For many years TV executives used the guideline that 30 percent of the audience were
watching each of the prime-time networks and 10 percent cable stations on weekday night. A
random sample of 500 viewers in the Tampa-St. Petersburge, Florida, area last Monday night
showed that 165 homes were tuned in to the ABC affiliate, 140 to the CBS affiliate, 125 to the NBC
affiliate, and the remainder were viewing a cable station. At the .05 significance level, can we
conclude that the guideline is still reasonable?
7.12- The marketing director for a metropolitan daily newspaper is studying the relationship between
the type of community the reader lives in and the portion of the paper he or she reads first. For a
sample of readers the following information was collected.
National News Sports Comics
Urban 170 124 90
Rural 120 112 100
Farm 130 90 88
At the .05 significance level, can we conclude there is a relationship between the type of community
where the person resides and the portion of the paper read first?
7.13- Four brands of light bults are being considered for use in a large manufacturing plant. The
director of purchasing asked for samples of 100 from each manufacturer. The numbers of
acceptable and unacceptable bulbs from each manufacturer are shown below. At the .05 significance
level, is there a difference in the quality of the bulbs?
Manufacturer
A B C D
Unacceptable 12 8 5 11
Acceptable 88 92 95 89
Total 100 100 100 100
7.14- The Quality Control Department at Food Town, Inc., a grocery chain in upstate New York,
conducts a monthly check on the comparison of scanned prices to posted prices. The chart below
sumarizes the results of a sample of 500 items last month. Company management would like to
know whether threre is any relationship between error rates on regular priced items and specialy
priced items. Use the .01 significance level.
Regular Price Advertised Special Price
Undercharge 20 10
Overcharge 15 30
Correct Price 200 225
7.15- The use of cellular phones in automobiles has increased dramatically in the last few years. Of
concern to traffic experts, as well as manufactures of cellular phones, is the effect on accident rates.
Is smoeone who is using a cellular phone more likely to be involed in a traffic accident? What is your
conclusion from the following sample information? Use the .05 significance level.
Had Accident Did Not Have an
in the Last Year Accident in the Last Year
Cellular phone in use 25 300
Cellular phone not in use 50 400
7.16- Vehicles heading west on Front Street may turn right, left, or go straight ahead at Elm Street.
The city traffic engineer believes that half of the vehicles will continue straight though the
intersection. Of the remaining half, equal proportions will turn right and left. Two hundred vehicles
were observed, with the following results. Use the .10 significance level. Can we conclude that the
traffic enginner is correct?
Straight Right Turn Left Turn
Frequency 112 48 40
7.17- The publisher of a sports magazine plans to offer new subscribers of three gifts: a sweat-shirt
with the logo of their favorite team, a coffee cup with the logo of their favorite team, or a pair of
earnings also with the logo of their favorite team. In a sample of 500 new subscribers, the number
selecting each gift id reported below. At the .05 significance level, is there a preference for the gifts
or should we conclude that the gifts are equally well liked?
Gift Frequency
Sweatshirt 183
Coffee cup 175
Earrings 142
7.18- In a particular television market there are three commercial television stations, each with its
own evening news program from 6:00 to 6:30 P.M. According to a report in this morning’s local
newspaper, a random sample of 150 viewers last night revealed 53 watched the news on WNAE (
channel 5), 64 watched on WRRN ( channel 11), and 33 on WSPD ( channel 13). At the .05
significance level, is there a difference in the proportion of viewers watching the three channels?
7.19- There are four entrances to the Government Center Building in downtown Philadelphia. The
building maintenance supervisor would like to know if the entrances are equally utilized. To
investigate, 400 people were observed entering the building. The number using each entrance is
reported below. At the .01 significance level, is there a difference in the use of the four entrances?
Entrance Frequency
Main Street 140
Broad Street 120
Cherry Street 90
Walnut Street 50
Total 400
7.20- The owner of a mail-order category would like to compare her sales with the geographic
distribution of the population. According to the United State Bureau of the Census, 21 percent of
the population lives in the Northeast, 24 percent in the Midwest, 35 percent in the South, selected
from those shipped last month.
Region Frequency
Northeast 68
Midwest 104
South 155
West 73
Total 400
7.21- The owner of a mail order catalog would like to compare her sales with the geographic
distribution of the population. According to the United States Bureau of the Census, 21 percent of
the population lives in the Northeast, 24 percent in the Midwest, 35 percent in the South, and 20
percent in the West. Listed below is a breakdown of a sample of 400 orders randomly selected from
those shiped last month.
Region Frequency
Northeast 68
Midwest 104
South 155
West 73
Total 400
At the .01 significant level, does the distribution of the orders reflect the population?
7.22- Southern Medical Services performed 3,459 abortions last year. The number of previous
abortions reported by the same patients were:
Abortions Patients
0 2441
1 784
2 184
3 50
Are these data consistent with the national figures, which indicate that 67 percent of the patients
never had abortion before, 27 percent had one previous abortion, 5 percent had two previous
abortions, and 1 percent had 3 or more previous abortions? Use the .05 significance level.
7.23- Did you ever purchase a bag of M&M’s peanut candies and wonder about the distribution of
the colors? The manufacturer reports that 30 percent are brown, 30 percent yellow, and 10 percent
each are blue, orange, red, and green. A 6.0 ounce bag of M&M’s peanut candies purchased at the
BP Station at the Junction of l-75 and U.S.20 in Perrysburg, Ohio, had 17 brown, 20 yellow, 13 blue,
9 orange, 6 green, and 7 red, for a total of 72 candies. At the .05 significance level, can we conclude
the actual breakdown by color of the candies agrees with the expected distribution?
7.24- The American Association of Television Broadcasters recently reported the mean number of
television sets per household in the United States is 2.30 sets and that the standard deviation is 1.474
sets. A sample of 100 homes in Boise, Idaho, revealed the following number of sets per household:
Number of Number of Number of Number of
televsion sets households televsion sets households
0 7 3 18
1 27 4 10
2 28 5 or more 10
At the .05 significance level, is it reasonable to conclude that the number of television sets per
household follows the normal distribution?
7.25- A recent study by a large retailer designed to determine whether there was a relationship
between the importance a store manger place on advertising and the size of the store revealed the
following sample information:
7.26- Two hundred men selected at random from various levels of management were interviewed
regarding their concern about environmental issues. The response of each person was tallied into
one of three categories: no concern, some concern, and great concern. The results were:
មេម ៀនទី៨
I. esckþIepþIm
Time series KWCakarRbmUlpþMú (a collection)énTinñn½yRbvtþiEdl)anekIteLIgkñúgry³eBl knøgmk.
25
20
15
10
Trend
5
0
2000 2001 2002 2003 2004 2005
Year
Cyclical
Variation
Seasonal
Variation
សាកលវ ិទាល័យបាត់ដំបង Business Statistics
14
12
Sales (in thousand$)
10
8
6
4
2
0
2000 2001 2002 2003 2004 2005 2006
Year
25
15
10
0
Q1
Q2
Q3
Q4
Q1
Q2
Q3
Q4
Q1
Q2
Q3
Q4
Q1
Q2
Q3
Q4
Q1
Q2
Q3
Q4
Q1
Q2
Q3
Q4
Q1
Q2
Q3
Q4
2000 2001 2002 2003 2004 2005 2006
Year
100000
90000
80000
70000
60000
Sales
50000
40000
30000
20000
10000
0
1999 2000 2001 2002 2003 2004 2005 2006
Year
mFümnBnþrkM il 4 Fatu (four moving average), mFümnBnþrMkil 5 Fatu (five moving average),
k
xi
mFümnBnþrkM il 7 Fatu (seven moving average). K MA i 1
k
400 Actual Sales
Deseasonalized Sales
Seasonal adjusted series for Sales from SEASON, MOD_1, MUL EQU 7
350
300
250
200
150
100
50
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28
Time
eyIgBinittüeXIjfaTinñn½enHmansPaBeLIgcuHeTAtamxYbRbcaMs)aþh_nimYy² ehIymanninñakar
bnþBsI )aþh_mYyeTAs)aþh_mYeTot. dUcenHedIm,IeFVIkarBüakrN_brimaNlk;enAs)aþh_bnþbnÞab; eTot
eyIgRtUvBRgabTinñn½yenHedayeRbIR)as; Seasonal Index (SI) nig Deseasonalized Seasonal Sales.
bnÞab;mkeyIgnwgeRbIR)as; Least Squares Method edIm,IkMNt;smIkar Regression (Linear Trend
Equation) ecjTinñn½y Deseasonalized Seasonal Sales Data sMrab;eFVI BüakrN_nUvbrimaNlk;.
tarag 18.2 ³ karKNna Seasonal Index (SI) b¤ Typical Index taméf¶kgñú s)aþh_
300
250
200
150
100
50
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28
Time
a ³ KWCacMNucRbsBVénbnÞat;smIkarnwg y kñúgkrNI t = 0 .
n ( DS * t ) ( DS )( ) 28(68373.03) (4625.72629(406)
b 0.7115485
2
n t ( t )
2
28(7714) (406)2
DS t 4625.72629 406
a b 0.7115485. 154.8870572
n n 28 28
ANOVAb
Sum of
Model Squares df Mean Square F Sig.
1 Regression 925.015 1 925.015 18.475 .000a
Residual 1301.752 26 50.067
Total 2226.767 27
a. Predictors: (Constant), Time
b. Dependent Variable: Deseasonalized Sales
Coeffi ci entsa
Unstandardized St andardized
Coef f icients Coef f icients
Model B St d. Error Beta t Sig.
1 (Constant) 154.8870 2.748 56.370 .000
Time .711549 .166 .645 4.298 .000
a. Dependent Variable: Deseasonalized Sales
We have just discussed the smoothing technique by using moving average to smooth the time
series data and to calculate the seasonal indexes. In this section we use the regression equation to
fit the trend in time series, and then compute seasonal indexes.
4 4
Correction Factor : CF 0.9969
Total means of ratio 4.0124
or yˆ b0e Lnb1 (t )
120
R 2 = 0.9999
100
80
60
40
20
0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Time (t )
edIm,IkN
M t; smIkareGayTinñn½yenH ehIybMBak;LÚkarItenEBrelITinñn½ytamqñaM nimYy²
rYcKNnarkemKuNénsmIkarLÚkarItenH.
3.6.1 Least Square Method
2
Year Time(t) Import(y) Ln(y) (t)Ln(y) t
1995 1 3.00 1.098612 1.098612 1
1996 2 4.20 1.435085 2.870169 4
1997 3 5.70 1.740466 5.221399 9
1998 4 8.30 2.116256 8.465022 16
1999 5 11.50 2.442347 12.21174 25
2000 6 16.00 2.772589 16.63553 36
2001 7 22.40 3.109061 21.76343 49
2002 8 31.00 3.433987 27.4719 64
2003 9 44.60 3.797734 34.1796 81
2004 10 60.10 4.09601 40.9601 100
2005 11 84.30 4.434382 48.7782 121
2006 12 118.60 4.775756 57.30908 144
2007 13 163.90 5.099256 66.29033 169
Total 91 40.35154 343.2551 819
n (t )( Lny) t Lny
Least Square Method : b
2
n t t
2 nig a Lnn ( y) b nt
Lnb
n (t )( Lny) t Lny
13(343.2551) (91)( 40.35154) 790.3262
0.334035
1 2
n t t 2
13(819) (91) 2 2366
Ln( y ) t 40.35154 91
Lnb b 0.334035 0.765722
0 n n 13 13
Lnb1 0.334035
Lnb1 ( t )
yˆ b0e 2.1505e 0.334035( t )
200
Observed
Export (Y) in $million
Exponential
150
100
50
0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Sequence (Time)
លំហាត់
1. The numbers of bank failure for the years 1994 through 1998 are given below.
Determine the ordinary least squares (OLS) equation and estimate the number of failure
in 2001.
1994 1 79
1995 2 120
1996 3 138
1997 4 184
1998 5 200
2. The personal consumption expenditure for telephone and telegraph, in billions of dollars
in the United State for the years 1993 to 1998 are given below. Determine the least
1993 1 37.9
1994 2 39.8
1995 3 40.4
1996 4 42.7
1997 5 44.1
1998 6 47.1
3. The following table give the annual amount of scrap produced by Machine Products. Inc.
1994 1 2.0
1995 2 4.0
1996 3 3.0
1997 4 5.0
1998 5 6.0
Determine the least squares trend equation. Estimate the amount of scrap for the year 2000.
4. The amounts spent in vending machines in the United States, in billion dollars, for the
years 1994 through 1998 are given below. Determine the least square equation, and
1994 1 17.5
1995 2 19.0
1996 3 21.0
1997 4 22.7
1998 5 24.5
5. Sally’s software, Inc. is a rapidly growing supplier for computer software to the Sarasota
area. Sales for the last five years are given below.
1993 1.1
1994 1.5
1995 2.0
1996 2.4
1997 3.1
b. By what percent did sales increase. on the average. during the period?
6. It appears that the imports of the carbon black have been increasing by about 10
percent annually.
1987 92.0
1988 101.0
1989 112.0
1990 124.0
1991 135.0
1992 149.0
1993 163.0
1994 180.0
b. By what percent did imports increase. on the average. during the period?
7. Victor Anderson, the owner of Anderson Belts, Inc., is studying absenteeism among his
employees. His workforce is small, consisting of only five employees. For the three years
he recorded the following number of employee absences, in days, for each quarter.
Quarter
Year I II III IV
1996 4 10 7 3
1997 5 12 9 4
1998 6 16 12 4
8. The Appliance Center sells a variety of electronic equipment and home appliances. For
the last four years the following quarterly sales( in $ millions) were reported.
Quarter
Year I II III IV
bBa¢IÉksareyag
List of References
Robert D. Mason, Douglas A. Lind, and William G. Marchal(1999): Statistical Techniques in
Business and Economics, Irwin/McGraw-Hill.
Kwanchai A. Gomez et al.: Statistical Procedures for Agricultural Research, John Wiley & Sons,
Inc. Printed in Singapore.
James T. McClave, P. George Benson, and Terry Sincich(2005): Statistics for Business and
Economics, Pearson Prentice Hall.
Acheson J. Duncan, Ph.D. (1986): Quality Control and Industrial Statistics, Irwin/McGraw-Hill,
Richard D. IRWIN. INC
Margaret L. Lial, and Thomas W. Hungerford (1999): Mathematics with Applications in the
Management, Natural, and Social Sciences, Addison-Wesley Educational Publisher Inc. Printed
in the United States of America.
David M. Levine, David Stephen, Timothy C. Krehbiel, and Mark L. Berenson (2002) : Statistics
for Managers Using Microsoft Excel, Pearson Education, Inc., Uper Shaddle Rever, New Jersey,
07458, Printed in the United States of America.
Gerald Keller and Brian Warrack (2003). Statistics for management and economics. Thomson.
United states of America