PS Set-2

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 23

1 .

a) Define descriptive statistics and give any five applications in computer


science Engineering.
Ans: Descriptive Statistics:
Descriptive statistics is a branch of statistics that deals with the collection,
analysis, interpretation, presentation, and organization of data. It involves
summarizing and describing the main features of a dataset, providing insights
into its central tendency, variability, and distribution.
Applications in Computer Science Engineering:
1. Algorithm Analysis:
Descriptive statistics are employed to analyze the performance of algorithms,
including metrics such as runtime, space complexity, and execution efficiency.
This aids in comparing and optimizing algorithmic solutions.
2. Quality Assurance in Software Engineering:
Descriptive statistics help in quality assurance by summarizing and analyzing
metrics related to software testing. This includes defect rates, test coverage, and
performance metrics, contributing to software reliability.
3. Network Performance Monitoring:
In computer networks, descriptive statistics play a crucial role in monitoring
performance metrics such as latency, throughput, and packet loss. This
information is vital for optimizing network design and troubleshooting issues.
4. Data Analytics and Machine Learning:
Descriptive statistics are fundamental in the exploratory data analysis phase of
data analytics and machine learning. They help in understanding the
characteristics of datasets, identifying outliers, and making informed decisions
about data preprocessing.
5. Resource Management in Cloud Computing:
Descriptive statistics are utilized in cloud computing for resource management.
This involves analyzing resource usage patterns, such as CPU utilization,
memory consumption, and storage, to optimize resource allocation and cost-
effectiveness in cloud environments.
These applications highlight the significance of descriptive statistics in
the field of computer. science engineering, ranging from algorithm analysis to
quality assurance and resource management.
2.a) What do you understand by a measure of dispersion? What purpose does a
measure dispersion serve?

Measures of Dispersion

A measure of dispersion indicates the scattering of data. It explains the disparity


of data from one another, delivering a precise view of their distribution. The
measure of dispersion displays and gives us an idea about the variation and the
central value of an individual item.

In other words, dispersion is the extent to which values in a


distribution differ from the average of the distribution. It gives us an idea about
the extent to which individual items vary from one another, and from the central
value.

The variation can be measured in different numerical measures, namely:

(i) Range: It is the simplest method of measurement of dispersion and defines


the difference between the largest and the smallest item in a given distribution.
If Y max and Y min are the two ultimate items, then

Range = Y max – Y min

(ii) Quartile deviation: It is known as semi-interquartile range, i.e., half of the


difference between the upper quartile and lower quartile. The first quartile is
derived as Q, the middle digit Q1 connects the least number with the median of
the data. The median of a data set is the (Q2)second quartile. Lastly, the number
connecting the largest number and the median is the third quartile (Q3). Quartile
deviation can be calculated by

Q = ½ × (Q3 – Q1)

(iii) Mean deviation: Mean deviation is the arithmetic mean (average) of


deviations ⎜D⎜of observations from a central value (mean or median).

Mean deviation can be evaluated by using the formula: A = 1⁄n [∑i|xi – A|]

(iv) Standard deviation: Standard deviation is the square root of the arithmetic
average of the square of the deviations measured from the mean. The standard
deviation is given as,

σ = [(Σi (yi – ȳ) ⁄ n] ½ = [(Σ i yi 2 ⁄ n) – ȳ 2] ½


Apart from a numerical value, graphics methods are also applied for estimating
dispersion.

.
6. a) Two cards are drawn at random from an ordinary deck of 52 playing cards. What is the
probability of getting two aces if(i) the first card is replaced before the second card is
drawn;(ii)the first card is not replaced before the second card is drawn?
b. Fit the Poisson distribution for the following data.

x 0 1 2 3 4 5 Total
f 142 156 69 27 5 1 400
7 a) Samples of size 2 are taken from the population Let S = {1,5,6,8}, without replacement
find the following:
i. The mean of the population
ii. The standard deviation of the population
iii. Mean of the sampling distribution of means
iv The standard deviation of the sampling distribution of means.
b) Define (i) Population and samples (ii) Sampling distribution (iii) F distribution.
8.a) Assuming that the population standard deviation is 0.3, calculate the (i)
95% and (ii)99% confidence intervals for the mean lead concentration in a river
if the mean lead concentration recovered from a sample of lead measurements
in 36 different locations is 2.6 gms/ml.
b) In a random sample of 100 packages shipped by air freight 13 had some damage,
construct 95% confidence interval for the true proportion of damage package.
9. a) Explain the procedure generally followed in testing of hypotheses.
b) A random sample of 100 recorded deaths in the United States during the past year showed an
average life span of 71.8 years. Assuming a population standard deviation of 8.9 years, does this
seem to indicate that the mean life span today is greater than 70 years? Use a 0.05 level of
significance?
10. a) The means of two large samples of sizes 1000 and 2000 members are 67.5 inches and
68.0 inches respectively. Can these samples be regarded as drawn from the same population of
S.D. 2.5 inches?

b) In a study conducted by the Department of Human Nutrition and Foods at Virginia


Tech, the following data were recorded on sorbic acid residuals, in parts per million, in ham
immediately after dipping in a sorbate solution and after 60 days of storage:
Sorbic Acid Residuals in Ham
Slice Before Storage After Storage
1 224 116
2 270 96
3 400 236
4 444 329
5 590 437
6 660 597
7 1400 689
8 680 576

Assuming the populations to be normally distributed, is there sufficient evidence, at


the 0.05 level of significance, to say that the length of storage influences sorbic acid
residual concentrations?

You might also like