Professional Documents
Culture Documents
Binomial Distribution
Binomial Distribution
Binomial distribution
Introduction
It was introduced in a journal that the binomial distribution is a form of
probability distribution in which it summarizes the likeness that a particular amount
that will get one of two independent values under a condition or set of parameters or
assumptions. There are such assumptions on the binomial distribution that there is
only one outcome for each trial and therefore each of these trials is mutually exclusive
or independent of one another. In statistics, the binomial distribution is one of the
commonest discrete distributions different from a continuous distribution, like the
normal distribution. The reason why the idea of the binomial distribution is opposing
to the normal distribution is that it counts only two states and this represents a 1 (for
success), or a 0 (for failure) that is given on the trials represented in the data.
Moreover, the binomial distribution represents the x and its probability for success in
the trials (n), given also a success probability (p) for each trial (Barone, 2021).
Symmetry Property of Binomial distribution
It has been discussed in an article released by Stattrek (2022) that the mean of
the binomial distribution is (p) and its standard deviation is represented in sqr (p(1-
p)/n). The shape of a binomial distribution is said to be symmetrical when p=0.5 or
when the (n) is large.
Finding the Mean, the Variance, and the Standard Deviation of this Distribution
To determine the mean of the binomial distribution, it is important to
understand that the mean of the distribution (μx) is equal to n*P, then the variance
(σ2x) is n*P*(1-P), and the standard deviation (σx) is computed by sqrt [n *P * ( 1 - P )
] (StatsDirect, 2021).
Simple Example of Binomial Distribution
According to a study by the Corporate Finance Institute (2020), there are
several types of binomial distribution trials. The first is fixed trials, such as coin flips,
where the number of times a trial has been performed is recorded since the beginning.
If a coin is flipped ten times, each flip of the coin is referred to as a trial. Second,
there are independent trials, such as tossing a coin or rolling a die, where the first
event, in the case of tossing the coin, is considered to be independent of the
subsequent events. Third, the fixed chance of success, as demonstrated by the fact that
when a person tosses a coin, the likelihood of receiving a head is approximately 0.5,
Task 3
and if 50 trials are conducted, the expected value of the number of heads is 25.
(50x0.5).
Exercise 66
Introduction
Many firms use acceptance sampling as a quality control method to monitor
incoming shipments of various parts, raw materials, and other items. Parts are
frequently sent in huge batches from suppliers in the electronics industry. The n trials
of a binomial experiment are represented by a sample of the components indicated by
n. The expected outcome for each component evaluated (trial) will be a determination
of whether the component is good or defective. Reynolds Electronics accepts a lot
from a specific supplier if the number of defective components in the lot does not
exceed 1%. In the problem, there are supposed five items that are recorded as the
samples from a recent shipment.
Problem
a. Assume that 1% of the shipment is defective. Compute the probability that no
items in the sample are defective.
(Probability of Defective) P (D) = 1% = 0.01%
(Probability of Non-Defective) P (D’) = 1-1% = 0.99
P (A) = 5co (0.01)0(0.99)5 + 5c1(0.01)1(0.99)4
(0 defective) (1 defective)
made up so that the signs of + and - are equal in size or the population means are
equal to the given sample mean (Statistics Solution, 2021).
Testing of the Hypothesis using Binomial Distribution for both Non-Directional
and Directional Cases
In the journal released by Zach (2021), it has been
explained that the directional hypothesis is also known as an
alternative hypothesis that contains less than (represented
by the sign “<”) or those greater than (represented by the
sign “>”). however, when it comes to the non-directional
hypothesis, this is an alternative hypothesis containing the
not equal (represented by the sign “≠”).
1. A survey of 40 home prices in a metropolitan area has the following results:
Price Less than Equal to $200,000 More than $200,000
$200,000
Number of 13 1 27
Homes
a. Test the hypothesis that the median price in the metropolitan area is $200,000.
x̄ 1= 13
x̄ 2= 1
x̄ 3= 27
Above are the values of the mean of every number of homes given in
the table. The symbol x̄ represents the mean of every data in each column
provided.
a. Ho = (the median price in the metropolitan area is $200,000)
b. Ha= (the median price in the metropolitan area is not $200,000)
Test Statistic: t-test
Since this is a t-test and there are multiple samples, the groups are
divided into three parts which are groups 1& 2, groups 1&3, and groups 2 & 3.
Moreover, the alpha which is 0.05 will also be divided into 3 since the groups
for the t-test are subdivided into 3, the alpha is now 0.0166.
Conclusion
Task 3
The first group had yielded a P(T<=t) one-tail of 0.0285 and since this is not
less than 0.0166, then there is no significant difference in the responses of the groups
1 & 2 which are those less than and equal to $200,000.
The second group had yielded a P(T<=t) one-tail of 0.3037 and since this is
not less than 0.0166, then there is no significant difference in the responses of the
groups 1 & 3 which are those less than and more than $200,000.
The third group had yielded a P(T<=t) one-tail of 0.1878 and since this is not
less than 0.0166, then there is no significant difference in the responses of the groups
2 & 3 which are those equal and more than $200,000.
Since the group equal to $200,00 yields the middle amount for the data, this
means that although there is no significant difference in the p-value of these groups, it
is also concluded that the median is equal to $200,000.
b. Test the hypothesis that the median price in the metropolitan area is more than
$200,000
x̄ 1= 13
x̄ 2= 1
x̄ 3= 27
Above are the values of the mean of every number of homes given in
the table. The symbol x̄ represents the mean of every data in each column
provided.
c. Ho = (the median price in the metropolitan area is more than $200,000)
d. Ha= (the median price in the metropolitan area is not more than $200,000)
Test Statistic: t-test
Since this is a t-test and there are multiple samples, the groups are
divided into three parts which are groups 1& 2, groups 1&3, and groups 2 & 3.
Moreover, the alpha which is 0.05 will also be divided into 3 since the groups
for the t-test are subdivided into 3, the alpha is now 0.0166.
Conclusion
Task 3
The first group had yielded a P(T<=t) one-tail of 0.0285 and since this is not
less than 0.0166, then there is no significant difference in the responses of the groups
1 & 2 which are those less than and equal to $200,000.
The second group had yielded a P(T<=t) one-tail of 0.3037 and since this is
not less than 0.0166, then there is no significant difference in the responses of the
groups 1 & 3 which are those less than and more than $200,000.
The third group had yielded a P(T<=t) one-tail of 0.1878 and since this is not
less than 0.0166, then there is no significant difference in the responses of the groups
2 & 3 which are those equal and more than $200,000.
Since the group 2 & 3 which represents values equal and more than $200,00 yields
the P(T<=t) of 0.1878, this means that since there is no significant difference in the p-
value of these groups, it can be concluded that the median value is more than
$200,000.
This table shows the significant changes and consistencies in the year
performances of the S&P. The total return is a method of evaluating all profits from a
transaction by taking into account both price appreciation and income production over
a specific period, usually a year, as defined by Lavrakas (2021). The total return,
sometimes called the total rate of return, is usually represented as a percentage and
allows you to compare the total return performance of various assets. In its most basic
form, a return, often known as a financial return, is the amount of money gained or
lost on a business over time, as defined by Investopedia (2021). A return can be
calculated using the change in the monetary value of an investment over time. The
profit-to-investment ratio can also be used to compute a return as a percentage. The
table also includes the Net Total Return, which is defined by Law Insider (n.d.) as the
performance of an index's underlying portfolio after any tax has been deducted,
including dividends, interest, and other income reinvested.
Task 3
40
30
20
10
0
2010 2012 2014 2016 2018 2020 2022
-10
-20
The scatter plot has been prepared to show the relations between the data from
the collected year performances from the year 2012 to 2020.
40
30
20
10
0
2010 2012 2014 2016 2018 2020 2022
-10
-20
This is a type of linear graph in which data are also seen and are more
emphasized when it comes to its increasing and decreasing plots. In performing such
regression analysis of data, the hypothesis may happen in a way that proves that there
is a significant change in the collected yearly data of S&P and is represented by the
null hypothesis that process that there is no significant change in the collected yearly
data of S&P.
To perform the successful computation of the regression analysis, several
methods have been used such as Google Sheets and the Add ons focusing on
Task 3
Regression
Statistics
Multiple R 0.9999999449
R Square 0.9999998898
Adjusted R
Square 0.9999998531
Standard
Error 0.006577924769
Observation
s 9
ANOVA
df SS MS F Significance F
0.000259614565
Residual 6 6 0.00004326909427
Total 8 2356.232889
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95%
- 0.0180561185 -
Intercept -0.02779486104 0.01873830464 -1.483317812 0.1885145325 0.07364584065 8 0.07364584065
RESIDUAL PROBABILITY
OUTPUT OUTPUT
This data represented had shown the results from the computation of the
valuable points in the regression analysis such as the regression statistics, the
ANOVA, the Coefficients, Standard error, t stat, P-value, residual outputs, and the
probability outputs which then proves the null hypothesis that there is a significant
change in the collected yearly data of S&P since the P-value of the data represents a
total of more than the alpha of 0.05 (as standard) which proves that the data is
changing over time (Analytics Vidhya, 2021).
The residual plot is a graph with the residuals on the vertical axis and the
independent variable on the horizontal axis. If the dots in a residual plot are randomly
distributed across the horizontal axis, a linear regression model is appropriate for this
study; otherwise, a nonlinear model appears to be more appropriate (Stattrek, n.d.).
plotted against a theoretical normal distribution. Deviations from the norm are
indicated by deviations from the straight line (National Institute of Standards &
Technology, n.d.).
Regression analysis indeed helps with the computation of values such as the p-
value and the coefficients computed for the plotting of data, however, another mode
of computation might also be suitable such as a t-test. The influence of the data is
positive in way that it can be visited by almost all people as long as they search for it
on the web, however, it is also an adverse situation to be part of the people involved
specifically when there are these computed low returns and that this is one of the
challenges in the field of trading and other financial assets.
Conclusion
Since the data from the graph shows a changing height of the line instead of
just a straight line, this means that the data is not consistent and this means that the
first rule which is the adequacy check in the regression analysis is not that achieved
through the given data. However, there is one thing confirmed in the analysis, which
is the significant change in the provided data set from the year’s performances.
Task 3
References
Barone, A. (2021, October 9). How binomial distribution works. Investopedia.
https://www.investopedia.com/terms/b/binomialdistribution.asp
Binomial distribution. (2022). Statistics and Probability.
https://stattrek.com/probability-distributions/binomial.aspx
Binomial distribution - StatsDirect. (2021). StatsDirect Statistical Analysis Software.
https://www.statsdirect.com/help/distributions/binomial.htm
Corporate Finance Institute. (2020, May 14). Binomial distribution - Definition,
criteria, and example.
https://corporatefinanceinstitute.com/resources/knowledge/other/binomial-
distribution/
Zach. (2021, April 29). What is a directional hypothesis? (Definition & examples).
Statology. https://www.statology.org/directional-hypothesis/
Sign test. (2021, August 2). Statistics Solutions.
https://www.statisticssolutions.com/free-resources/directory-of-statistical-analyses/
sign-test/
Kenton, W. (2022, February 15). S&P 500 index. Investopedia.
https://www.investopedia.com/terms/s/sp500.asp
Lavrakas, T. (2021, July 16). Understand the total value of your investments with a
total return. Forbes Advisor. https://www.forbes.com/advisor/investing/what-is-total-
return/
Return. (2003, November 25). Investopedia.
https://www.investopedia.com/terms/r/return.asp
Net total return definition. (n.d.). Law Insider.
https://www.lawinsider.com/dictionary/net-total-return
Everything you need to know about hypothesis testing in machine learning. (2021,
September 9). Analytics Vidhya.
https://www.analyticsvidhya.com/blog/2021/09/hypothesis-testing-in-machine-
learning-everything-you-need-to-know/
Residual analysis in regression. (n.d.). Statistics and Probability.
https://stattrek.com/regression/residual-analysis.aspx
National Institute of Standards & Technology. (n.d.). Normal probability plot.
https://www.itl.nist.gov/div898/handbook/eda/section3/normprpl.