Professional Documents
Culture Documents
QUANTITATIVE REASONING
QUANTITATIVE REASONING
QUANTITATIVE REASONING
QUANTITATIVE REASONING
BS-III SEMESTER (Final Term Notes)
Mathematical models
A mathematical model is a description of a system using mathematical concepts, for example algebra, graphs,
equations and functions, and language, for example arithmetic signs. Mathematical modelling is the process of
developing mathematical models.
Deterministic models and statistical models (Probabilistic Model)
Mathematical models can be classified as either deterministic models or statistical models.
A deterministic model is a mathematical model in which the output is determined only by the specified
values of the input data and the initial conditions. This means that a given set of input data will always generate
the same output.
A statistical model is a mathematical model in which some or all of the input data have some
randomness, for example as expressed by a probability distribution, so that for a given set of input data the
output is not reproducible but is described by a probability distribution. The output ensemble is obtained by
running the model a large number of times with a new input value sampled from the probability distribution
each time the model is run. Statistical models can be run by using Monte Carlo simulation.
So, another definition of a statistical model is a mathematical description of a system that accounts for
uncertainty in the system. Statistical modelling is the process of forming a hypothesis for a statistical model on
a set of data, developing a model and then testing it on the data to see if the hypothesis is true.
Definition of Deterministic Models
Deterministic mathematical models are models in which the final outcome is entirely determined by the initial
conditions and parameters of the system. In these models, there is no random or uncertain component involved,
and they always produce the same output for the same input.
Characteristics of Deterministic Models
Deterministic models have the following characteristics.
They produce consistent results for the same input.
They do not involve any random component.
The future state of the system can be precisely predicted. Examples of deterministic models include
classical mechanics, geometric optics, and some deterministic optimization problems.
Linear Function
A linear function is a function that represents a straight line on the coordinate plane. For example, y = 3x - 2
represents a straight line on a coordinate plane and hence it represents a linear function. Since y can be replaced
with f(x), this function can be written as f(x) = 3x - 2.
What is a Linear Function?
A linear function is of the form f(x) = mx + b where 'm' and 'b' are real numbers. Isn't it looking like the slope-
intercept form of a line which is expressed as y = mx + b? Yes, this is because a linear function represents a
line, i.e., its graph is a line. Here,
'm' is the slope of the line
'b' is the y-intercept of the line
'x' is the independent variable
'y' (or f(x)) is the dependent variable
A linear function is an algebraic function. This is because it involves only algebraic operations.
Linear Function Equation
The parent linear function is f(x) = x, which is a line passing through the origin. In general, a linear function
equation is f(x) = mx + b and here are some examples.
f(x) = 3x - 2
f(x) = -5x - 0.5
f(x) = 3
Real Life Example of Linear Function
Here are some real-life applications of the linear function.
A movie streaming service charges a monthly fee of $4.50 and an additional fee of $0.35 for every
movie downloaded. Now, the total monthly fee is represented by the linear function f(x) = 0.35x + 4.50,
where x is the number of movies downloaded in a month.
A t-shirt company charges a one-time fee of $50 and $7 per T-shirt to print logos on T-shirts. So, the
total fee is expressed by the linear function f(x) = 7x + 50, where x is the number of t-shirts.
The linear function is used to represent an objective function in linear programming problems, to help minimize
the close, or maximize the profits.
How to Find a Linear Function?
We use the slope-intercept form or the point-slope form to find a linear function. The process of finding a linear
function is the same as the process of finding the equation of a line and is explained with an example.
Example: Find the linear function that has two points (-1, 15) and (2, 27) on it.
Solution:
The given points are (x1, y1) = (-1, 15) and (x₂, y₂) = (2, 27).
Step 1: Find the slope of the function using the slope formula:
m = (y₂ - y1) / (x₂ - x1) = (27 - 15) / (2 - (-1)) = 12/3 = 4.
Step 2: Find the equation of linear function using the point slope form.
y - y1 = m (x - x1)
y - 15 = 4 (x - (-1))
y - 15 = 4 (x + 1)
y - 15 = 4x + 4
y = 4x + 19
Therefore, the equation of the linear function is, f(x) = 4x + 19.
Identifying a Linear Function
If the information about a function is given as a graph, then it is linear if the graph is a line. If the information
about the function is given in the algebraic form, then it is linear if it is of the form f(x) = mx + b. But to see
whether the given data in a table format represents a linear function:
Compute the differences in x-values.
Compute the differences in y-values
Check whether the ratio of the difference in y-values to the difference in x-values is always constant.
Example: Determine whether the following data from the following table represents a linear function.
x y
3 15
5 23
7 31
11 47
13 55
Solution:
We will compute the differences in x-values, differences in y values, and the ratio (difference in y)/(difference
in x) every time and see whether this ratio is a constant.
Since all numbers in the last column are equal to a constant, the data in the given table represents a linear
function.
Graphing a Linear Function
We know that to graph a line, we just need any two points on it. If we find two points, then we can just join
them by a line and extend it on both sides. The graph of a linear function f(x) = mx + b is
an increasing line when m > 0
a decreasing line when m < 0
a horizontal line when m = 0
There are two ways to graph a linear function.
By finding two points on it.
By using its slope and y-intercept.
Graphing a Linear Function by Finding Two Points
To find any two points on a linear function (line) f(x) = mx + b, we just assume some random values for 'x' and
substitute these values in the function to find the corresponding values for y. The process is explained with an
example where we are going to graph the function f(x) = 3x + 5.
Step 1: Find two points on the line by taking some random values.
We will assume that x = -1 and x = 0.
Step 2: Substitute each of these values in the function to find the corresponding y-values.
Here is the table of the linear function y = 3x + 5.
x y
-1 3(-1)+5 = 2
0 3(0)+5 = 5 Therefore, two points on the line are (-1, 2) and (0, 5).
Step 3: Plot the points on the graph and join them by a line. Also, extend the line on both sides.
Graphing a Linear Function Using Slope and y-Intercept
To graph a linear function, f(x) = mx + b, we can use its slope 'm' and the y-intercept 'b'. The process is
explained again by graphing the same linear function f(x) = 3x + 5. Its slope is, m = 3 and its y-intercept is (0,
b) = (0, 5).
Step 1: Plot the y-intercept (0, b).
Here, we plot the point (0, 5).
Step 2: Write the slope as the fraction rise/run and identify the "rise" and the "run".
Here, the slope = 3 = 3/1 = rise/run.
So rise = 3 and run = 1.
Step 3: Rise the y-intercept vertically by "rise" and then run horizontally by "run". This results in a new point.
(Note that if "rise" is positive, we go up and if "rise" is negative, we go down. Also, if "run" is positive", we go
right and if "run" is negative, we go left.)
Here, we go up by 3 units from the y-intercept and thereby go right by 1 unit.
Step 4: Join the points from Step 1 and Step 2 by a line and extend the line on both sides.
Unlike exponential growth, linear growth doesn't have moments when it slows down or speeds up. Here, growth
is constant and it goes up by the same amount each time.
Linear Decay
Consider the relationship represented by the table shown below.
That is, an increase on the part of one variable introduces an increase on the part of the other variable or a
decrease on the part of one variable introduces a decrease on the part of the other variable.
For example, the number of hours spent on studying and final exam scores:
When the data points are close to forming a smooth line or graph that shows the negative relationship, we can
say there is a strong negative association.
The graph above shows a positive association that is quite strong.
Negative Association:
Both exponential decay and linear decay are examples of a negative association between two things.
A negative association happens, when two variables move in the opposite directions.
That is, an increase on the part of one variable results a decrease on the part of the other variable or a decrease
on the part of one variable results an increase on the part of the other variable. For example, the number of
absences over the semester and final exam scores:
The graph above shows a negative association that is quite strong.
Exponential Growth and Decay
Exponential growth and decay apply to physical quantities which change in value or form in a rapid manner.
The change can be measured using the concept of exponential growth and exponential decay, and the new
obtained quantity can be obtained from the existing quantity. The formulas of exponential growth and decay are
f(x) = a(1 + r)t, and f(x) = a(1 - r)t respectively.
Let us learn more about exponential growth and decay, the formula, applications, with the help of examples,
FAQs.
What Is Exponential Growth And Decay?
Exponential growth and decay apply to quantities which change rapidly. Exponential growth and decay have
been derived from the concept of geometric progression. Quantities that do not change as constant but change in
an exponential manner can be termed as having an exponential growth or exponential decay.
The simplest representation of exponential growth and decay is the formula abx, where 'a' is the initial quantity,
'b' is the growth factor which is similar to the common ratio of the geometric progression, and 'x' in the time
steps for multiplying the growth factor. For exponential growth, the value of b is greater than 1 (b > 1), and for
exponential decay, the value of b is lesser than 1 (b < 1).
Exponential growth finds applications in studying bacterial growth, population increase, and money growth
schemes. Exponential decay refers to a rapid decrease in a quantity over a period of time. The exponential decay
can be used to find food decay, half-life, and radioactive decay. The formulas of exponential growth and decay
are as presented below.
Exponential growth uses a factor 'r' which is the rate of growth. Here the r-value lies between 0 and 1 (0 < r <
1). The term (1 + r) can be taken as the growth factor. And’t’ is the time steps which is the number of times the
growth factor is to be multiplied. The value of’t’ can be a whole number or a decimal number. For exponential
decay, the growth factor is (1 - r), which has a value lesser than 1.
Formulas of Exponential Growth and Decay
The exponential growth and decay have different interpretations of the formulas which are interrelated and can
be interpreted differently. The below table shows the three different formulas of exponential growth and decay.
P = Poekt P = Poe-kt
In the above formulas the 'a' or Po is the initial quantity of the substance. Further for exponential growth
b = 1 + r = ek and for exponential decay we have b = 1 - r = e-k.
Applications of Exponential Growth and Decay
The concept of exponential growth and decay can be observed in numerous day-to-day scientific and industrial
activities. Let us check a few important applications of exponential growth and decay.
Bacterial Growth: The initial bacterial growth can be observed in numerous communicable diseases.
The recent Covid-19 is a quick example of exponential growth where the disease is highly
communicable and is transmitted from one to many, and then to many more people. Bacteria and viruses
are growing in an exponential manner, and the initial bacteria grows at an exponential rate. The initial
bacterial if taken to double every second would grow in numbers such as 1, 2, 4, 8, 16, 32, 64, 128, 256,
512... With this, we can observe the manner of exponential growth in bacteria.
Nuclear Chain Reactions: The nuclear chain reactions can be broadly classified as nuclear fission and
nuclear fission reactions. The concept of nuclear fusion can be linked with exponential growth and the
concept of nuclear fission can be linked with exponential decay. Nuclear fusion is a reaction in which
two or more atoms combine to form a larger atom and this kind of reaction can be observed in the core
of the sun. Nuclear fission is a kind of exponential decay that can be observed in radioactive material, in
which the initial quantity decomposes and we have a smaller quantity by the end of the observed time
period.
Feedback: The concept of feedback, more so of customer feedback grows in an exponential manner.
This can be observed more so in negative feedback. The bad functioning of the product if experienced
by a customer is shared by this customer to another person, who in turn shares it with two or more
people, and each of those people again shares it with two or more people. The medium of the internet
helps for this easy sharing of feedback which flows exponentially to a larger customer segment. Thus
the feedback in this current day of the internet is conveyed at a rapid pace.
Processing Power of Computers: The processing and storage power has now increased exponentially.
Earlier the data storage was only in MB, which has now grown to GB and TB exponentially. The slow
growth in the computer hardware and storage system in the initial 1970s and 1980s has now grown
exponentially. The initial floppy disks and hard disk drives have now been replaced with cloud servers
which can be easily assessed by the user through an internet-connected mobile device.
Food Degradation: Food degradation can also be understood as a case of exponent growth and decay.
The food remains good for a certain amount of time and then it degenerates exponentially. The food is
slightly stale initially, and then it stales rapidly until we discover that it has spoilt completely. This is
also a typical case of exponential decay, where once the decay process initiates, then it decays at a rapid
pace until the entire food quantity is completely stale.
Aging of Human Beings: The aging process of humans or any living being at the ending part of the
lifetime follows an exponential decay process. The person remains hale and healthy for a normal course
of life for about 60 years. This is the same reason that for many jobs the retirement age is set as 60 years.
After this, the aging process is so rapid that it affects the body at an exponential rate. The degradation in
the quality of life seen in the initial days is further degraded drastically in the later years. This could be
because of the advancement of certain existing diseases or the malfunctioning of certain organs.
Internet Content: The internet is exploding with information. In the initial stages of the internet, google
had to collect useful information and add it to the net. But with time the internet users started adding
information to the internet, and now the amount of information now available on the internet is mind-
boggling. Also now the application of artificial intelligence algorithms helps build the content in an
exponential manner. The content is generated thousands and millions of times in a short span of time.
CHAPTER-II
What is Probabilistic?
A probabilistic method or model is based on the theory of probability or the fact that randomness plays a role in
predicting future events. The opposite is deterministic, which is the opposite of random — it tells us something
can be predicted exactly, without the added complication of randomness.
What is a Probabilistic Model?
Probabilistic models incorporate random variables and probability distributions into the model of an event or
phenomenon. While a deterministic model gives a single possible outcome for an event, a probabilistic model
gives a probability distribution as a solution. These models take into account the fact that we can rarely know
everything about a situation. There’s nearly always an element of randomness to take into account. For
example, life insurance is based on the fact we know with certainty that we will die, but we don’t know when.
These models can be part deterministic and part random or wholly random.
Random variables from the normal distribution, binomial distribution and Bernoulli distribution form the
foundation for this type of modeling.
A normal distribution curve, sometimes called a bell curve, is one of the building blocks of a probabilistic
model.
What is the Probabilistic Method?
The probabilistic method, first introduced by Paul Erdős, is a way to prove the existence of a structure with
certain properties in combinatorics. The idea is that you create a probability space, and — choosing elements at
random — prove than any random element from the space has both a positive probability and the properties
sought after. The method is widely used in a variety of disciplines, including: statistical physics, quantum
mechanics, and theoretical computer science.
Waiting time between eruptions and the duration of the eruption for the Old Faithful Geyser in Yellowstone
National Park, Wyoming, USA. This scatterplot suggests there are generally two “types” of eruptions: short-
wait-short-duration, and long-wait-long-duration.
What is Bivariate Analysis?
Bivariate analysis means the analysis of bivariate data. It is one of the simplest forms of statistical analysis, used
to find out if there is a relationship between two sets of values. It usually involves the variables X and Y.
Univariate analysis is the analysis of one (“uni”) variable.
Bivariate analysis is the analysis of exactly two variables.
Multivariate analysis is the analysis of more than two variables.
The results from bivariate analysis can be stored in a two-column data table. For example, you might want to
find out the relationship between caloric intake and weight (of course, there is a pretty strong relationship
between the two. You can read more here.). Caloric intake would be your independent variable, X and weight
would be your dependent variable, Y.
Bivariate analysis is not the same as two sample data analysis. With two sample data analysis (like a two
sample z test in Excel), the X and Y are not directly related. You can also have a different number of data
values in each sample; with bivariate analysis, there is a Y value for each X. Let’s say you had a caloric intake
of 3,000 calories per day and a weight of 300lbs. You would write that with the x-variable followed by the y-
variable: (3000,300).
Two sample data analysis
Sample 1: 100,45,88,99
Sample 2: 44,33,101
Bivariate analysis
(X,Y)=(100,56),(23,84),(398,63),(56,42)
Types of Bivariate Analysis
Common types of bivariate analysis include:
1. Scatter plots,
These give you a visual idea of the pattern that your variables follow.
A simple scatterplot.
2. Regression Analysis
Regression analysis is a catch all term for a wide variety of tools that you can use to determine how your data
points might be related. In the image above, the points look like they could follow an exponential curve (as
opposed to a straight line). Regression analysis can give you the equation for that curve or line. It can also give
you the correlation coefficient.
3. Correlation Coefficients
Calculating values for correlation coefficients are usually performed on a computer, although you can find the
steps to find the correlation coefficient by hand here. This coefficient tells you if the variables are related.
Basically, a zero means they aren’t correlated (i.e. related in some way), while a 1 (either positive or negative)
means that the variables are perfectly correlated (i.e. they are perfectly in sync with each other).
Importance of bivariate analysis
Bivariate analysis is an important statistical method because it lets researchers look at the relationship between
two variables and determine their relationship. This can be helpful in many different kinds of research, such as
social science, medicine, marketing, and more.
Here are some reasons why bivariate analysis is important:
Bivariate analysis helps identify trends and patterns: It can reveal hidden data trends and patterns by
evaluating the relationship between two variables.
Bivariate analysis helps identify cause and effect relationships: It can assess if two variables are
statistically associated, assisting researchers in establishing which variable causes the other.
It helps researchers make predictions: It allows researchers to predict future results by modeling the
link between two variables.
It helps inform decision-making: Business, public policy, and healthcare decision-making can benefit
from bivariate analysis.
The ability to analyze the correlation between two variables is crucial for making sound judgments, and this
analysis serves this purpose admirably.
y is the predicted value of the dependent variable (y) for any given value of the independent variable (x).
B0 is the intercept, the predicted value of y when the x is 0.
B1 is the regression coefficient – how much we expect y to change as x increases.
x is the independent variable ( the variable we expect is influencing y).
e is the error of the estimate, or how much variation there is in our estimate of the regression
coefficient.
Linear regression finds the line of best fit line through your data by searching for the regression coefficient (B 1)
that minimizes the total error (e) of the model.
While you can perform a linear regression by hand, this is a tedious process, so most people use statistical
programs to help them quickly analyze the data.
The Correlation Coefficient r
Besides looking at the scatter plot and seeing that a line seems reasonable, how can you tell if the line is a good
predictor? Use the correlation coefficient as another indicator (besides the scatterplot) of the strength of the
relationship between x and y.
The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is a numerical measure of the
strength of association between the independent variable x and the dependent variable y.
The correlation coefficient is calculated as:
Figure 10
NOTE: Strong correlation does not suggest that x causes y or y causes x. We say “correlation does not imply
causation.” For example, every person who learned math in the 17th century is dead. However, learning math
does not necessarily cause death!
Difference between Correlation and Regression
Correlation and regression are both used as statistical measurements to get a good understanding of the
relationship between variables. If the correlation coefficient is negative (or positive) then the slope of the
regression line will also be negative (or positive). The table given below highlights the key difference between
correlation and regression.
Correlation Regression
If you are using a small dataset (n ≤ 30) that is approximately normally distributed, use
the t distribution instead.
The t distribution follows the same shape as the z distribution, but corrects for small sample sizes. For
the t distribution, you need to know your degrees of freedom (sample size minus 1).
Check out this set of t tables to find your t statistic. We have included the confidence level and p values for
both one-tailed and two-tailed tests to help you find the t value you need.
For normal distributions, like the t distribution and z distribution, the critical value is the same on either side of
the mean.
Confidence interval for the mean of normally-distributed data
Normally-distributed data forms a bell shape when plotted on a graph, with the sample mean in the middle and
the rest of the data distributed fairly evenly on either side of the mean.
The confidence interval for data which follows a standard normal distribution is:
Where:
CI = the confidence interval
X̄ = the population mean
Z* = the critical value of the z distribution
σ = the population standard deviation
√n = the square root of the population size
The confidence interval for the t distribution follows the same formula, but replaces the Z* with the t*.
In real life, you never know the true values for the population (unless you can do a complete census). Instead,
we replace the population values with the values from our sample data, so the formula becomes:
Where:
ˆx = the sample mean
s = the sample standard deviation
Example: Calculating the confidence intervalIn the survey of Americans’ and Brits’ television watching habits,
we can use the sample mean, sample standard deviation, and sample size in place of the population mean,
population standard deviation, and population size.
To calculate the 95% confidence interval, we can simply plug the values into the formula.
For the USA:
So for the USA, the lower and upper bounds of the 95% confidence interval are 34.02 and 35.98.
For GB:
So for the GB, the lower and upper bounds of the 95% confidence interval are 33.04 and 36.96.
Hypothesis Testing
Hypothesis Testing Example
The best way to solve a problem on hypothesis testing is by applying the 5 steps mentioned in the previous
section. Suppose a researcher claims that the mean average weight of men is greater than 100kgs with a
standard deviation of 15kgs. 30 men are chosen with an average weight of 112.5 Kgs. Using hypothesis testing,
check if there is enough evidence to support the researcher's claim. The confidence interval is given as 95%.
Hypothesis Testing and Confidence Intervals
Confidence intervals form an important part of hypothesis testing. This is because the alpha level can be
determined from a given confidence interval. Suppose a confidence interval is given as 95%. Subtract the
confidence interval from 100%. This gives 100 - 95 = 5% or 0.05. This is the alpha value of a one-tailed
hypothesis testing. To obtain the alpha value for a two-tailed hypothesis testing, divide this value by 2. This
gives 0.05 / 2 = 0.025.
Related Articles:
Probability and Statistics
Data Handling
Data
Important Notes on Hypothesis Testing
Hypothesis testing is a technique that is used to verify whether the results of an experiment are
statistically significant.
It involves the setting up of a null hypothesis and an alternate hypothesis.
There are three types of tests that can be conducted under hypothesis testing - z test, t test, and chi
square test.
Hypothesis testing can be classified as right tail, left tail, and two tail tests.