Professional Documents
Culture Documents
1 Marketing Business Analytics Notes Final 1 To 33
1 Marketing Business Analytics Notes Final 1 To 33
Manisha Kapoor
• Further for students who wish to specialize in analytics, the course provides a strong
foundation in the application of marketing analytics with analytical platforms.
Perceptual Mapping can help us to answer If I want to shift myself to my ideal point/market leader
what strategy /changes in positioning messages I need
following questions to make ?
• What is the need of segmentation? What if we do not segment? One size fit all? Benefits
of segmentation
• Different ways of segmentation?
Benefits of Segmentation:
• Identify right customers, whom to target?
• Segment profiles (segmenting and targeting variables) – how to target?
6. Pricing Analytics
In Pricing we start by understanding people’s willingness to pay, then we translate it into
demand curves and then finally set optimal price from the demand curve. So we will fit
distributions to willingness to pay data and we will look at how to obtain a demand curve
from it and then ultimately we will look at some examples of optimization of prices.
7. Social-Media Analytics
Social media is gradually being recognized as a significant resource for understanding customer
sentiments, the reputation of products, willingness to buy, and customer satisfaction with
products and/or services.
Social media analytics in hospitality and tourism/film industry/mobile companies
Social Media Analytics
• Text mining & Sentiment Analysis
Marketing Business Analytics Session2 Prof. Manisha Kapoor
R TUTORIAL
In this session we will cover the basics of working in R with emphasis on working with data frames. We
will also learn the basics of R Markdown and how to create HTML reports. Please note in this course
we will do all the analysis in RMD file.
```{r}
```
Here x is object to which we have assigned value 2+4. The symbol '<-' is used to assign value to an
object. Run this block using Ctrl+Enter in windows and command + Enter in Mac. In order to print this
value type x. Check the environment when you run this code. Also, try typing X.
```{r}
x<- 2+4 #sum
x
```
Bottom right corner - variety of windows. Plots will appear here under “Plots” tab. You can install
packages and view already installed ones under the “Packages” tab. And you can type in names of
functions under “Help” tab to find explanation of their operation*
```{r}
read.csv("http://goo.gl/qw303p")
seg.df <- read.csv("http://goo.gl/qw303p")
seg.df
```
```{r}
head(seg.df) #Returns first 6 rows of your data
colnames(seg.df) #Returns column names
dim(seg.df) #Number of rows and column i.e. no of observations & variables
str(seg.df) #Structure of different variables
tail(seg.df) ##Returns last 6 rows of your data
```
seg.df data has 300 rows and 7 columns. Gender, own home, subscribe and segment are categorical
variables.
```{r}
summary(seg.df)
```
Marketing Business Analytics Session2 Prof. Manisha Kapoor
The summary function generates a full summary of the data set. For categorical variable it tells the
frequency. The seg data set has 7 variables 143 Males and 157 Females.
**Metric variables**
Age, income, kids and segment are metric variables.
The summary function gives 5 value summary, mean and missing values if any. Missing values are
coded as NAs in R. There are no missing values.
*As can be seen from this summary, the segmentation data set contains answers from 157 women and
143 men. The age of the respondents is a metric variable summarised by the minimum value (Min.),
the first quartile (1st Qu.),the median, the mean, the third quartile (3rd Qu.), and the maximum (Max.).
The youngest respondent is 19, and the oldest 80 years old. Half of the respondents are between 33 and
47 years old.
**Categorical variables**
The summary function shows the frequency of categorical variables i.e.how frequently they occur.
There are three categorical variables i.e. Gender, own home, subscribe and segments. These categorical
variables are saved as factors. (Check in environment)
R provides a standard way to describe relationships among variables through formula specification. A
formula uses the tilde (∼) operator to separate response variables on the left from explanatory variables
on the right. The basic form is: y∼x
This is used in many contexts in R, where the meaning of response and explanatory depend on the
situation. For example, in linear regression, the simple formula above would model y as a linear function
of x. In the case of the aggregate() command, the effect is to aggregate y according to the levels of x.
```{r}
seg.df <- read.csv("http://goo.gl/qw303p")
seg.df
aggregate(income ∼ Segment, data=seg.df, mean)
```
The general form is aggregate(formula, data, FUN).In our example,we tell R to “take income by
Segment within the data set seg.df, and apply mean to each group.”
```{r}
aggregate(income ∼ Segment + ownHome + subscribe, data=seg.df, mean)
```
B. [ausvacmot_Dataset](https://homepage.boku.ac.at/leisch/MSA/datasets/vacation.csv)
15. Read data & name it ausvacmot.df.
16. Summarize Education by Gender and Occupation. Which cohort scores highest on education? and
which one scores lowest?
17. Summarize Age by Gender and State. Which cohort is the oldest ? What is the average age of this
cohort?
Marketing Business Analytics Session2 Prof. Manisha Kapoor
We can check associations between two categorical variables and later check Chi Square test for
significance.
```{r}
table(Gender = seg.df$gender, seg.df$ownHome)
table_A <-table(Gender = seg.df$gender, seg.df$ownHome)
chisq.test (table_A)
```
No significant difference between men and women in terms of home ownership as p value is greater
than 0.05.
In R, we obtain a numeric summary of the data with command summary(). This command returns the
range, the quartiles, and the mean for numeric variables.
For categorical variables, the command returns frequency counts. The command also returns the
number of missing values for each variable.
Helpful graphical methods for **numeric data** are histograms, boxplots and scatter plots.
Bar plots of frequency counts are useful for the visualisation of **categorical variables**.
Histograms visualise the distribution of numeric variables. They show how often observations within a
certain value range occur. Histograms reveal if the distribution of a variable is unimodal and symmetric
or skewed. To obtain a histogram, we first need to create categories of values. We call this **binning**.
The bins must cover the entire range of observations, and must be adjacent to one another. Usually, they
are of equal length. Once we have created the bins, we plot how many of the observations fall into each
bin using one bar for each bin.
We plot the bin range on the x-axis, and the frequency of observations in each bin on the y-axis.A
number of R packages can construct histograms. We use package lattice because it enables us to create
histograms by segments.
Marketing Business Analytics Session2 Prof. Manisha Kapoor
```{r}
install.packages("lattice")
library("lattice")
```
```{r}
histogram(~ age, data = seg.df)
```
By default, this command automatically creates bins. We can gain a deeper understanding of the data
by inspecting histograms for different bin widths by specifying the number of bins using the argument
breaks:
```{r}
histogram(~ age, data = seg.df, breaks = 30,type = "density")
```
This command leads to finer bins, as shown in the plot above. The finer bins are more informative,
revealing that the distribution is bi-modal with many respondents aged around 35–50 and around 20-30
years.
Argument type = "density" rescales the y-axis to display density estimates. The sum of the areas of all
bars in this plot ads up to 1. Plotting density estimates allows us to superimpose probability density
functions of parametric distributions.
```{r}
barplot(table(seg.df$ownHome))
barplot(table(seg.df$ownHome), ylab ='Frequency', main = 'Barplot of Home ownership') #with labeled axis
fem_ownHome <- table (seg.df$ownHome[seg.df$gender=='Female']) #filtered by gender
fem_ownHome
barplot(fem_ownHome,ylab ='Frequency', main = 'Barplot of Female Home ownership')
```
**Exercise 4 : Visualisation**
24. Create Histograms of p1sales and p2sales using 20 number of bins of store.df dataset. Which product
has better sales? Why ?
25. Create barplot of State by Females
26. Create barplot of State by Income2
Marketing Business Analytics Session3 Prof. Manisha Kapoor
What is a market?
• Market is the people that you realistically think will be attracted in your product.
• Market is your prospective customer base
What is the market for?
Automobile Kids wear Tea brand Pet supplies ex. Brand that sells
brand brand Dog Food Meat
No, we can’t because people in a market do not share universal wants and needs. THEY HAVE
DIFFERENT WANTS, NEEDS, TASTES AND PREFERENCES
Marketing Business Analytics Session3 Prof. Manisha Kapoor
Whether they should offer one product or all the options? Which option they should select?
The mid-way!
The compromise or mid-way would be to collect data from customer about their preferences and try to
cluster that data or those preferences and try to group them into homogeneous segment. As that
people within a certain segment would want more or less the
same product features and that product would be different from 2 SEGMENT
• Notebooks
• Tablets
• Laptops
• Desktops
Marketing Business Analytics Session3 Prof. Manisha Kapoor
Basis of segmentation
What is the approach of creating these segments? On what basis can we segment the market?
Benefits of segmentation
By understanding people’s preferences, by segmenting them into homogeneous segments, you are
customizing the product to their taste, you are personalizing experiences of the customer if it is a service.
If the product or the service comes close to what people want, then they would be more satisfied.
Satisfied customers are loyal and they come back, and they can also refer your products to others. So,
what you gain out of this is that when you appeal to people preferences, you get customer loyalty and
retention. From the firm the fact that you're segmenting the market, some markets are appealing, some
segments may not be appealing to you, it is your decision to decide on which segment to target. So, in
that case, it's an exercise that can help you identify valuable customers. And now that because you know
what your people want and need, then you can have more targeted communication. You have more
targeted promotions product people. At the end, what you get is what we call higher customer life-time
value,
Benefits of segmentation -
Marketing Business Analytics Session4 Prof. Manisha Kapoor
1 20 18
2 20 16 Concept of clustering is related to distance. The Euclidean
distance between person 1 and 2 is 2.
3 30 20
Similarly, you can calculate the distance between all the
4 25 15
respondents as given below.
5 25 12
The lesser is the distance between two cases/people, the
6 30 22 similar they are.
Hierarchial clustering works on Euclidean distances or any form of distances. And we have to supply
it in form of matrix. It then identifies the closest people. So, in this case person 1 and 2 have shortest
distance. So, it will group one and two into one cluster and 3 and 6 in another cluster at a distance of
two. How many clusters do we have now – 4.
Okay now we have to update this distance matrix. And we have to select the linkage criteria
Step 2 : Update D using minimum distance rule and Minimum linkage criteria
Okay now we have to update this distance matrix. And we have to select the linkage criteria. So, we
have 1 and 2 as cluster and 3&6 as cluster. What is the distance between (1 and 2 ) and 3? It’s like if I
have to calculate distance between Delhi and Gurgaon
- whether I take distance from South Delhi (1) to new Gurgaon (3).
- Or take distance from North Delhi(2) to new Gurgaon (3).
We check this through minimum linkage criteria which we have selected in this example
Example :
D[1,2]3 = Min [10.2,10.8] = 10.2
D[1,2]6 = Min [10.8,11.6] = 10.8
D[1,2][3,6] = Min [10.2,10.8] = 10.2
D [1,2]4 = Min [5.8,5.1] = 5.1
D [1,2]5 = Min [7.8,6.4] = 6.4
D [1,2][4,5] = Min [5.1,6.4] = 5.1
Dendrogram
Marketing Business Analytics Session4 Prof. Manisha Kapoor
The hierarchical clustering method begins with each observation in its own cluster. It then successively
joins neighboring observations or clusters one at a time according to their distances from one another
and continues this until all observations are linked. This process of repeatedly joining observations and
groups is known as an agglomerative method.
From this dendrogram you can see which
cluster of respondents joined first and were
similar. Remember Our goal is to join similar
cluster together and try not to join dissimilar
cluster. We can do this by looking at the
joining distance so the cluster which join early
or smaller joining distance are more similar
but the clusters which joined at larger joining
distance are most dissimilar.
From this dendrogram can you tell how many
clusters we will get if we cut the tree at joining
distance of 15? You need to count the number
of branches it cuts by drawing a hypothetical
parallel line starting from joining distance of 15. You can see the dendrogram suggests 3 cluster
solution.
Marketing Business Analytics Session4 Prof. Manisha Kapoor
K- Means clustering
- K--means requires the specification of the number of clusters in advance, say S=3.
- The method aims to group the observations based on their similarity using an optimization
procedure. Indeed, the aim is to minimize the within-cluster variation which is defined as the
sum of square of the euclidean distance between each data point to the centroid of its cluster.
More precisely, the algorithm works as follow:
- K--Means algorithm:
- Start by randomly assigning each subject to a cluster, s=1,…,S
- Compute the centroid of each cluster and the distance of each subject to each of the clusters
centroids
- Reassign each subject to the cluster with closest centroid
- Repeat steps 2 and 3 until no further reassignment is possible (i.e., when the within--cluster
variance is minimized)
K-means is a way of taking a dataset and finding groups (or clusters) of points that have similar
properties. K-means works by grouping the points together in such a way that the distance between all
the points and the midpoint of the cluster they belong to is minimized.
Midpoint - Think of the simplest possible example. If you are asked to create 3 groups for the points
below and draw a star where the middle of each group would be, what would you do?
Step 2 Measure the distance between the 1st point and 3 initial clusters
Measure and cluster using the mean values – Measure the distance of a
Step 6 point with the mean of each cluster
Calculate the center of each cluster and recluster and keep doing this
Step 7 iterations till the cluster assignment doesn’t change
Marketing Business Analytics Session5 Prof. Manisha Kapoor
A Marketing Application
The objective of segmentation is to find -
MARKETING
STRATEGY
Marketing Business Analytics Session5 Prof. Manisha Kapoor
Steps in Clustering
Data pre-processing
1. Missing values
2. Standardizing/Scaling the data
Euclidean distance
hclust() output
Three clusters
Marketing Business Analytics Session5 Prof. Manisha Kapoor
Perceptual Mapping
• A perceptual map enables the marketing analyst to determine how a product perceived
compared to the competition so that they can (re) position them appropriately.
• When a perceptual map of products or brands is available, the marketing analyst can use an
individual’s ranking of products to find the individual’s ideal point or most-wanted location
on perceptual map
• This provides a useful graph for marketers to understand the competition structure among
brands, products or industries.
• So for the brand rating data like the one we will use here, performing a perceptual map can let
us understand how each brand is comparatively positioned in peoples’ minds. It is also helpful
for developing a marketing strategy for differentiating from other competing brands.
Am I differentiated ?
Is there any current gap in the market ? – new product & existing product
• If we can reduce the data to it’s underlying dimensions , we can more clearly identify the
relationships among concepts
Marketing Business Analytics Session6 Prof. Manisha Kapoor
What is PCA?
An unsupervised learning PCA calculates a new projection PCA creates plots based
mathematical procedure that of the data set and the new axis on correlations among samples
uses an orthogonal based on the standard deviation or in other words a PCA plot
transformation to convert a set of variables converts the correlation among
of observations of possibly all of the sample into a 2 D (2
correlated variables into a set of dimensional) graph
linearly uncorrelated principal
components.
Marketing Business Analytics Session6 Prof. Manisha Kapoor
What is dimension?
• A dimension is underlying pattern or common variation across variables
• Gentle and Good for children tended to receive similar responses from respondents which
means they share common variance.
• There shared variance is 1 Dimension
• These dimensions are the backbone of perceptual mapping because they allow us to chart the
variables and brands relative to the dimensions and one another in a 2 Dimension map.
Marketing Business Analytics Session7 Prof. Manisha Kapoor
2. Increase differentiation, one possibility would be to take action to shift your brand in some
direction on the map.
Suppose you want to move in the direction of brand c. You could look
at the specific differences from c in the data:
```{r}
brand.mean["c", ] - brand.mean["e", ]
```
3. Not to follow another brand but aim for differentiated space where no brand is positioned
• Gap between f,g and b,c --→ value leader area
• How do we find out how to position there?
Summary
• To summarize, when you wish to compare several brands across many dimensions, it can be
helpful to focus on just the first two or three principal components that explain variation
in the data.
• You can select how many components to focus on using a scree plot, which shows how much
variation in the data is explained by each principal component.
• A perceptual map plots the brands on the first two principal components, revealing how the
observations relate to the underlying dimensions (the components).
• PCA may be performed using survey ratings of the brands (as we have done here) or with
objective data such as price and physical measurements, or with a combination of the two. In
any case, when you are confronted with multidimensional data on brands or products, PCA
visualization is a useful tool for understanding differences in the market.
• Product ratings, position of consumer segments, ratings of political candidates,
evaluations of advertisements, or any other area where you have metric data on
multiple dimensions that is aggregated for a modest number of discrete entities of
interest
REGRESSION BASICS
In this session we will cover the basics of Regression.
To give an overview, ML
Linear Regression is not just a
models can be classified on the
machine learning algorithm, it
basis of the task performed
plays a huge role in statistics.
and the nature of the output:
Regression: Output is a
Regression & Classification fall
continuous variable.
under supervised
Classification: Output is a
learning while Clustering fall
categorical variable.
under unsupervised learning
Clustering: No notion of output.
Regression is a form of
predictive modeling technique
where we try to find a
significant relationship between
a dependent variable and one
or more independent variables.
There are various types of
regression techniques: Linear,
Logistic, Polynomial, Ridge,
Lasso, Softmax.
• In supervised learning you train the machine using data which is well labelled.
Unsupervised learning is a machine learning technique, where you do not need to
supervise the model. Supervised learning allows you to collect data or produce a data
output from the previous experience
What is Regression?
• When you think of regression, think prediction. A regression uses the historical
relationship between an independent and a dependent variable to predict the future
values of the dependent variable.
• Businesses use regression to predict such things as future sales, stock prices, currency
exchange rates
• A regression models the past relationship between variables to predict their future
behavior.
• As an example, imagine that your company wants to understand how past
advertising expenditures have related to sales to make future decisions about
advertising.
Note : Dependent variable is what you have to calculate/predict. Here we are trying to predict future
sales. The dependent variable in this instance is sales and the independent variable is advertising
expenditures.
• Q How much sales I can generate next year if I spend 30 lakhs rupees on
advertisement?
o You can find this by creating a regression equation of past year sales and
advertisement data.
o (i.e. historical relationship)
o You already have advertisement numbers (that the company is thinking of
spending i.e. 30 lakhs) which you can put in place of X and can
calculate/predict Y i.e. future sales….
In this case, you would plot last year's data for monthly sales and advertising expenditures as shown on
the scatter plot below. (Data for independent and dependent variables must be from the same period of
time.)
1
Marketing Business Analytics Session9 Prof. Manisha Kapoor
Scatter plots are effective in visually identifying relationships between variables. These relationships
can be expressed mathematically in terms of a correlation coefficient, which is commonly referred to
as a correlation. Correlations are indices of the strength of the relationship between two variables. They
can be any value from –1 to +1.
When you use regression to predict future values of the dependent variable, the ideal correlation
between the independent and dependent variable is high—in absolute value terms, somewhere in the
range between .5 to .99. Viewing the scatter plot in the right, you can see that there appears to be some
degree of correlation between the level of advertising expenditure and product awareness. When
calculated, this correlation equals .89.
This historical data will enable you to predict the relationship between the two variables in the future,
before any further expense is incurred.
In order to make these predictions, a regression line must be drawn from the information appearing in
the scatter plot.
Regression Line
The figure below is the same as the scatter plot above, with the addition of a regression line fitted to the
historical data.
The regression line is the line with the smallest possible set of distances between itself and each data
point. As you can see, the regression line touches some data points, but not others. The distances of the
data points from the regression line are called error terms.
Marketing Business Analytics Session9 Prof. Manisha Kapoor
A regression line will always contain error terms because, in reality, independent variables are never
perfect predictors of the dependent variables. There are many uncontrollable factors in the business
world. The error term exists because a regression model can never include all possible variables; some
predictive capacity will always be absent, particularly in simple regression.
The typical procedure for finding the line of best fit is called the least-squares method. This calculation
is usually performed using computer software. In this calculation, the best fit is found by taking the
difference between each data point and the line, squaring each difference, and adding the values
together.
The least-squares method is based upon the principle that the sum of the squared errors should be made
as small as possible, so the regression line has the least error.
Hope you are able to understand now the error is the difference between the data points on predicted
line of regression and actual past data points of advertisements and sales.
So, how we arrive at future values…..
Regression Line : The best fit line and future predictions
Once this line is determined, it can be extended beyond the historical data to predict future levels of
product awareness, given a particular level of advertising expenditure.
The extension of the line of regression requires the assumption that the underlying process causing the
relationship between the two variables is valid beyond the range of the sample data. Regression is a
powerful business tool due to its ability to predict future relationships between variables such as these.
When you run a regression in R or SPSS, the program will provide you with a report.
Marketing Business Analytics Session9 Prof. Manisha Kapoor
We will now see the details of these reports, and the definition of all the terms included in the
report.
But, before that let’s check what is Sum of Square Residuals, Sum of Square Regression ….
Scenario 1 – when we have one variable i.e sales Scenario 2 – when we have two variables sales
and we are predicting sales of next month by and advertisement we can do much better
calculating mean prediction by using regression model.
Is it so?
So we compare the two scenarios to find
whether regression is doing good job or our old
method of taking mean of one variable is better.
How we do that?
We check SST, SSR AND SSE
Note : This comes in your ANOVA table. Try to
understand the relationship of all this with
Rsquare.
Marketing Business Analytics Session9 Prof. Manisha Kapoor
large explained variance then it is easy to see that our regression model was successful in
explaining the variance in the dependent variable and more effective is our model.
• The ratio of Explained Variance and Unexplained Variance is called F-Ratio
Adjusted R Square
• Adjusted R square value only accounts for significant predictors.
• The R square value will increase if we increase more number of independent variables even if
it doesn’t improve the model. R square cannot decrease as more independent variables are
added to the regression equation. Yet diminishing returns set in, so that after the first few
variables, the additional independent variables do not make much of a contribution
• Whereas the Adjusted R Square value will increase if we add significant predictors.
• So the Adjusted R Square value is adjusted for the sample size and independent variables (i.e.
number of predictors in the model)
N is number of samples
p is number of independent variable
Marketing Business Analytics Session10 Prof. Manisha Kapoor
Budget !
• First item on his checklist ? Budget ! It’s a 18h – approximately 800 kms – fun ride, so a total
of 24h on the road.
• The follow up question : How much money should Prateek allocate for petrol?
• He approaches this problem with a science-oriented mindset, thinking that there must be a
way to estimate the amount of money needed, based on the distance he will be travelling.
• First, he looks at some data.
Scenario 1
Prateek was laboriously tracking his bus’s efficiency for the last year – so somewhere in his computer
he had this spreadsheet
Training Data
He ends up with a mathematical model that describes the relationship b/w miles driven and money
spent to fill the tank
Marketing Business Analytics Session10 Prof. Manisha Kapoor
Scenario 2
Mitali and Kartikeya feels that Prateek needs to be more rigorous in his calculations and he should
also add oil price as an independent variable in this model
Training Data
He ends up with a mathematical model that describes the relationship b/w miles driven, oil price and
money spent to fill the tank
Marketing Business Analytics Session10 Prof. Manisha Kapoor
Scenario 3
Now, Kunal suggest that Prateek should also add number of passengers as an independent variable in
this model
Training data
He ends up with a mathematical model that describes the relationship b/w miles driven, oil price,
number of passengers and money spent to fill the tank
Scenario 4
Someone now suggest that Prateek should also add Driver’s Mood as an independent variable in this
model
Marketing Business Analytics Session10 Prof. Manisha Kapoor
He ends up with a mathematical model that describes the relationship b/w miles driven, oil price,
number of passengers, Driver’s mood and money spent to fill the tank
Change of mind !
At this, point some of the students starts reconsidering the current destination option and comes up
with 8 options.
Prateek doesn’t want to say no to his friends but in a class of 88 students it’s now quite difficult for
Prateek to calculate the amount for each destination option manually.
Sessions 11 to 33
Dr Shirish Jeble,
ICFAI Business School, Pune
The analytics technique used for identifying such affinity of products being bought together and using
it to increase sale of products is called as market basket analysis.
In this technique, first we collect POS transactions data from the retail stores. This data is used to
calculate a variable called as “Lift” which is an indicator of higher frequency of purchasing in the
transactions. Higher lift indicates, these 2 or 3 products are frequently purchased together.
Once we know the “Lift” of products, we can use this insight in various ways:
A. Placing of products next to each other so that a greater number of customers who purchase
one product are likely to pick up the second one
B. Though there is association between 2 products, one of them is may have lower sale. In such
cases, retailer can package or bundle them together to increase sale of one of the products
which has lower frequency of sale (Cross selling)
C. Special deals can be offered to enhance product sales
D. Optimizing store layout
Well known ecommerce website, Amazon employs innovative way of data mining with market basket
analysis. It displays on its home page, items “Frequently bought together” and “Customers who
bought this item also bought.”. This recommendation engine accounts for more than one third of its
sale.
Marketing Business Analytics Session10 Prof. Manisha Kapoor
[ Total no of transactions * Fraction of time A was purchased * Fraction of times B was purchased]
= P( A & B) /[ N * A * B]
P(A & B): number of transactions where products A and B were purchased together
Suppose N = 1000
P(A&B) = 200
Above table shows that baby product and DVDs have highest lift.
1.5 REFERENCES:
• Winston, W. L. (2014). Marketing analytics: Data-driven techniques with Microsoft Excel. John
Wiley & Sons.
• https://searchcustomerexperience.techtarget.com/definition/market-basket-analysis
Marketing Business Analytics Session10 Prof. Manisha Kapoor
Many marketing situations require probability associated with certain events, outcomes which can be
established on past data. For example:
• Predicting whether a customer is likely to buy a product based on available information such
as age, income group, lifestyle, family size etc.
• whether a loan seeker from a bank is likely to default on the loan
• whether a customer of a telecom company is likely to churn
• whether an employee of a company is likely to quit
Binomial logistic regression works on probability associated with the outcome. Probability of any
possible outcome can go from 0 to 1. Higher the probability, higher the possibility that the event may
happen. For example, probability of a prospective customers buying a car is 0.92 then there is higher
likelihood that he/she will buy the car. If we can get list of such prospective customers with higher
probability then marketing efforts can be directed towards them with better outcomes and higher
profits.
Like multiple regression, logistic regression takes more than one independent variable that include
categorical, continuous and discrete variables. We can check whether independent variables are
statistically significant or not. We need to retain only those variables which are statistically significant
and drop others. As we go through multiple iterations, final regression equation can be formed.
x1: continuous
x4: discrete
summary(glm1)
# if any variable is not significant, then drop the variable and create 2nd model glm2
summary(glm2)
Pseudo_R_Squared
0.72
# This shows that above model glm2 explains 72% of deviance which means model is a good fit.
vif(glm2)
If VIF for all variables is less than 5 then there multicollinearity is not an issue in the model.
1
0.94
Above value of 0.94 shows high probability of customer response as Yes.
FALSE TRUE
0 132 168
1 71 629
A confusion matrix is a table that is often used to describe the performance of a classification model
(or "classifier") on a set of test data for which the true values are known. It is another way to test the
accuracy of the model using the same data that was used to build the model.
2.8 REFERENCES:
• Hodeghatta, U. R., & Nayak, U. (2016). Business analytics using R-a practical approach. Apress.
Marketing Business Analytics Session10 Prof. Manisha Kapoor
q = quantity demanded, p = unit price, a and b are constants to be determined using linear demand
curve method.
Example: q = 20 – 5p
Example: q = 20 * p -2
total price which is lower than when they are sold individually. Therefore, it requires analytic model
for pricing decisions.
Quantity discounts: There are several versions of quantity discounts. For quantity 100 or less price is
$10, for quantity 101 to 200 price is $8. Another version of quantity discount is for first 100 units price
is $10 and for any additional units above 100 price is $8.
Two-part tariffs: Some of the sports club have annual club membership fee of Rs 15,000 and then for
availing any facilities members have to pay a small fee of Rs 100 per use.
For maximising profit with nonlinear pricing, excel solver can be used. For a linear demand curve,
nonlinear pricing earns twice as much profit as linear pricing.
Prices of high-tech or fashion products generally drop over period of time. There are several reasons
for the drop in prices for such products.
Learning Curve: cost of producing products follow learning curve or experience curve (T.P. Wright,
1936). As per data prices drop between 10 to 30% when the product volumes double. Firms pass on
this cost saving to consumers resulting in price drops and it also help them in gaining more market
share. Thus company is able to expand capacity and produce more.
Competition: As market for new product such as smart phones expand, new firms enter that market
and launch their products. Due to competitive situation prices are dropped by firms.
Price Skimming: When a new product is launched, different potential buyers perceive different value
for the new product. Some buyers are willing to pay $100 while others will buy it at $50. If the company
launches product at lower price, it loses an opportunity to tap high valuation customers.
Therefore, prices are usually kept high initially as companies want to charge high prices to customers
who value the product highly and want to have something new, unique. After demand from such
Marketing Business Analytics Session10 Prof. Manisha Kapoor
customers slows down, price must be lowered down to be able to sell to those customers who may
be willing to buy it at lower prices. This is called as price skimming.
Sale: Companies announce “Sale” from time to time especially during holiday season by offering
discounts to attract those customers who perceive low-value for product or they are willing to buy at
lower prices.
3.5 REFERENCES:
• Marketing Analytics by Wayne Winston, Wiley Publication
Marketing Business Analytics Session10 Prof. Manisha Kapoor
RFM is another approach to get insights into customer transactions in retail industry. Marketers
analyse the customer transactions to segment customers into different groups with the goal of
increasing customer retention, expanding customer base and focus marketing efforts on right set of
customers.
From the retail transactions, its possible to classify customers by time (when was the most recent
visit), frequency of their transactions and the amount they have spent.
Recency: when was the most recent visit by each customer, days passed their visit is an indicator
whether customer is still active or not. Active customers are more likely to return to store respond to
promotions.
Frequency: How frequently customer has visited the retail store or website. Customers which visit
frequently are important for the retailer as they keep the inventory moving.
Monetary value: How much customer spends on an average in the store or website? Some customer
visit frequently but they spend less on an average, whereas some visit once in a while and the value
of their bought items is quite high. Customers that spend more are more valuable for the retailer as
that results in higher revenue and profits for the company.
Each of the above parameters are important in their own sense. But any single parameter fails to get
a complete picture. A customer who visited recently may not be a regular customer, another customer
coming frequently buys low value goods, customer has purchased few high-priced items turns out to
be a one-time customer. Therefore, its in RFM above 3 are combined into single RFM score and that
is used as a basis for grouping them and analyse relative importance of each customer.
Customers are important for the business. It does no good to retail business by giving equal
importance to all customers. More efforts need to be focused on retaining a regular customer who
spends $1000 per week in a store than another regular customer spending $100 per week. Therefore,
there needs to be a way to understand relative importance of each customer. RFM addresses this
issue.
Marketing Business Analytics Session10 Prof. Manisha Kapoor
• Who are my best customers which contribute higher revenues and profits?
• Which customers are likely to stop giving business?
• Considering their importance, which customers need to be retained?
Now analysing the RFM score, retailer needs to interpret the data on these lines and plan marketing
strategies accordingly:
• Customers with highest RFM Scores – most loyal and profitable customers, retain them by
giving them special privileges or offers.
• Customers with High Frequency and High Monetary score but low recency score – retailer
needs to reach out to such customers to bring them back to store
Marketing Business Analytics Session10 Prof. Manisha Kapoor
5.3 REFERENCES:
• Blattberg, R. Database Marketing: Analyzing and Managing Customers [Electronic
Resource]/Robert C. Blattberg, Byung-Do Kim, Scott A. Neslin New York, NY.
• https://clevertap.com/blog/rfm-analysis/
Marketing Business Analytics Session10 Prof. Manisha Kapoor
By using this tool marketing also moves from focusing on short term transactional relationship with
customers to long term association with customers (Rust et al., 2004)
This concept is quite useful in industries such as telecom, retailing where customer association with
the firm is for long term. After computing CLV, it can be determined whether its profitable to retain
the customer.
• CLV Determines present value of customers brought for an organization during customer’s life
cycle.
• CLV is used to classify its clients based on CLV rankings so that various marketing strategies can be
defined for each segment.
Firm can design strategies around each of these segments on investing into relationship with them.
For example, firm may want to develop Gold customers by investing into marketing efforts so that
move into platinum category.
5.4.3 Example 1
customer retention 70%
revenue per customer $ 300.00
discount rate 10%
profit% 50%
marketing cost $ 40.00
Net Profit/customer =
Net Profit per customer $ 110.00 revenue per customer - Marketing cost
in the above example, customer retention rate is 70%, revenue per customer is $300 with profitability
at 50%. Marketing cost is $40.
Which means profit per customer is $300 *50 – $40 = $150 - $40 = $110
The next table, see the calculations for 1 customers lifetime value.
Marketing Business Analytics Session10 Prof. Manisha Kapoor
5.5 REFERENCES:
• Marketing Analytics using data driven techniques with Microsoft Excel – Wayne Winston
• Database Marketing – Analyzing and Managing Customers – Blattberg, Kim, Neslin, Springer
• Rust, R. T., Lemon, K. N., & Zeithaml, V. A. (2004). Return on marketing: Using customer equity to
focus marketing strategy. Journal of marketing, 68(1), 109-127.
Marketing Business Analytics Session10 Prof. Manisha Kapoor
Clustering is an unsupervised learning technique used to separate the instances into different clusters
based on common characteristics such as age, income, lifestyle, family size, education etc. This
grouping happens through a iterative process using computer algorithm.
Cluster analysis results in customer segments such that customers within a segment are similar to
each other, and collectively different than other segments. The clustering variables are demographic,
psychographic, transactions, online data or even call centre data based on customer interactions.
Once cluster are formed, those can be used for targeting with particular marketing offer.
Classification is supervised learning technique; wherein predefined categories are assigned to each
instance based on available features. e.g. of classification is logistic regression. For example, when
classification model is applied to the customers banking data, customer is classified into defaulter or
no-defaulter based on his transactions, demographic and psychographic variables as independent
variables. Thus in classification models, 2 groups (defaulter, no-defaulter) are predefined. Whereas in
cluster analysis such groups are not predefined. These get created. Once system creates clusters
Marketing Business Analytics Session10 Prof. Manisha Kapoor
through iterative process, we can visually represent clusters on scatter plots and then decide either
to increase or decrease the number of clusters.
In above example, we have customers 1 and 2 are closest to each other (or similar). Hence we create
one cluster of 1-2 in iteration 1
Distance between 3 and 4 is lowest (3). Therefore, we will combine these 2 into a cluster 3-4.
In this method, user can decide to create k clusters from the given data.
Let us say, we have 100 records for customers with their age, income. Now we want to create 2
clusters from this data.
Randomly select any 2 points, i.e. customers A and B as centroids of the 2 clusters. Their coordinates
Age, income become the values which are considered for computing shortest distance with other 98
customers.
Iteration 1: Then we assign rest of the 98 customers to these 2 clusters, based on shortest distance
with the centroids A and B. 2 clusters are formed with customers assigned to each cluster.
Iteration 2: Calculate new centroids of the 2 clusters by taking means for all the points assigned to
them. Find out shortest distance for each of the 100 customers with the 2 centroids (say C1 and C2)
and assign them. Now customers will move from existing cluster (with centroids A,B) to newly
formed clusters with centroids (C1 and C2).
Iteration 3: Again, calculate another revised 2 centroids of the 2 clusters from iteration 2 above by
taking means for all the points assigned to them (let’s call them C3 and C4). These will be different
than from iteration 2. Find out shortest distance for each of the 100 customers with centroid C3 and
Marketing Business Analytics Session10 Prof. Manisha Kapoor
C4 and reassign them. Now customers will move from existing clusters (C1,C2) to newly formed
clusters (C3,C4).
This process repeats till centroids do not change and customers do not get reassigned.
6.6 REFERENCES:
• Hodeghatta, U. R., & Nayak, U. (2016). Business analytics using R-a practical approach. Apress.
• Blattberg, R. Database Marketing: Analyzing and Managing Customers [Electronic
Resource]/Robert C. Blattberg, Byung-Do Kim, Scott A. Neslin New York, NY.
• https://www.differencebetween.com/difference-between-clustering-and-vs-
classification/#:~:text=Clustering%20is%20unsupervised%20learning%20while%20Classification
%20is%20a,subsets%20to%20group%20the%20instances%20with%20similar%20features.
Marketing Business Analytics Session10 Prof. Manisha Kapoor
It can be defined as the practice of gathering data from social media websites and analyzing that data
using social media analytics tools to make business decisions. The most common use of social media
analytics is to mine customer sentiment to support marketing and customer service activities.
It is commonly used by marketers to track online conversations, comments about products and
companies. One author defined it as "the art and science of extracting valuable hidden insights from
vast amounts of semi-structured and unstructured social media data to enable informed and insightful
decision making.
Big Data created on the social media consist of text messages, songs, pictures, videos etc. People share
information through text messages, videos, pictures, songs etc. They often express their intention to
purchase a product, request for feedback, share their service experience or product reviews on the
social media. This data from social media has valuable information for firms which can analyse and
mine this data. Figure below summarizes a generic process consumers follow for purchasing products.
This process illustrates how consumers research products using search engines and social media. They
also contribute their views, opinions on social media before and after purchasing the product.
Technology enables firms to use this medium to support business strategies and tactical decision
making. We summarize some of the ways in which social media analytics can be helpful for the firms
in the next sub-section.
Marketing Business Analytics Session10 Prof. Manisha Kapoor
• Data Analysis
• Information interpretation
7.5 REFERENCES:
Zeng, H. Chen, R. Lusch and S. Li, "Social Media Analytics and Intelligence," in IEEE Intelligent
Systems, vol. 25, no. 6, pp. 13-16, Nov.-Dec. 2010, doi: 10.1109/MIS.2010.151.
Batrinca, B., & Treleaven, P. C. (2015). Social media analytics: a survey of techniques, tools and
platforms. Ai & Society, 30(1), 89-116.
Marketing Business Analytics Session10 Prof. Manisha Kapoor
8.1.1 Wind and Roberson Framework for marketing concepts and models:
Section I – traditional assessment of market opportunities and business strengths
a) Analysis of opportunities and threats
b) Analysis of business strengths and weaknesses
Section II – The marketing strategy core
a) Segmentation by positioning analysis
b) Opportunity analysis
c) Synergy analysis (in advertising, distribution, manufacturing etc. among products,
segments, marketing mix components)
d) Functional requirements (specifications)
e) Portfolio analysis, the analytical core of the process
Section III – The generation and evaluation of objectives and strategies
a) Generation of objectives and strategies
b) Evaluation of objectives and strategies
c) Implementation, monitoring and control of the program
Marketing Business Analytics Session10 Prof. Manisha Kapoor
There are various factors that contribute to shorter product life cycles – number of options for
consumer, various price points at which products are available, introduction of newer products with
better features. Firms which ignore the PLC will not be able to sustain in the long run. PLC time varies
as per the price, technology evolution in the field and the duration for going “from product concept
to market”. Technology driven products’ such as smart phones, laptops, tablets have few months PLC
whereas R&D process for developing car can go up to few years. Therefore, car models have longer
life cycles.
Thus PLC is an important concept in designing marketing strategy. It can be used for marketing
planning. There are several strategic alternatives in the context of PLC such as developing newer
model of the same product, advertising campaigns, pricing strategies. These will extend the life of
existing product. Alternate strategy which is commonly used by mobile and electronic companies
(such as Samsung, Sony) is to keep introducing new products with better features with altogether new
life cycle for the new product.
Cq = Cn (q/n)-b
b = learning constant
If experience for the employees doubles, let us say cost per unit drops to 80% of original levels. In this
case the learning rate is 80%. Learning rate is related to learning constant as:
r = 2-b X 100
b = ln 100 – ln r / ln 2
There are several other models that demonstrate the common characteristic that with experience
labour costs decline over time.
But not all researchers agree that it is experience that leads to entire cost reduction. Alberts (1989)
argues that causes of cost reduction can be partly because of economies of scale and partly because
of innovations. These innovations could be operators (better sources of input raw materials,
manufacturing process, alignment of resources etc.), improvement in technology, process
improvements.
Scale based cost reduction could be due to reduction in excess capacity, cost-effective substitutes,
better deals on sourcing due to higher volumes.
Above observations outline the fact that experience by itself does not lead to cost reduction, it
provides a framework for cost reduction projects.
is important. The quality of service depends on people aspects of service. Process is another important
component of the mix. Process can be made more efficient by improvements.
Marketing mix modelling is an analytical approach that uses data from various internal and external
sources. The data sources can be transactional data for the past periods, incremental data in response
to promotional and advertising efforts, customer data and financial data. Statistical techniques such
as linear or non-linear regression methods are applied with dependent variable as sales. The models
are used to determine statistically significant marketing mix variables and then use the knowledge to
design marketing strategies or optimise the marketing budgets. It is not uncommon to use sales data
for forecasting or simulations purpose for upcoming periods.
For example, price is an important element in the marketing mix which impacts sales volumes. Slight
increase in price adversely affects sales volumes. A model can estimate % change in sales in response
to each percent change in price. In a competitive situation, this provides an important guideline to
decision makers to decide right price for the products. Similarly, promotions have favourable impact
on sales. A model can be designed to include promotion as input variable and estimate the volumes
generated by each promotional events.
• Planning Process – This stage is for setting objectives, strategy for achieving objectives and
budgeting. Objectives are quantitative in nature though there could be some qualitative
elements in it. Strategy consists of creative, offer, product and media plans. For budgeting
baseline can be previous year’s budget and based on objectives and strategy it can be
modified.
• Developing copy Content – This stage includes creative, offer and product.
Marketing Business Analytics Session10 Prof. Manisha Kapoor
• Select Media – Database marketer has several options for communicating with customers –
emails, social media, catalogs etc. Media selection depends on expected exposure for the
contents to the potential customers with the intention of maximizing response rate.
• Evaluate Results – to measure effectiveness of database marketing communication, 4
methods are adopted – calculation of profits, statistical analysis, embedded testing, surveys.
8.4 REFERENCES:
• Lilien, G. L., Kotler, P., & Moorthy, K. S. (1995). Marketing models. Prentice Hall.
• https://en.wikipedia.org/wiki/Marketing_mix_modeling
• Blattberg, R. Database Marketing: Analyzing and Managing Customers [Electronic
Resource]/Robert C. Blattberg, Byung-Do Kim, Scott A. Neslin New York, NY.