Professional Documents
Culture Documents
Customer Churn Case Answers
Customer Churn Case Answers
You will notice that the cases that do not satisfy the condition are slashed out.
To build the histogram for the number of customers who churned by age.
Now you will notice that the slashes are gone and you can run the logit regression.
AnalyzeàRegressionàBinary Logistic…àPut the “Churn1Yes0No” in the dependent variable
boxà Put “CustomerAgeinMonths”, “CHIScoreMonth0”, “CHIScore01”, “SupportCasesMonth0”,
“SupportCases01”, and “Views01” into the Box in the Block section. (Just as a reminder, in this
case, none of the independent variables or predictors are categorical variables, if there are some
categorical variables, you need to define them by clicking the categorical button in the upper
right)àOK.
While conducting binary logistic regression, in ‘Save’ check ‘probabilities’.
a. What is the probability of churn based on customer age according to the resulting output?
The results suggest that the likelihood of churn is slightly greater with age (Exp(B) = 1.018).
b. What are the ‘customer age’ and the ‘predicted probability’ that customers 354, 672, and
5,203 will leave?
Based on the regression equation and the data:
• for customer 672 (see below):
Churn = =-2.672+(0.017*16)-(0.006*148)-(0.009*1)+(0.13*0)+(0.0001*85)-(0.139*0)
Churn = -3.2885
Probability = EXP[Churn] / (1 + EXP[Churn])
Probability = 0.036, 3.6%
If you checked to save probabilities as you are running the binary regression (below), a
variable named as PRE_1 gets added to the data set. The probability in SPSS is 3.672%
• For Customer 5203:
Churn = =-2.672+(0.017*4)-(0.006*37)-(0.009*32)+(0.13*1)+(0*1)-(0.139*1)
Churn = -3.123
Probability = EXP[Churn] / (1 + EXP[Churn])
Probability = 0.042, 4.2%
The probability in SPSS is 4.307%
3. Now run logistic regression by splitting the data into three slices as follows: Customer age 0-6
months, 7 to 13 months, and 14+ months. Run separate logistic regressions for each slice.
a. What is the probability of churn based on customer age according to the resulting output
of the three groups?
Next,
Then
Next
Then,
Next
7-13 Month Group:
14+ Month Group:
b. Do you think separating the data into three slices improve predictive power?
I do. Separating age groups shows the difference between ‘older’ customers, who are less
likely to churn (B value becomes <1) versus the other groups. The exp(B) is < 1 for the oldest
customer group, which means that the probability of churn is decreasing by age for this
group, while the reverse is true for the other two groups. This is a significant difference from
the insight gained from the overall model.
The probabilities:
• for customer 672 (16 mo):
Probability before segmentation = 0.036, 3.6%