Professional Documents
Culture Documents
Credit EDA Case Study
Credit EDA Case Study
Credit EDA Case Study
APPROACH
1) Importing and Cleaning of 2) Formatting or Grouping 3) Performing Univariate & 4) Draw useful Insights.
the Data provided for an effective analysis. Bivariate analysis on
Categorial and Numerical
fields.
UNIVARIATE DISCRETE ANALYSIS FOR AGE GROUPS
Applicants are increasing with Age of the applicant until age 40 and after that we see decline in the no of
applications.
And from the 2nd plot , we see that Default rate is decreasing as the Age of the applicant increases.
UNIVARIATE DISCRETE ANALYSIS FOR FAMILY STATUS
• From the chart, It can be inferred that the most of applicants belongs to Married, Single & Civil Marriage
categories sequentially. Out of which, Single & Civil Marriage tend to default more, and Unknown
category never defaulted.
UNIVARIATE DISCRETE ANALYSIS FOR OCCUPATION
Most of the Applicants occupation is Missing. However Top applicants are from Laborers ,Sales Staff.
But most of the Default percentage is occurring from Low-Skill Laborers group
UNIVARIATE DISCRETE ANALYSIS FOR INCOME TYPE
Most of the applicants are From Working Category & Commercial Associate and Least are Businessman
and Student
However, Default percentage is more on Maternity Leave and Unemployed applicant group
UNIVARIATE CATEGORICAL ANALYSIS FOR ORGANIZATION
TYPE
Most of the Applicants are from Business Entity Type 3 , Missing and Self Employed.
However, Most default percentage is from Transport Type 3 , Industry Type 13 and industry Type 8 groups
ORDERED/ CONT., NUMERI CAL
VARIABLE ANALYSIS ON WORK
EXP
• AMT_GOOD_PRICE , AMT_CREDIT,
AMT_ANNUITY doesn’t seem to have any impact
on the default rate.
B I VA R I AT E A N A LY S I S -
E DU C AT I O N V S G E N D E R V S
INCOME
•except for the Region rating client vs Region REGION_RATING_CLIENT_W_CITY REGION_POPULATION_RELATIVE 0.446977 0.539005
• 63% of Previous loans are approved and those applicants have 8% default rate whereas previously Refused
applicants have 12% default percentage.
PREVIOUS
APPLICATION
DATA
ANALYSIS