© Jitesh Khurkhuriya - Azure ML Online Course
[Chart: data volume in exabytes by year (2005, 2010, 2015, 2020); labeled values 130, 1200, 7900 and 40,900]
90% of today’s data has been created in the last two years alone
• Faster decisions
Green Apple
Y = f(X)
• The function is approximated to predict new values of Y given X
• Examples
• Regression – Output variable is a real value, such as Amount, Height, Weight, etc.
• Classification – Output variable is a category, such as Yes, No, Red, Blue, Yellow, etc.
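As a minimal sketch of approximating Y = f(X) for the regression case, the snippet below fits a straight line by ordinary least squares and uses it to predict a new value. The height and weight figures, and the names `fit_line` and `predict`, are made up for illustration and are not from the course material.

```python
def fit_line(xs, ys):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Apply the approximated function f to a new input x."""
    slope, intercept = model
    return intercept + slope * x


# Hypothetical training data: X = height in cm, Y = weight in kg.
heights_cm = [150, 160, 170, 180, 190]
weights_kg = [50, 58, 66, 74, 82]

model = fit_line(heights_cm, weights_kg)   # learn f from known (X, Y) pairs
predicted = predict(model, 175)            # predict Y for an unseen X
print(model, predicted)
```

A classification model follows the same pattern, except that the learned function returns a category (for example Y or N) instead of a real number.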
Loan_ID   Gender  Married  Dependents  Self_Employed  Income     LoanAmt  Term  CreditHistory  Property_Area  Status
LP001002  Male    No       0           No             $5,849.00           60    1              Urban          Y
LP001003  Male    Yes      1           No             $4,583.00  $128.00  120   1              Rural          N
LP001005  Male    Yes      0           Yes            $3,000.00  $66.00   60    1              Urban          Y
LP001006  Male    Yes      2           No             $2,583.00  $120.00  60    1              Urban          Y
• Examples
• Clustering – Customer behaviour grouping
• Association – Recommendation model
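To make the clustering example concrete, here is a rough sketch that groups customers by a single behaviour measure without using any labels. It uses one-dimensional k-means, which is an assumption chosen for illustration: the slides do not name a specific algorithm. The spend values, starting centroids and the function name `kmeans_1d` are all hypothetical.

```python
def kmeans_1d(values, centroids, iterations=10):
    """Very small 1-D k-means: assign each value to its nearest centroid,
    then move each centroid to the mean of its assigned values."""
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for v in values:
            nearest = min(range(len(centroids)),
                          key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids, clusters


# Hypothetical monthly spend per customer; no labels are provided.
monthly_spend = [20, 25, 30, 200, 210, 220]

centroids, clusters = kmeans_1d(monthly_spend, centroids=[20, 220])
print(centroids)
print(clusters)
```

The groups are discovered from the data alone, which is what separates this from the supervised examples earlier in the deck.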
4,000 4,400 5,000 5,500 5,700 5,800 6,200 6,400 6,400 6,400 7,000
1 2 3 4 5 6 7 8 9 10 11
• Sample Space (S) – Set of all possible outcomes that might be observed for an experiment
• Dice Sample Space (S) = {1, 2, 3, 4, 5, 6}
• Probability of rolling a 3 (event A)
• P(A) = 1/6 ≈ 0.1667
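The dice calculation above can be sketched in a few lines, assuming a fair die so that every outcome in the sample space is equally likely. The helper name `probability` is illustrative only.

```python
from fractions import Fraction

# Sample space for one roll of a fair six-sided die.
sample_space = {1, 2, 3, 4, 5, 6}

# Event A: the roll shows a 3.
event_a = {3}


def probability(event, space):
    """P(event) = favourable outcomes / all outcomes (equally likely)."""
    favourable = event & space
    return Fraction(len(favourable), len(space))


p_a = probability(event_a, sample_space)
print(p_a, round(float(p_a), 4))  # prints: 1/6 0.1667
```

The same helper works for compound events, for example `probability({2, 4, 6}, sample_space)` for rolling an even number.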
• Examples
• Assigning a given email to the "spam" or "non-spam" class, or to the Primary, Social or Promotional category
• Will this customer default on loan repayment?
• Will this customer buy my product?
• Examples
• Predicting the future sales of products
• Computing a fair price for a product or service