Play With Data Science

Probability and Statistics

#1 Introduction to Probability and Statistics

#2 Population and Sample

#3 Gaussian Normal Distribution and its PDF (Probability Density Function)

#4 CDF (Cumulative Distribution Function) of Gaussian Normal Distribution

#5 Symmetric distribution, Skewness and Kurtosis

#6 Standard normal variate (Z) and Standardization

#7 Kernel density estimation

#8 Sampling distribution & Central Limit Theorem

#9 (Q-Q plot) How to test if a random variable is normally distributed or not

#10 How distributions are used?

#11 Chebyshev’s Inequality

#12 Discrete and Continuous Uniform distributions

#13 Bernoulli and Binomial Distribution

#14 Log Normal Distribution

#15 Power Law Distribution

#16 Box Cox Transform

#17 Applications of non-gaussian distributions

#18 Covariance

#19 Pearson Correlation Coefficient

#20 Spearman Rank Correlation Coefficient

#21 Correlation vs Causation

#22 How to use Correlations

#23 Confidence Interval (C.I) Introduction

#24 Computing Confidence Interval given Normal Distribution

#25 C.I for mean for any Random Variable

#26 Confidence Interval using Bootstrapping

#27 Hypothesis testing methodology, Null-hypothesis, p-value

#28 Hypothesis Testing Intuition with coin toss example

#29 Resampling and permutation test

#30 Kolmogorov-Smirnov (K-S) Test for similarity of two Distributions

#31 Hypothesis testing another example

#32 Resampling and Permutation test another example

#33 How to use hypothesis testing?

#34 Proportional Sampling

Logistic Regression

#1 Geometric intuition of Logistic Regression
12 November 2020 04:31 PM

#2 Squashing and Sigmoid function
12 November 2020 08:02 PM

#3 Mathematical formulation of Objective function

#4 Weight vector

#5 L2 Regularization Overfitting and Underfitting

#6 L1 regularization and sparsity

#7 Probabilistic Interpretation Gaussian Naive Bayes

#8 Loss Minimization Interpretation

#9 Hyperparameters and Random Search

#10 Column Standardization

#11 Feature Importance and Model Interpretability

#12 Model Interpretability and Collinearity of features

#13 Test Run time Space and Time complexity

#14 Real world cases

#15 Non-linearly separable data & feature engineering

#16 Extensions to Generalized linear models

Linear Regression

#1 Geometric Intuition of Linear Regression

#2 Mathematical formulation

#3 Real world Cases

Solving Optimization Problems

#1 Differentiation

#2 Maxima and Minima

#3 Vector calculus: Gradient

#4 Gradient Descent Geometric Intuition

#5 Learning Rate

#6 Gradient descent for Linear Regression

#7 Stochastic Gradient Descent (SGD) Algorithm

#8 Constrained Optimization & PCA

#9 Logistic Regression formulation revisited

#10 Why L1 regularization creates sparsity

Support Vector Machine (SVM)

#1 Geometric Intuition

#2 Mathematical Formulations (Hard Margin SVM)

#3 Mathematical Formulations (Soft Margin SVM)

#4 Why we take values +1 & -1 for Support vector planes?

#5 Loss function (Hinge Loss) based interpretation

#6 Dual form of SVM formulation

#7 Kernel trick

#8 Polynomial Kernel

#9 Radial Basis Function (RBF) Kernel

#10 Domain specific Kernels

#11 Train and Run time complexities

#12 nu-SVM control errors and Support Vectors

#13 SVM Regression

#14 Some real world Cases

Naive Bayes

#1 Conditional Probability

#2 Independent vs Mutually exclusive events

#3 Bayes Theorem with examples

#4 Naïve Bayes algorithm

#5 Toy example Train and test stages
http://shatterline.com/blog/2013/09/12/not-so-naive-classification-with-the-naive-bayes-classifier/

#6 Naïve Bayes on Text data

#7 Laplace Additive Smoothing

#8 Log-Probabilities for numerical stability

#9 Bias and Variance trade off

#10 Feature importance and interpretability

#11 Imbalanced Data

#12 Outliers

#13 Missing values

#14 Handling Numerical features (Gaussian NB)

#15 Multiclass classification

#16 Similarity or Distance matrix

#17 Large dimensionality

#18 Best and worst cases

K-Nearest Neighbours

#1 How “Classification” works

#2 Data matrix notation

#3 Classification vs Regression (examples)

#4 K-Nearest Neighbours Geometric intuition with a toy example

#5 Failure cases of KNN

#6 Distance measures: Euclidean (L2), Manhattan (L1), Minkowski, Hamming

#7 Cosine Distance & Cosine Similarity

#8 How to measure the effectiveness of k-NN

#9 Test Evaluation time and space complexity

#10 KNN Limitations

#11 Decision surface for K-NN as K changes

#12 Overfitting and Underfitting

#13 Why do we need cross validation?

#14 K-fold cross validation

#15 Visualizing train, validation and test datasets

#16 How to determine overfitting and underfitting?

#17 Time based splitting

#18 k-NN for regression

#19 Weighted k-NN

#20 Binary Search Tree

#21 How to build a kd-tree

#22 Find nearest neighbors using kd-tree

#23 Limitations of KD tree

#24 Hashing vs LSH

#25 LSH for cosine similarity

#26 LSH for Euclidean Distance

#27 Probabilistic class label

Decision Tree

#1 Geometric Intuition of decision tree: Axis parallel hyperplanes

#2 Sample Example

#3 Entropy

#4 Information Gain

#5 Gini Impurity

#6 Constructing a Decision Tree

#7 Splitting Numerical features
#8 Do we need Feature Standardization?

#9 Categorical features with many possible values

#10 Overfitting and Underfitting

#11 Train and Run time complexity

#12 Regression using Decision Trees

#13 Cases

Ensemble Models

#1 What are ensembles?

#2 Bootstrapped Aggregation (Bagging) Intuition

#3 Random Forest and its construction

#4 Bias-Variance trade-off

#5 Train and run time complexity

#6 Extremely randomized trees

#7 Random Tree Cases

#8 Boosting Intuition

#9 Residuals, Loss functions and gradients

#10 Gradient Boosting

#11 Regularization by Shrinkage

#12 Train and Run time complexity Gradient Boosting Decision Tree

#13 AdaBoost geometric intuition

#14 Stacking models

#15 Cascading Classifiers
