Statistical Insights (Python)
First Moment Business Decision or Measures of Central Tendency
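It gives the mean, median and mode values of the dataset.
If the mean, median and mode are approximately equal, the data is roughly symmetric,
which is consistent with a normal distribution but does not prove it. If they differ
noticeably, the distribution is skewed and therefore not normal.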
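As a minimal sketch of this comparison, the snippet below uses Python's standard `statistics` module on two made-up samples. The sample values and the helper name `central_tendency` are illustrative assumptions and are not taken from the dataset discussed here.

```python
import statistics


def central_tendency(values):
    """Return the mean, median and mode of a sequence of numbers."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "mode": statistics.mode(values),
    }


# Hypothetical samples, chosen only to illustrate the idea.
symmetric = [2, 3, 3, 4, 4, 4, 5, 5, 6]
right_skewed = [1, 1, 1, 2, 2, 3, 4, 8, 14]

symmetric_stats = central_tendency(symmetric)
skewed_stats = central_tendency(right_skewed)

print(symmetric_stats)  # the three measures coincide
print(skewed_stats)     # mode < median < mean, a sign of positive skew
```

Equal values only suggest symmetry; a formal normality check would need more than these three numbers.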
MEAN:
The mean is the arithmetic average of the data, and it is easily influenced by outliers.
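The effect of a single outlier on the mean can be sketched as follows. The numbers are hypothetical, and the value 500 stands in for an outlier; the median is shown alongside only for contrast.

```python
import statistics

# Hypothetical values; 500 plays the role of an outlier.
clean = [10, 12, 14, 16, 18]
with_outlier = clean + [500]

mean_clean = statistics.mean(clean)
mean_outlier = statistics.mean(with_outlier)
median_clean = statistics.median(clean)
median_outlier = statistics.median(with_outlier)

# The mean shifts strongly, while the median barely moves.
print(f"mean:   {mean_clean} -> {mean_outlier}")
print(f"median: {median_clean} -> {median_outlier}")
```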
BEFORE PREPROCESSING:
AFTER PREPROCESSING:
MEDIAN:
BEFORE PREPROCESSING:
AFTER PREPROCESSING:
MODE:
BEFORE PREPROCESSING:
AFTER PREPROCESSING:
Second Moment Business Decision or Measures of Dispersion
* It contains Variance, Standard Deviation & Range.
* It gives a general idea about the spread of data in the dataset.
VARIANCE:
Variance is the average squared distance of each data point from the mean.
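The definition can be written out step by step, as in the sketch below. The data values are made up for illustration. Dividing by n gives the population variance described above; dividing by n - 1 gives the sample variance, which is what pandas `Series.var()` returns by default (NumPy's `var` defaults to dividing by n).

```python
import statistics

# Hypothetical data, used only to illustrate the calculation.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = sum(data) / len(data)
squared_deviations = [(x - mean) ** 2 for x in data]

# Population variance: average of the squared deviations.
population_variance = sum(squared_deviations) / len(data)
# Sample variance: divide by n - 1 instead of n.
sample_variance = sum(squared_deviations) / (len(data) - 1)

# The standard library provides both variants directly.
print(population_variance, statistics.pvariance(data))
print(sample_variance, statistics.variance(data))
```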
BEFORE PREPROCESSING:
AFTER PREPROCESSING:
STANDARD DEVIATION:
BEFORE PREPROCESSING:
AFTER PREPROCESSING:
The variance and standard deviation show how widely the data is spread. They confirm
our observation from the measures of central tendency: the data is spread over a wide
range, which means there is large variability and a chance that outliers are present.
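A rough sketch of how the standard deviation and range hint at outliers is shown below. The data is hypothetical, and flagging points more than two standard deviations from the mean is a common rule of thumb assumed here for illustration, not a method stated in this document.

```python
import math
import statistics

# Hypothetical values; 500 plays the role of an outlier.
data = [10, 12, 14, 16, 18, 500]

mean = statistics.mean(data)
# Standard deviation is the square root of the (population) variance.
std_dev = math.sqrt(statistics.pvariance(data))
# Range is the distance between the largest and smallest value.
data_range = max(data) - min(data)

# Rule-of-thumb assumption: flag points more than 2 standard deviations away.
flagged = [x for x in data if abs(x - mean) > 2 * std_dev]

print(f"std dev: {std_dev:.2f}, range: {data_range}, flagged: {flagged}")
```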
Third Moment Business Decision or Skewness
BEFORE PREPROCESSING:
AFTER PREPROCESSING:
The skewness values show that all of them are positively skewed, which indicates that
the majority of the data points lie on the lower side, with a long tail towards
higher values.
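The sign of the skewness can be illustrated with a moment-based calculation. This is a sketch on made-up samples; the function name `skewness` is an assumption, and it applies no bias correction, so pandas `Series.skew()` (which does) would return slightly different values on small samples.

```python
def skewness(values):
    """Moment-based skewness m3 / m2**1.5, without bias correction."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    return m3 / m2 ** 1.5


# Hypothetical samples, chosen only to illustrate the sign of the result.
right_skewed = [1, 1, 1, 2, 2, 3, 4, 8, 14]   # most points low, long right tail
symmetric = [1, 2, 3, 4, 5]
left_skewed = [-x for x in right_skewed]      # mirror image of right_skewed

print(skewness(right_skewed))  # positive
print(skewness(symmetric))     # zero
print(skewness(left_skewed))   # negative
```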
Fourth Moment Business Decision or Kurtosis
AFTER PREPROCESSING:
The kurtosis is positive. Measured relative to a normal distribution, positive kurtosis
means that the distribution has heavier tails (more extreme values) and a sharper peak
than a normal distribution, which indicates that the data may not be normally distributed.
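As a sketch of how the sign of kurtosis is read, the function below computes excess kurtosis (kurtosis minus 3, so that a normal distribution scores 0) from central moments. The samples and the name `excess_kurtosis` are illustrative assumptions; pandas `Series.kurt()` also reports excess kurtosis but applies a bias correction, so its values differ somewhat on small samples.

```python
def excess_kurtosis(values):
    """Moment-based excess kurtosis m4 / m2**2 - 3, without bias correction."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m4 = sum((x - mean) ** 4 for x in values) / n  # fourth central moment
    return m4 / m2 ** 2 - 3


# Hypothetical samples, chosen only to illustrate the sign of the result.
heavy_tailed = [-10, -1, -1, 0, 0, 0, 0, 1, 1, 10]  # a few extreme values
flat = [1, 2, 3, 4, 5]                              # evenly spread, no tails

print(excess_kurtosis(heavy_tailed))  # positive: heavier tails than normal
print(excess_kurtosis(flat))          # negative: lighter tails than normal
```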