A histogram should be used when analyzing numerical data to understand the shape of its distribution and determine if it is approximately normal. It can also be used to analyze if a process can meet requirements, compare the outputs of processes, see if a process has changed over time, and quickly communicate a data distribution to others. Outliers in a histogram should be investigated further as they can provide interesting information about the data.
A histogram should be used when analyzing numerical data to understand the shape of its distribution and determine if it is approximately normal. It can also be used to analyze if a process can meet requirements, compare the outputs of processes, see if a process has changed over time, and quickly communicate a data distribution to others. Outliers in a histogram should be investigated further as they can provide interesting information about the data.
A histogram should be used when analyzing numerical data to understand the shape of its distribution and determine if it is approximately normal. It can also be used to analyze if a process can meet requirements, compare the outputs of processes, see if a process has changed over time, and quickly communicate a data distribution to others. Outliers in a histogram should be investigated further as they can provide interesting information about the data.
• You want to see the shape of the data’s distribution, especially when determining whether the output of a process is distributed approximately normally • Analyzing whether a process can meet the customer’s requirements • Analyzing what the output from a supplier’s process looks like • Seeing whether a process change has occurred from one time period to another • Determining whether the outputs of two or more processes are different • You wish to communicate the distribution of data quickly and easily to others
Normal Distribution
A common pattern is the bell-shaped curve known as the "normal
distribution." In a normal or "typical" distribution, points are as likely to occur on one side of the average as on the other. Note that other distributions look similar to the normal distribution. Statistical calculations must be used to prove a normal distribution. It's important to note that "normal" refers to the typical distribution for a particular process. For example, many processes have a natural limit on one side and will produce skewed distributions. This is normal—meaning typical—for those processes, even if the distribution isn’t considered "normal."
Skewed Distribution
If the data are left-skewed, then the mean is
typically LESS THAN the median. If the data are right-skewed, then the mean is typically GREATER THAN the median.
OUTLIERS (AND THE OZONE LAYER)
Outliers can be described as extremely low or high values that do not fall near any other data points. Sometimes outliers represent unusual cases. Other times they represent data entry errors, or perhaps data that does not belong with the other data of interest. Whatever the case may be, outliers can easily be identified using a histogram and should be investigated as they can shed interesting information about your data.
If we want to find outliers, we should do the previous:
step 1. Find mean and standard deviation amounts; step 2. Add mean 3*standard deviation any value upper that may be our outlier, we should further investigate this case just looking at the histogram is not enough. Also, Subtract Mean by 3*standard deviation any variable that may be our outlier, we should further investigate this case just looking at the histogram is not enough.