Professional Documents
Culture Documents
Ananlytics Notes
Ananlytics Notes
Ananlytics Notes
Nominal- categorize in no order. In nominal form of data there is no order, no meaningful difference
and there is no absolute zero.
Ordinal- Has something meaningful order to it like 1,2,3 or good, excellent, bad etc. It is ranking and
rating. There is no absolute zero in ordinal data.
Interval- There is something meaning full in the data even when it is zero. Ex- 0-degree water temp
does not mean nothing exists but it means that the water is frozen.
The only difference between interval and ratio is that in ratio there is absolute zero.
Descriptive analysis-
Types of summaries that can be obtained through descriptive analysis-
1.) Distribution- Distribution shows us the frequency of different outcomes ( or data points ) ina
population or sample. It can be represented as numbers in a list, table or graph. Ex- list
showing no. of those with different hair color.
Descriptive statistics
Mean Range
Kurtosis <0 means peak is short and broad, tails are shorter
Kurtosis >0 means peak is higher and thinner, tails are longer
The sample kurtosis is useful measure of weather there is a problem with outliers in a data set.
Larger kurtosis indicates a more serious outlier problem, and may lead the research to choose
alternative statistical methods.
Data Warehouse-
A data warehouse is process for collecting and managing data from varied sources to
provide meaningful business insight.
It is typically used to connect and analyze data from heterogeneous data ( different
sources).
It is a blend of technologies and components which aids the strategic use of data.
It is electronic storage of a large amount of information by a business.
It is a process of transforming data into information and making it available to users in
timely manner to make a difference.
Data warehouse is not a product but an environment.
It is an architectural construct of an information system
A data mart is a data storage system that contains information specific to an organizational
business unit. It contains small and selected part of data that the company stores in a larger
storage system. Company uses a data mart to analyze department-specific information
more efficiently.
Supplier database
Customer
database
Data warehouse
Sales database
Data
Mart
Data mart
Financial Manger
Marketing
1) A data warehouse works as a central repository where information arrives from one or more
data sources.
2) Data flows into a data warehouse from the transactional system and other relational databases.
Data mart
It is a centralized warehouse.
It also provides the ability to classify data according to the subject and give access accordingly.
Are nothing but data store required when neither data warehouse nor OLTP systems support
organizations reporting needs.
Data mart-
Enables the real time execution of large no. of database transactions by large no. of people.
OLAP
Is a technology that organizes large business database and supports complex analysis.
It can be used to perform complex analytical queries without negatively affecting transaction
systems.