Professional Documents
Culture Documents
Data Assimilation Vs Data Mining
Data Assimilation Vs Data Mining
Data Assimilation Vs Data Mining
Data Assimilation
S. Lakshmivarahan
School of Computer Science
University of Oklahoma
Norman, Oklahoma
varahan@ou.edu
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
Data Mining has been and still continues to be the basis for
the advancement of knowledge in all of Sciences and
Engineering
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
Is it DM or DA?
S. Lakshmivarahan
A classification of models
S. Lakshmivarahan
Forms of Data
Data arise in various forms:
Time series data - annual rain fall, total monthly sales
Data martix m n - n objects (columns) and m attributes
(rows)
Cross Sectional data - Tabular forms
Practical problems: Missing data, outliers, Data quality
control
Note: In Science and Engineering, data are often of the
quantitative type (permiting full blown arithmetic
operations). In Economics, Social Sciences etc., data could be
a mixture of both quantitative and qualitative types.
Algorithms for mining/assimialtion qualitative data dier from
those of quantitative data sets
S. Lakshmivarahan
S. Lakshmivarahan
Framework for DA
S. Lakshmivarahan
S. Lakshmivarahan
S. Lakshmivarahan
Optimization problems
S. Lakshmivarahan
S. Lakshmivarahan
Association rules
Image processing, voice recognition
Decision trees (1960)
Probabilistic reasoning in networks (1990s) - J. Pearl Turing
Award in 2012
Random eld - Spatial data analysis
S. Lakshmivarahan
Supervised learning
Learning with a teacher - Learning in Neural Networks
Learning with a probabilistic teacher - using imprecise
knowledge
S. Lakshmivarahan
Summary
At the rst level Data Mining seeks to uncover the basic laws
that are hidden in the data. These laws are presented by
models of some kind with unknown parameters
At the second level Data Assimilation deals with the task of
fusing data with models to produce an assimilated model - by
estimating the unknown parameters
At the third level, using the given assimilated model produce
various forecast products for public consumption
DM, DA and Forecasting are the three parts of a continuum
in knowledge discovery
S. Lakshmivarahan
References
S. Lakshmivarahan