Professional Documents
Culture Documents
Oracle Data Mining
Oracle Data Mining
TOPICS
1 2 3 4
DEVELOPER
INTRODUCTION
HISTORY
FUNCTIONS
DEVELOPER
INTRODUCTION
Oracle Data Mining (ODM) is an option of Oracle Corporation's Relational Database Management System (RDBMS) Enterprise Edition (EE). It contains several data mining and data analysis algorithms for classification, prediction, regression, associations, feature selection,
Oracle Data Mining (ODM) provides powerful data mining functionality as native SQL functions within the Oracle Database.
Oracle Data Mining enables users to discover new insights hidden in data
and to leverage investments in Oracle Database technology. With Oracle Data Mining, you can build and apply predictive models that help you target your best customers, develop detailed customer profiles, and find and prevent fraud. Oracle Data Mining, a component of the Oracle Advanced Analytics Option, helps companies better "compete on analytics."
The Oracle Data Miner "work flow" based GUI, an extension to SQL
Developer, allows data analysts to explore their data, build and evaluate
models, apply them to new data and save and share their analytical methodologies. Data analysts and application developers can use the SQL APIs to build nextgeneration applications that automatically mine star schema data to build and deploy predictive models that deliver real-time results and predictions throughout the enterprise.
Because the data, models and results remain in the Oracle Database, data
movement is eliminated, information latency is minimized and security is maintained.
Additionally, Oracle Data Mining models can be included in SQL queries and embedded in applications to offer improved business intelligence. Data analysts can quickly access their Oracle data using Oracle Data Miner 11g Release 2 graphical user interface and explore their data to find patterns, relationships, and hidden insights.
HISTORY
Oracle Data Mining was first introduced in 2002 and its releases are named according to the corresponding Oracle database release:
Oracle Data Mining 9iR2 (9.2.0.1.0 - May 2002) Oracle Data Mining 10gR1 (10.1.0.2.0 - February 2004) Oracle Data Mining 10gR2 (10.2.0.1.0 - July 2005) Oracle Data Mining 11gR1 (11.1 - September 2007) Oracle Data Mining 11gR2 (11.2 - September 2009)
FUNCTIONS
As of release 11gR1 Oracle Data Mining contains the
Classification:
Naive Bayes (NB). Generalized linear model (GLM) for Logistic regression. Support Vector Machine (SVM). Decision Trees (DT). Regression:
Feature extraction:
Data mining (the advanced analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science,[2][3][4] is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems.[2] The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use. Analysis of data is a process of inspecting, cleaning, transforming, and modeling data with the goal of highlighting useful information, suggesting conclusions, and supporting decision making. In machine learning and statistics, classification is the problem of identifying to which of a set of categories (sub-populations) a new observation belongs, on the basis of a training set of data containing observations (or instances) whose category membership is known.
A prediction (Latin pr-, "before," and dicere, "to say") or forecast is a statement about the way things will happen in the future, often but not always based on experience or knowledge. While there is much overlap between prediction and forecast, a prediction may be a statement that some outcome is expected, while a forecast is more specific, and may cover a range of possible outcomes. In statistics, regression analysis is a statistical technique for estimating the relationships among variables. It includes many techniques for modeling and analyzing several variables, when the focus is on the relationship between a dependent variable and one or more independent variables. association rule learning is a popular and well researched method for discovering interesting relations between variables in large databases. It is intended to identify strong rules discovered in databases using different measures of interestingness.
In machine learning and statistics, feature selection, also known as variable selection, attribute selection or variable subset selection, is the process of