Professional Documents
Culture Documents
Data Mining Methods Basics Q&A
Data Mining Methods Basics Q&A
The process of extracting valid, useful, unknown info from data and using it to
make proactive knowledge driven business is called
Data mining -- Correct
***********************************************************************************
***********************************************
What is the other name for Data Preparation stage of Knowledge Discovery Process?
ETL -- Correct
Which of the following modelling type should be used for Labelled data?
Predictive Modelling -- Correct
Noisy values are the values that are valid for the dataset, but are incorrectly
recorded
True -- Correct
***********************************************************************************
***********************************************
Probability of theft in an area is 0.03 with expected loss of 20% or 30% of things
with probabilities 0.55 and 0.45. Insurance policy from A costs $150 pa with 100%
repayment. Policy with B, costs $100 pa and first $500 of any loss has to be paid
by the owner. Which data mining technique can be used to choose the policy?
Decision Tree -- Correct
Statistical technique used for investigating and modelling the relationship between
two or more variables is:
Regression analysis -- Correct
***********************************************************************************
***********************************************
Machine learning task of inferring a function from labelled training data is known
as
Supervised Learning -- Correct
Which is the statistical technique used for investigating and modelling the
relationship between two or more variables?
Regression analysis -- Correct
Which data mining method groups together objects that are similar to each other and
dissimilar to the other objects?
Clustering -- Correct
Which of the following activities are performed as part of data pre processing?
All the options -- Correct
_________ are the values that mark the boundaries of the confidence interval.
Confidence limits -- Correct
The process of extracting valid, useful, unknown info from data to make proactive
knowledge driven business is called
Data mining -- Correct