Professional Documents
Culture Documents
DMBD (3rd) Dec2020
DMBD (3rd) Dec2020
of Pages 02
Total No. of Questions : 17
INSTRUCTIONS TO CANDIDATES:
1. SECTION-A contains EIGHT questions carrying TWO marks each and students
has to attempt ALL questions.
2. SECTION-B consists of FOUR Subsections : Units-I, I, l & iV. Each Subsection
contains TWO questions each carrying ElGHT marks each and student has to
attempt any ONE question from each Subsection.
3 SECTION-C is COMPULSORY and consist of ONE Case Study carrying TWELVE
marks.
o m
.r c
SECTION-A
1. What do you mean by Multiple Linear Regression?
2
p e
How is data warehouse different from database? How are they similar?
m
5.
a
What is Data Discretization?
o
List any four
p
applications of Data Mining.
r .r c
What is
b
Context Based Mining?
b r
SECTION-B
UNIT-I
9 Compare and contrast Data Warehouse and Data Mart? Also specify the reasons for
creating a Data Mar
UNIT-III
13 Explain Apriori algorithm with the help of an example.
14 What is Classification? Explain Bayesian Classification with the help of suitable
example.
UNIT-IV
16. a) What is Regression? Explain Linear Regression with the help of an example.
o m
.r c
SECTION-C
p e
A database has four transactions. Let min_sup = 60% and min_conf
m
80%
pa o
.r c
TID Date Items-Bought
br
100 11/10/19 (K,A, B,D
200
300
11/10/19
19/10/19
p e D, A, C, E, B}
C, A, B, E
400 22/10/19
p a B, A, D
a. Find all frequent items using Apriori.
b r
b. List all of the strong association rules (with support s and confidence c) matching the
following metarule where X is a variable representing customers, and item i denotes
variables representing items (e.g.. "A", "B", etc.):
NOTE: Disclosure of Identity by writing Mobile No. or Making ofpassing request on any
page of Answer Sheet will lead to UMC against the Student.
21 S321458