Data Mining MCQ FINAL
UNIT-1
a. Volatile
b. Distributed
c. Non-Volatile
Ans: c
2. Which one of the following is not a tool for Data warehouse development?
a. COGNOS
b. SCCS
c. Informatica
d. Business Objects
Ans: b
3. The Data Warehouse does not cater to the Real-time operational requirements of
the enterprise. (True/False).
Ans: True
b. Analysis
c. Validation
Ans: b
Ans: True
a. Production Data
b. Sales Data
c. Marketing Data
d. Purchase Data
Ans: a
a. Department level
b. Limited in size
c. Read-only
d. All the above
Ans: d
8. The three major Data Staging Components are Data Extraction, Data
Transformation and ___.
a. Data Retrieval
b. Data Loading
c. Data Refresh
d. Data Access
Ans: b
a. Relation database
b. MDDB
c. Flat files
Ans: a
a. Many to many
d. Many to one
Ans: a
11. Each Dimension table has a ___ relationship to the fact table.
a. Many to many
b. One to many
c. Many to one
d. One to one
Ans: b
12. Dimensional table and a fact table can be connected with the following
database keys:
a. Foreign key
b. Surrogate key
c. Candidate key
Ans: a
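As an illustration of the foreign-key link between a fact table and a dimension table, here is a minimal Python sketch. The tables, column names, and the `sales_by_category` helper are hypothetical and exist only for this example; a real warehouse would express the same join in SQL.

```python
# Hypothetical dimension table, keyed by its primary key.
product_dim = {
    1: {"name": "Laptop", "category": "Electronics"},
    2: {"name": "Desk", "category": "Furniture"},
}

# Hypothetical fact table: each row carries a foreign key (product_id)
# that points at exactly one row of the dimension table.
sales_fact = [
    {"product_id": 1, "amount": 1200.0},
    {"product_id": 2, "amount": 300.0},
    {"product_id": 1, "amount": 1150.0},
]


def sales_by_category(facts, dim):
    """Join facts to the dimension via the foreign key and sum by category."""
    totals = {}
    for row in facts:
        category = dim[row["product_id"]]["category"]  # foreign-key lookup
        totals[category] = totals.get(category, 0.0) + row["amount"]
    return totals


category_totals = sales_by_category(sales_fact, product_dim)
```

One dimension row (product 1) is referenced by many fact rows, which is the one-to-many relationship asked about in the previous question.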
13. In a Data Warehouse, linking a single record to all the duplicate records in the source
systems is called ___.
a. Decoding of fields
b. De-duplication
c. Merging of Information
d. Summarization
Ans: b
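De-duplication can be sketched in a few lines of Python. The matching rule used here (same e-mail address, ignoring case and surrounding whitespace) and all field names are assumptions made for the example; real tools use much richer matching logic.

```python
def deduplicate(source_records):
    """Link duplicate source records to one master record per customer."""
    masters = {}
    for rec in source_records:
        # Assumed matching rule: same e-mail, ignoring case and whitespace.
        key = rec["email"].strip().lower()
        master = masters.setdefault(key, {"email": key, "source_ids": []})
        # The master record keeps a link back to every duplicate it covers.
        master["source_ids"].append((rec["system"], rec["id"]))
    return list(masters.values())


# Hypothetical records arriving from two source systems.
source_records = [
    {"system": "billing", "id": 17, "email": "Ann@Example.com"},
    {"system": "crm", "id": 404, "email": "ann@example.com "},
    {"system": "crm", "id": 405, "email": "bob@example.com"},
]

master_records = deduplicate(source_records)
```

Three source rows collapse into two master records, and the first master keeps links to both of its source rows.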
a. Initial load
b. Incremental load
c. Iterative load
d. Full refresh
Ans: c
15. Adding value to the data to give it more meaning is called ___.
a. Data cleansing
b. Data profiling
c. Data integration
d. Data Enrichment
Ans: d
a. One
b. Four
c. Five
d. Two
Ans: c
Ans: a
Ans: b
19. Which of the following are the intermediate servers that stand in between a
relational back-end server and client front end tools?
a. ROLAP
b. MOLAP
c. HOLAP
d. All of the above
Ans: d
Ans: True
b. B+ tree indexing
c. Compression indexing
d. Clustered indexing
Ans: b
Ans: d
a. Extraction
b. Cleaning
c. Loading processes
Ans: d
24. Storing, data mapping and transformation from source systems to the Data
Warehouse fall into:
a. Technical metadata
b. Operational metadata
c. Business metadata
Ans: a
25. Key hierarchies and key performance indicators are ___ kind of metadata.
a. Technical metadata
b. Operational metadata
c. Business metadata
Ans: c
a. Unit testing
b. Regression
c. User acceptance testing
d. Integration testing
Ans: a
27. Which of the following are the main areas of testing that should be done for the
ETL process?
a. Making sure that all the records in the source system that should be brought into
the warehouse and all the components of the ETL process are complete.
b. All of the extracted source data is correctly transformed into dimension tables
and fact tables
c. All of the extracted and transformed data is successfully loaded into Data
Warehouse
d. All of the above
Ans: d
28. The advantage of using a data cube is that it allows fast indexing to
precomputed summarized data. (True/False)
Ans: True
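The idea behind precomputed summaries can be shown with a small Python sketch. The `build_cube` function, the sample rows, and the key layout are hypothetical; actual OLAP servers store and index cubes very differently, but the principle of aggregating once and then answering by lookup is the same.

```python
from collections import defaultdict
from itertools import combinations


def build_cube(rows, dimensions, measure):
    """Precompute the summed measure for every subset of the dimensions."""
    cube = defaultdict(float)
    for row in rows:
        for size in range(len(dimensions) + 1):
            for dims in combinations(dimensions, size):
                # Key: the chosen dimensions with this row's values.
                key = tuple((d, row[d]) for d in dims)
                cube[key] += row[measure]
    return dict(cube)


# Hypothetical sales rows.
rows = [
    {"region": "East", "year": 2023, "sales": 100.0},
    {"region": "East", "year": 2024, "sales": 150.0},
    {"region": "West", "year": 2024, "sales": 200.0},
]

cube = build_cube(rows, ("region", "year"), "sales")

# Summaries are now plain dictionary lookups, with no rescan of the rows.
east_total = cube[(("region", "East"),)]
total_2024 = cube[(("year", 2024),)]
grand_total = cube[()]
```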
30. Which of the following analytic tools should be used for extracting the data
from the Data Warehouse?
a. OLAP tools
c. SQL
Ans: d
UNIT-2
c. Genetic algorithms
d. Decision trees
Ans: c
a. Data mining
b. Data warehouse
c. Databases
Ans: a
3. Predictive modelling requires which of the following data sets for initial model
creation?
Ans: a
Ans: d
5. Which of the following is a private network used to access data through the
web?
a. Internet
b. Extranet
c. Intranet
Ans: c
a. Web technology
b. Grid computing
c. Artificial intelligence
d. None of these
Ans: a
a. Distributed system
c. Parallel system
Ans: a
8. The system that delivers the results of requests for information through remote
browsers is called ___.
a. Web browser
b. Information delivery
c. Data presentation
d. Data dissemination
Ans: b
a. Charles Babbage
b. Ralph Kimball
c. Bill Inmon
d. Fritz Bauer
Ans: c
a. Star schema
b. Snow-Flake schema
c. Fact-Constellation
d. None of these
Ans: b
Ans: d
a. Facts, Information
b. Dimensions, Weight
c. Dimensions, Facts
d. Data, Information
Ans: c
a. Three
b. Two
c. Four
d. One
Ans: c
14. Writing the same data to two disk drives connected to the same controller is
known as ___.
a. Data Duplexing
b. Data Mirroring
c. Disk Striping
d. Data Profiling
Ans: b
15. ___ provides the Enterprise with intelligence and ___ provides the Enterprise
with a memory.
Ans: c
a. Clementine
b. Intelligent Miner
c. Weka3
d. Enterprise Miner
Ans: c
17. In the star schema, the dimension table is ___ and the fact table is ___.
a. Wide, Wide
b. Wide, Deep
c. Deep, Wide
d. Deep, Deep
Ans: b
18. Which of the following is an open-source ETL tool?
a. Clover
d. Microsoft DTS
Ans: a
c. Work on fact and business subjects for which all users have the same meaning
Ans: d
c. Both a and b.
d. None of these
Ans: c
b. System, User
c. System, Event
d. Insert, Update
Ans: b
Ans: b
b. Catalogue of data
Ans: d
24. Which of the following interfaces are used to access the Data Warehouse?
a. Browser
b. Search engine
c. Active X applets
Ans: d
25. Data mining is a ____-driven approach, not a ____-driven approach.
a. Event, Data
b. Data, User
c. User, Event
d. User, Data
Ans: b
Ans: a
27. Which of the following RAID levels does not implement error checking?
a. RAID1
b. RAID (0+1)
c. RAID0
d. RAID5
Ans: c
28. ____ and ____ of data take place on a large scale in the data staging area.
a. Sorting, searching
b. Searching, merging
c. Sorting, merging
d. Searching, acquisition
Ans: c
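Sorting and merging in a staging area can be sketched with Python's standard library. The two extracts and the `order_id` field are hypothetical; the sketch only shows the pattern of sorting each source extract and then merging the sorted streams.

```python
import heapq

# Hypothetical extracts from two source systems, in arrival order.
store_a = [{"order_id": 3, "amount": 20.0}, {"order_id": 1, "amount": 5.0}]
store_b = [{"order_id": 2, "amount": 12.5}, {"order_id": 4, "amount": 8.0}]


def by_order_id(record):
    """Sort key shared by both extracts."""
    return record["order_id"]


# Step 1: sort each extract on the common key.
sorted_a = sorted(store_a, key=by_order_id)
sorted_b = sorted(store_b, key=by_order_id)

# Step 2: merge the already-sorted streams into one ordered stream.
merged = list(heapq.merge(sorted_a, sorted_b, key=by_order_id))
merged_ids = [record["order_id"] for record in merged]
```

`heapq.merge` assumes its inputs are already sorted, which is why the sort step comes first.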
29. True/False
Ans: c
30. True/False
2. OLAP tools enable the user to access the data in Data Warehouse in an
interactive manner.
3. Data mining is a data-driven approach, not a user-driven approach
Ans: a
UNIT-3
1. OLTP stands for ___.
Ans. True
Ans. False
4. Data Warehouse is a database that is designed for facilitating ___ and ___.
Ans. Non-Volatile
6. Data Warehouse contains only aggregated data and individual transactions (true/false)
Ans. True
7. List the types of the data warehouse.
8. A ___ Data Warehouse allows changes in the information to be monitored and recorded over time.
Ans. time-variant
9. The Data Warehouse functions as ___ and an Executive Information System (EIS).
Ans. DSS
Ans. Metadata
Ans. Analysis
Ans. Historical
13. In most organizations, two groups of people are key to the success of the project, ___ and ___.
15. Data Warehouses do not require real-time validation (True / False)
Ans. True
16. In most organizations, two groups of people are key to the success of the project, ___ and ___.
17. In Data Warehouse, the requirements are gathered subject area wise. (True / False)
Ans. True
18. The 3 major functions that need to be performed to get the data ready for the Data
Warehouse are extraction, transformation and ___.
Ans. Loading
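The three functions can be illustrated with a minimal Python sketch using only the standard library. The raw CSV text, the `sales` table, and the cleaning rules are assumptions made for the example, and an in-memory SQLite database stands in for the warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical raw extract from a source system.
RAW = "id,name,amount\n1, alice ,10.50\n2,BOB,7\n"


def extract(text):
    """Extraction: read raw rows from the source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transformation: fix types and normalise the name format."""
    return [
        (int(r["id"]), r["name"].strip().title(), float(r["amount"]))
        for r in rows
    ]


def load(conn, rows):
    """Loading: write the cleaned rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(id INTEGER PRIMARY KEY, name TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")  # stand-in for the warehouse
load(conn, transform(extract(RAW)))
loaded = conn.execute("SELECT id, name, amount FROM sales ORDER BY id").fetchall()
```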
19. ___ and ___ of data take place on a large scale in the data staging area.
a. To remove redundancy
d. None
Ans. a
22. E-R modelling and Dimensional modelling are the same (True / False)
Ans. False
23. A Dimension is an entity or subject area, which can group the data (True / False)
Ans. True
a. Relational database
b. MDDB
c. Flat files
e. None
Ans. a
28. Customer name change in the dimensional model comes under ___.
Ans. Slowly-changing-dimension
29. The most popular model for the data warehouse is ___.
30. Which of the following schemas supports normalization in dimensional modelling?
a. Star Schema
b. Snow-Flake schema
c. Fact-Constellation
Ans. b
UNIT-4
1. Each dimension table is in ___ relationship with the central fact table.
Ans. One-to-many
2. Dimensional table and a fact table can be connected with the following database keys:
a. Foreign key
b. Surrogate key
c. Candidate key
Ans. a
4. OLAP tools are data accessing and discovery tools (True / False)
Ans. True
5. In Data Warehouse a system with multiple architectures is called ___
a) Department level
b) Limited in size
c) Read-only
Ans. d
Ans. EIS
8. Data extraction, ___ and ___ encompass the areas of data acquisition and data storage.
9. Populating all the Data Warehouse tables for the very first time is called ___.
d) Microsoft DTS
e) Clover
Ans. Clover
13. OLAP tools enable the user to access the data in Data Warehouse in an interactive manner (True / False)
Ans. True
Ans. OLTP
Ans. true
17. Which of the following are the intermediate servers that stand in between a relational back-end
server and client front-end tools?
a. ROLAP
b. MOLAP
c. HOLAP
Ans. all
18. The advantage of using a data cube is that it allows fast indexing to precomputed summarized data. (True / False)
Ans. true
19. In a Data Warehouse, linking a single record to all the duplicate records in the source systems is called
___.
Ans. De-duplication
20. Sorting the data in the given source file is a transformation (True / False).
Ans. True
23. Key hierarchies and key performance indicators are ___ kind of Metadata.
24. Storing, data mapping and transformation from source systems to the data warehouse fall into:
a. Technical metadata
b. Operational metadata
c. Business metadata
Ans. a
a. Extraction
b. Cleaning
c. Loading processes
Ans. d
26. One tool that can allow data warehouse managers to deal with metadata is called___.
Ans. Repository
Ans. Metadata
29. Information can be converted into knowledge about ___ patterns and future trends.
Ans. Historical
Ans. Metadata
UNIT-5