Full Download Data Mining A Tutorial Based Primer 1st Edition Roiger Solutions Manual

Data Mining A Tutorial Based Primer 1st Edition Roiger Solutions Manual
Data Mining A Tutorial Based Primer 1st

Edition Roiger Solutions Manual
Visit to get the accurate and complete content:
https://testbankfan.com/download/data-mining-a-tutorial-based-primer-1st-edition-roig
er-solutions-manual/
Visit TestBankFan.com to get complete for all chapters

CHAPTER 2
Review Questions
1. Differentiate between the terms:
a. A data mining strategy is a template for problem solving. A data mining techique involves the
application of a strategy to a specific set of data.
b. A set if independent variables is used to build a model to determine the values of one or more
dependent variables.
2. Yes on both counts. As one example, feed-forward neural networks and linear regression models can
both be used for estimation problems. Likewise, neural networks can be used for estimation as well as
prediction problems. It is the nature of the data, not the data mining technique, that determines the data
mining strategy.
3. Is each scenario a classification, estimation or prediction problem?

a. This is a prediction problem as we are trying to determine future behavior.
b. This is a classification problem as we are classifying individuals as a good or poor secured credit
risks.
c. This is a prediction problem.
d. This is a classification problem as the violations have already occurred.
e. This is a classification or estimation problem depending on how the output variable(s) are
represented.
4. There are no absolute answers for this question. Here are some possibilities.
a. For 3a: As there is an output attribute and the attribute is numeric, a neural network is a good
choice. Statistical regression is also a possibility.
For 3b, 3d, 3e: A decision tree model or a production rule generator is a good choice as an output
attribute exists and we are likely to be interested in how the model reaches its conclusions.
For 3c: A neural network model is a best choice as the output can be interpreted as the probability
of a stock split.
b. For 3a: Any technique limited to categorical output attributes would be of limited use as we are
interested in a numeric output.
For 3b, 3d, 3e: Any technique that does not explain its behavior is a poor choice.
For 3c: A numeric output between 0 and 1 inclusive that can be treated as a probability of a stock
split allows us to make a better determination of whether a stock is likely to split. Therefore, any
technique whose output attribute must be categorical is a poor choice.
c. Answers will vary.
5. For supervised learning decision trees, production rules and association rules provide information
about the relationships seen between the input and output attributes. Neural networks and regression
models do a poor job of explaining their behavior. Various approaches to unsupervised clustering have
not been discussed at this point.

6. As home mortgages represent secured credit, model A is likely to be the best choice. However, if the
lender’s cost for carrying out a forclosure is high, model B may be a better alternative.
7. As the cost of drilling for oil is very high, Model B is the best choice.
8. If clusters that differentiate the values of the output attribute are formed, the attributes are appropriate.
9. Each formed cluster is designated as a class. A subset of the instances from each class are used to build
a supervised learner model. The remaining instances are used for testing the supervised model. The test
set accuracy of the supervised model will help determine if meaningful clusters have been formed.
Data Mining Questions

1. The architecture of the network should appear similar to the network shown in Figure 2.2. However, the
network should have 4 input-layer nodes, 5 hidden-layer nodes, and 1 output-layer node.
2. Students find this to be an interesting exercise. You may wish to discuss credit card billing categories
and help students set up their individual spreadsheets.
Computational Questions
1. Consider the following three-class confusion matrix.
a. 86%
b. 48, 45, 7
c. 2
d. 0
2. Suppose we have two classes each with 100 instances.

a. 40
b. 10
3. Consider the confusion matrices shown below.

a. 2.008
b. 2.250
4. Let + represent the class of individuals responding positively to the flyer. Then we have,
Dividing the first fraction by the second fraction and simplifying gives the desired result.
C11
P (  | Sample) 
C11  C21
C11  C12
P (  | Population) 
P

Another random document with
no related content on Scribd:
OR IMPLIED, INCLUDING BUT NOT LIMITED TO
WARRANTIES OF MERCHANTABILITY OR FITNESS FOR
ANY PURPOSE.
1.F.5. Some states do not allow disclaimers of certain implied

warranties or the exclusion or limitation of certain types of
damages. If any disclaimer or limitation set forth in this
agreement violates the law of the state applicable to this
agreement, the agreement shall be interpreted to make the
maximum disclaimer or limitation permitted by the applicable
state law. The invalidity or unenforceability of any provision of
this agreement shall not void the remaining provisions.
1.F.6. INDEMNITY - You agree to indemnify and hold the

Foundation, the trademark owner, any agent or employee of the
Foundation, anyone providing copies of Project Gutenberg™
electronic works in accordance with this agreement, and any
volunteers associated with the production, promotion and
distribution of Project Gutenberg™ electronic works, harmless
from all liability, costs and expenses, including legal fees, that
arise directly or indirectly from any of the following which you do
or cause to occur: (a) distribution of this or any Project
Gutenberg™ work, (b) alteration, modification, or additions or
deletions to any Project Gutenberg™ work, and (c) any Defect
you cause.
Section 2. Information about the Mission of

Project Gutenberg™
Project Gutenberg™ is synonymous with the free distribution of
electronic works in formats readable by the widest variety of
computers including obsolete, old, middle-aged and new
computers. It exists because of the efforts of hundreds of
volunteers and donations from people in all walks of life.
Volunteers and financial support to provide volunteers with the

assistance they need are critical to reaching Project
Gutenberg™’s goals and ensuring that the Project Gutenberg™
collection will remain freely available for generations to come. In
2001, the Project Gutenberg Literary Archive Foundation was
created to provide a secure and permanent future for Project
Gutenberg™ and future generations. To learn more about the
Project Gutenberg Literary Archive Foundation and how your
efforts and donations can help, see Sections 3 and 4 and the
Foundation information page at www.gutenberg.org.
Section 3. Information about the Project

Gutenberg Literary Archive Foundation
The Project Gutenberg Literary Archive Foundation is a non-
profit 501(c)(3) educational corporation organized under the
laws of the state of Mississippi and granted tax exempt status by
the Internal Revenue Service. The Foundation’s EIN or federal
tax identification number is 64-6221541. Contributions to the
Project Gutenberg Literary Archive Foundation are tax
deductible to the full extent permitted by U.S. federal laws and
your state’s laws.
The Foundation’s business office is located at 809 North 1500

West, Salt Lake City, UT 84116, (801) 596-1887. Email contact
links and up to date contact information can be found at the
Foundation’s website and official page at
www.gutenberg.org/contact
Section 4. Information about Donations to

the Project Gutenberg Literary Archive
Foundation
Project Gutenberg™ depends upon and cannot survive without
widespread public support and donations to carry out its mission
of increasing the number of public domain and licensed works
that can be freely distributed in machine-readable form
accessible by the widest array of equipment including outdated
equipment. Many small donations ($1 to $5,000) are particularly
important to maintaining tax exempt status with the IRS.
The Foundation is committed to complying with the laws

regulating charities and charitable donations in all 50 states of
the United States. Compliance requirements are not uniform
and it takes a considerable effort, much paperwork and many
fees to meet and keep up with these requirements. We do not
solicit donations in locations where we have not received written
confirmation of compliance. To SEND DONATIONS or
determine the status of compliance for any particular state visit
www.gutenberg.org/donate.
While we cannot and do not solicit contributions from states

where we have not met the solicitation requirements, we know
of no prohibition against accepting unsolicited donations from
donors in such states who approach us with offers to donate.
International donations are gratefully accepted, but we cannot

make any statements concerning tax treatment of donations
received from outside the United States. U.S. laws alone swamp
our small staff.
Please check the Project Gutenberg web pages for current

donation methods and addresses. Donations are accepted in a
number of other ways including checks, online payments and
credit card donations. To donate, please visit:
www.gutenberg.org/donate.
Section 5. General Information About Project

Gutenberg™ electronic works
Professor Michael S. Hart was the originator of the Project
Gutenberg™ concept of a library of electronic works that could
be freely shared with anyone. For forty years, he produced and
distributed Project Gutenberg™ eBooks with only a loose
network of volunteer support.
Project Gutenberg™ eBooks are often created from several

printed editions, all of which are confirmed as not protected by
copyright in the U.S. unless a copyright notice is included. Thus,
we do not necessarily keep eBooks in compliance with any
particular paper edition.
Most people start at our website which has the main PG search
facility: www.gutenberg.org.
This website includes information about Project Gutenberg™,

including how to make donations to the Project Gutenberg
Literary Archive Foundation, how to help produce our new
eBooks, and how to subscribe to our email newsletter to hear
about new eBooks.

Full Download Data Mining A Tutorial Based Primer 1st Edition Roiger Solutions Manual

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Full Download Data Mining A Tutorial Based Primer 1st Edition Roiger Solutions Manual

Uploaded by

Copyright:

Available Formats

Data Mining A Tutorial Based Primer 1st Edition Roiger Solutions Manual

Data Mining A Tutorial Based Primer 1st

Visit TestBankFan.com to get complete for all chapters

3. Is each scenario a classification, estimation or prediction problem?

Visit TestBankFan.com to get complete for all chapters

Data Mining Questions

2. Suppose we have two classes each with 100 instances.

3. Consider the confusion matrices shown below.

Visit TestBankFan.com to get complete for all chapters

1.F.5. Some states do not allow disclaimers of certain implied

1.F.6. INDEMNITY - You agree to indemnify and hold the

Section 2. Information about the Mission of

Volunteers and financial support to provide volunteers with the

Section 3. Information about the Project

The Foundation’s business office is located at 809 North 1500

Section 4. Information about Donations to

The Foundation is committed to complying with the laws

While we cannot and do not solicit contributions from states

International donations are gratefully accepted, but we cannot

Please check the Project Gutenberg web pages for current

Section 5. General Information About Project

Project Gutenberg™ eBooks are often created from several

This website includes information about Project Gutenberg™,

You might also like