Project Description

KD 24203 Data Mining and Warehousing

Project

100%

Due Date: 6 July 2023 (Thursday)

Instructions:

• Compile your report into a PDF and upload it to Itel.
• Make sure your PDF can be opened; double-check the version you uploaded to Itel.

1. Objectives

This project provides students with the opportunity to learn and apply the following skills:
• Identify a real-world problem that has a data mining and warehousing solution.
• Perform an appropriate data mining process to solve the problem.
• Illustrate an appropriate data warehouse design for the problem.
• Apply data mining software and toolkits to solve the problem.

2. Assessment

The learning outcomes assessed are:


CLO 3: Demonstrate a data mining process for an application, including data presentation,
modelling and evaluation, and data warehousing. (C3-PLO2)

CLO 4: Perform data mining process using data mining software and toolkits in a range of
applications. (P4-PLO3)

Refer to the Project Rubrics for details.

3. Group Formation

Form teams of 5 persons for this project.

Every team member is expected to contribute to and participate actively in the entire process of
completing the project. Team members are required to share ideas and assist one another in
completing the project.

4. Project Tasks for Report

Students are required to identify a real-world problem that needs a solution from data mining
and data warehousing. To avoid plagiarism, students are strictly not allowed to reuse an existing
project or work from any other source.

Assume that your team has been commissioned to initiate a project whose objectives are to
identify an existing real-world problem and to propose a solution to it by performing a data
mining process. You are required to use R software to perform the data mining process, which
includes data preparation, analysis or modelling, and evaluation.
At the end, you are required to produce a report which consists of the following sections:
a) Introduction
i. Choose and describe ONE existing real-world problem which can be solved using
one of the following data mining tasks: Classification, Clustering, or Outlier Detection.
Your description should also include the background and the importance of solving the
problem.
ii. Describe the proposed data mining method used to solve the problem.

b) Literature Review
i. Review the most recent existing data mining research works which are related
to solving your chosen problem.
ii. Based on your review, choose and describe the TWO data mining methods that are
best suited to solving the problem.

c) Data Preparation
i. Based on the identified problem, use ONE appropriate data set and suitable data
pre-processing methods to prepare the chosen data set for analysis. You are
encouraged to collect real data on your own using a questionnaire.
ii. Perform the data preparation process using R software.
iii. Describe the data set, the data preparation methods, and the preparation
process.

d) Data Mining
i. Use the chosen data mining method for exploring, analyzing, and extracting
important information from the prepared data set.
ii. Perform the data mining process based on the chosen method by using R
software.
iii. Describe the data mining method, the resulting data mining model, and any
important information obtained from the mining process.

e) Evaluation
i. You are required to fine-tune the parameter settings of the data mining method
in order to achieve a high-quality model.
ii. Perform an appropriate evaluation of the model resulting from the data mining
process using R.
iii. Discuss the evaluation of the quality of the model.

f) Data Warehouse
i. Construct a data warehouse that best supports and improves the data mining process.

g) Conclusion
i. Provide a summary of the important findings obtained from the project.
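
For illustration only, the data preparation, data mining, and evaluation steps in sections (c) to (e) could be sketched in R roughly as follows. The sketch assumes the built-in iris data set and k-means clustering from base R; both are placeholder choices made for this example and are not part of the project requirements. Teams must select their own problem, data set, and method.

```r
# Illustrative sketch only: built-in iris data and k-means clustering (base R).
# The data set, the method, and the value k = 3 are assumptions for this example.
set.seed(42)                                   # make the random starts reproducible

# (c) Data preparation: keep numeric features, drop incomplete rows, standardise.
keep     <- complete.cases(iris[, 1:4])
features <- iris[keep, 1:4]
scaled   <- scale(features)                    # zero mean, unit variance per column

# (e) Parameter tuning: total within-cluster sum of squares for several k.
k_values <- 2:6
wss <- sapply(k_values, function(k) {
  kmeans(scaled, centers = k, nstart = 25)$tot.withinss
})
names(wss) <- k_values

# (d) Data mining: fit the model with a k read off the elbow of the WSS curve.
model <- kmeans(scaled, centers = 3, nstart = 25)

# (e) Evaluation: cross-tabulate clusters against the known species labels.
confusion <- table(cluster = model$cluster, species = iris$Species[keep])

print(wss)
print(confusion)
```

Plotting wss against k_values gives the usual elbow plot for justifying the chosen number of clusters in the report. A classification or outlier detection project would need different evaluation measures.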

5. The Final Project Report Format

The final project report for all Parts should contain the following items:
(a) Cover sheet (Appendix - FORM 1)
(b) Table of Contents
(c) Body of answers
(d) Reference section (Students are required to use Harvard Referencing System format)
(e) Appendices: Plagiarism report

The report must be type-written using MS-Word. You are recommended to format your
report according to the following specification:

Media                Students are required to submit a softcopy: a well-written and
                     properly formatted report in PDF format.
                     Softcopy to be submitted to Itel.

Font Size            A body text of font size 12 is required, while for headings and
                     subheadings a larger font size must be used.

Font Style           Use Times New Roman for body text. Main headings and
                     sub-headings should be clearly stated using suitable font styles
                     (e.g. Arial).

Line Spacing         Typed material should be 1.5-line spaced.

Alignment            Use Justify for alignment.

Headers and Footers  Appropriate footers and headers should be used to enhance clarity
                     and presentation.

Page Numbering       Ensure that all pages (except cover page) are numbered.

Paper Size           Use A4 paper (29.7cm x 21cm).

Binding              2-hole plastic binder clip. Use only one side of the paper.

Table 1: Written Report Format

6. Submission Deadline

Project Part 1 report presentation (Items a and b): Week 9, 25 May 2023 (Thursday)
Final project report (Part 1 and 2) deadline: Week 14, 6 July 2023 (Thursday)

7. Late Submission

In certain circumstances, a student may be allowed to submit the project report late with a valid
reason. S/he must inform the lecturer at least one week before the project is due. The
lecturer will evaluate whether the circumstance warrants submitting the project report late,
but there is no guarantee that the student will not be penalized.

As a general rule, no extension of time will be granted. The project description and its due
dates are normally disclosed to students in advance so that they can manage their time
around their progress in other courses and complete this project on time.

8. Academic Integrity and Plagiarism

Any cheating, attempt to cheat, plagiarism, collusion, or other attempt to gain an
unfair advantage in assessment will result in the students concerned being penalized.

9. Group Member Contribution

No marks will be given to a student who does not contribute to completing the project.

FACULTY OF COMPUTING AND INFORMATICS

KD24203 DATA MINING AND WAREHOUSING

Project Title:

Prepared by:

No. Name Matric No.


1.
2.
3.
4.
5.
