2021 ITS665 - ISP565 - GROUP PROJECT-revMac21
1. Search for and select a dataset that matches your interests. It should contain enough
instances (at least 1000) and between 15 and 30 attributes, with a good mix of numeric
and nominal attributes and, if possible, some missing values. (If there are no missing
values, then you need to perform other relevant processes.)
2. Describe your project problem, the data, and the source of the dataset.
3. Find two academic articles (literature reviews) related to the topic that you have selected
and discuss how they help you to understand the project.
4. Each group is required to develop one method only: classification. (If your group is
interested in doing association and clustering, please refer to your lecturer.)
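The selection criteria in item 1 can be checked with a short script before the dataset is loaded into WEKA. The following is a minimal sketch, not part of the brief: it assumes the dataset is a CSV file with a header row, and the function names (`summarize`, `meets_criteria`, `summarize_csv`) and the set of missing-value markers are illustrative assumptions.

```python
import csv

# Markers treated as missing. "?" is the marker used in WEKA's ARFF format;
# the others are assumptions about how a CSV export might encode gaps.
MISSING_MARKERS = {"", "?", "NA", "NaN"}


def is_number(text):
    """Return True if the cell parses as a float."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def summarize(rows):
    """Summarize a list of dict rows (as produced by csv.DictReader)."""
    columns = [c for c in rows[0] if c is not None] if rows else []
    numeric, nominal, missing = [], [], 0
    for col in columns:
        cells = [(r.get(col) or "").strip() for r in rows]
        present = [c for c in cells if c not in MISSING_MARKERS]
        missing += len(cells) - len(present)
        # A column counts as numeric only if every present value parses.
        if present and all(is_number(c) for c in present):
            numeric.append(col)
        else:
            nominal.append(col)
    return {
        "instances": len(rows),
        "attributes": len(columns),
        "numeric": numeric,
        "nominal": nominal,
        "missing_cells": missing,
    }


def meets_criteria(summary):
    """Apply the thresholds from item 1: size, attribute count, type mix."""
    return (
        summary["instances"] >= 1000
        and 15 <= summary["attributes"] <= 30
        and len(summary["numeric"]) > 0
        and len(summary["nominal"]) > 0
    )


def summarize_csv(path):
    """Read a CSV file with a header row and summarize it."""
    with open(path, newline="", encoding="utf-8") as handle:
        return summarize(list(csv.DictReader(handle)))
```

Note that a nominal attribute coded with integers (for example 0/1 labels) would be counted as numeric by this heuristic, so the attribute types should still be confirmed by hand.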
For each task below, answer using the WEKA tool.
Steps A2, A3 and A4 cover data understanding, preparation and reduction.
Phase B covers model development and evaluation.
Prepared by : Sofi M/SAR
ISP565/ITS665 2021
By default, each group is required to develop a classification model. Apply an algorithm under
the selected study using your dataset. Present the outcome of the project. Each member has to
elaborate on his/her role and contribution to the group work.
METHOD ACTIVITIES AND EVALUATION
4. Apply the reduction steps in A4. Report the reduction method that you have
applied.
5. Repeat steps 1-2 on the reduced datasets. Compare the results between the
full features/samples and the reduced ones.
6. Compare the evaluation results of the full dataset and of the reduced
dataset using a graph (Excel). Explain your results with the help of the
graph.
6. Repeat steps 1-3 on the reduced datasets. Compare the results between the full
features/samples and the reduced ones. Explain the differences between the generated
clusters.
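The full-versus-reduced comparison can be tabulated before it is charted in Excel. The sketch below is one possible way to do this, under stated assumptions: the metric names and the numbers are placeholders for illustration only, and `compare_results` and `to_csv` are hypothetical helper names. Real values should be copied from your own WEKA output.

```python
import csv
import io


def compare_results(full, reduced):
    """Pair up metrics present in both runs and compute the change."""
    rows = []
    for metric in full:
        if metric in reduced:
            rows.append({
                "metric": metric,
                "full": full[metric],
                "reduced": reduced[metric],
                "difference": round(reduced[metric] - full[metric], 4),
            })
    return rows


def to_csv(rows):
    """Render the comparison as CSV text that Excel can open and chart."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=["metric", "full", "reduced", "difference"],
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Placeholder numbers for illustration only; replace them with the values
# reported by WEKA for your full and reduced datasets.
full_run = {"accuracy": 0.80, "precision": 0.75}
reduced_run = {"accuracy": 0.82, "precision": 0.70}
print(to_csv(compare_results(full_run, reduced_run)))
```

Saving the printed text as a .csv file gives one row per metric, which maps directly onto a clustered bar chart with one series for the full dataset and one for the reduced dataset.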
[Figure: project workflow diagram. Recoverable labels: Dataset; Data Preprocessing/Preparation; Evaluation; Tree Description.]
1) Presentation slides: include all the results required in the question, and list the group
members with their pictures on the first slide
2) The complete dataset in .CSV format: the original dataset plus the preprocessed,
cleaned, normalized, reduced, train and test datasets, etc.
3) Model of the experiments (in WEKA format)
4) Articles for the project
5) Upload to Google Drive; for CS2434A/4B --> shorturl.at/mpuH3
• Name the folder using this format:
a. groupID_datafilename_leadername
b. e.g. CS2434A_soccerdata_ali
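The folder naming format above can be generated consistently with a small helper. This is only an illustrative sketch: `folder_name` is a hypothetical function, and stripping surrounding whitespace and rejecting empty parts are assumptions, not rules stated in the brief.

```python
def folder_name(group_id, data_filename, leader_name):
    """Join the three parts as groupID_datafilename_leadername."""
    parts = [group_id.strip(), data_filename.strip(), leader_name.strip()]
    # Assumption: every part must be non-empty for the name to be valid.
    if any(not part for part in parts):
        raise ValueError("group ID, data file name and leader name are all required")
    return "_".join(parts)


# Reproduces the example given in the brief.
print(folder_name("CS2434A", "soccerdata", "ali"))
```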
Lifelong learning criteria (marks: Lowest 1-2, 3-4, 5-6, 7-8, 9-10 Highest)
Dataset (CLO4-A3 / PLO7), 10%
a. Dataset references and description
   Lowest: The group provides no dataset references and is not able to describe the dataset
   Highest: The group provides dataset references and is able to describe the dataset
b. Appropriateness and relevance of references to task & dataset
   Lowest: The references are not related
   Highest: The references are indeed related to the dataset
Model Development criteria (marks: Lowest 1-2, 3-4, 5-6, 7-8, 9-10 Highest)
DATA PREPARATION (CLO3-C5 / PLO3), 15%
a. Identifying techniques for data preparation
   Lowest: Students unable to identify the appropriate techniques
   Highest: Students able to identify the appropriate techniques
b. Analysing the data preparation results
   Lowest: Student unable to analyse the results of data preparation
   Highest: Student able to analyse the results of data preparation
Model Development criteria (marks: Lowest 1-3, 4-6, 7-9, 10-12, 13-15 Highest)
MODEL DEVELOPMENT (part 1) (CLO3-C5 / PLO3), 10%
a. Applying the DM algorithms for model building
   Lowest: Students unable to apply the DM algorithm on full dataset
   Highest: Students able to apply the DM algorithm on full dataset
   Lowest: Students unable to apply the DM algorithm on reduced dataset
   Highest: Students able to apply the DM algorithm on reduced dataset
Model Development criteria (marks: Lowest 1-2, 3-4, 5-6, 7-8, 9-10 Highest)
MODEL DEVELOPMENT (part 2) (CLO1-C4 / PLO3), 5%
a. Evaluating DM models
   Lowest: Students unable to evaluate the DM models
   Highest: Students able to evaluate the DM models
Group number | Group Members | Student ID | Project Title | Dataset Link | Articles' Reference Link
CS2594A-1
CS2594A-2
CS2594A-3
CS2594B-1
CS2594B-2
CS2594C-1
CS2594C-1