Cse3054 - Data-Mining - Concepts-And-Techniques - Eth - 1.0 - 66 - Cse3054 - 61 Acp

CSE3054 Data Mining: Concepts and Techniques
L T P J C
3 0 0 4 4
Pre-requisite: Nil
Syllabus Version: 1.0
Course Objectives:
1. To introduce the fundamental processes of data warehousing and the major issues in data
mining.
2. To impart knowledge of various data mining concepts and techniques that can be
applied to text mining, web mining, etc.
3. To develop knowledge of the applications of data mining and its social
impacts.
Course Outcomes:
1. Interpret the contribution of data warehousing and data mining to decision-support
systems.
2. Prepare the data needed for data mining using preprocessing techniques.
3. Extract useful information from labeled data using various classifiers.
4. Group unlabeled data into clusters by applying various clustering algorithms.
5. Discover interesting patterns in large amounts of data using association rule mining.
6. Demonstrate the capacity to perform a self-directed piece of practical work that requires the
application of data mining techniques.
Student Learning Outcomes (SLO): 2,14,17
Module:1 Fundamental to Data Lake 6 hours
Different data repositories - Data warehouse - Data warehouse architecture: Multitiered
architecture - Data warehouse models - Extraction, Transformation, and Loading - Metadata
repository - Data warehouse modeling: Data cube and OLAP - Data warehouse design and usage
Module:2 Introduction to Data Mining 3 hours
Introduction to data mining - Data mining functionalities - Steps in the data mining process -
Classification of data mining systems - Major issues in data mining
Module:3 Data Wrangling and Preprocessing 5 hours
Data Preprocessing: An overview-Data cleaning-Data integration-Data reduction-Data
transformation and Data discretization
Module:4 Predictive Modeling 6 hours
General approach to classification - Decision tree induction - Bayes classification methods -
Advanced classification methods: Bayesian belief networks - Support Vector Machines -
Lazy learners
Module:5 Descriptive Modeling 8 hours
Types of data in cluster analysis-Partitioning methods- Hierarchical methods-Advanced cluster
analysis: Probabilistic model-based clustering- Clustering high-dimensional data-Outlier analysis
Module:6 Discovering Patterns and Rules 7 hours
Frequent Pattern Mining: Basic Concepts and a Road Map - Efficient and scalable frequent
itemset mining methods: Apriori algorithm, FP-Growth algorithm - Mining frequent itemsets using
vertical data format - Mining closed and max patterns - Advanced Pattern Mining: Pattern Mining
in Multilevel, Multidimensional Space
Module:7 Data Mining Trends and Research Frontiers 8 hours
Other methodologies of data mining: Web mining-Temporal mining-Spatial mining-Statistical
data mining- Visual and audio data mining- Data mining applications- Data mining and society:
Ubiquitous and invisible data mining- Privacy, Security, and Social Impacts of data mining
Module:8 Recent Trends 2 hours
Total Lecture hours: 45 hours

Proceedings of the 61st Meeting of the Academic Council [18.02.2021] 234


Text Book(s)
1. Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, Morgan
Kaufmann Publishers, third edition, 2013
2. Pang-Ning Tan, Michael Steinbach, Anuj Karpatne, Vipin Kumar, Introduction to Data
Mining, second edition, Pearson, 2019
Reference Books
1. Ian H. Witten, Eibe Frank and Mark A. Hall, Data Mining: Practical Machine Learning Tools
and Techniques, third edition, 2017
2. Alex Berson and Stephen J. Smith, Data Warehousing, Data Mining & OLAP, Tata
McGraw Hill Edition, Tenth Reprint, 2008
3. Hand, D., Mannila, H. and Smyth, P., Principles of Data Mining, MIT Press, Massachusetts,
third edition, Pearson, 2013
Mode of Evaluation: CAT / Assignment / Quiz / FAT / Project / Seminar
Project Component:
Students should identify a problem to address using data mining concepts. The goal is to
select appropriate techniques and model specifications and apply the respective methods to
extract knowledge related to the real-world problem. Students will identify the potential use of
data mining techniques, formulate the problem, identify the right sources of data, preprocess
the data, and prescribe actions that improve both the decision-making process and the
outcome of decisions. Students may use any data mining tool to arrive at better business decisions.
Recommended by Board of Studies 11-02-2021
Approved by Academic Council No. 61 Date 18-02-2021
