Professional Documents
Culture Documents
Cse3054 - Data-Mining - Concepts-And-Techniques - Eth - 1.0 - 66 - Cse3054 - 61 Acp
Cse3054 - Data-Mining - Concepts-And-Techniques - Eth - 1.0 - 66 - Cse3054 - 61 Acp
Cse3054 - Data-Mining - Concepts-And-Techniques - Eth - 1.0 - 66 - Cse3054 - 61 Acp
3 0 0 4 4
Pre-requisite Nil Syllabus Version
1.0
Course Objectives:
1. To introduce the fundamental processes data warehousing and major issues in data
mining
2. To impart the knowledge on various data mining concepts and techniques that can be
applied to text mining, web mining etc.
3. To develop the knowledge for application of data mining and social impacts of data
mining.
Course Outcome:
1. Interpret the contribution of data warehousing and data mining to the decision-support
systems.
2. Prepare the data needed for data mining using preprocessing techniques.
3. Extract useful information from the labeled data using various classifiers.
4. Compile unlabeled data into clusters applying various clustering algorithms.
5. Discover interesting patterns from large amounts of data using Association Rule Mining
6. Demonstrate capacity to perform a self-directed piece of practical work that requires the
application of data mining techniques.
Student Learning Outcomes (SLO): 2,14,17
Module:1 Fundamental to Data Lake 6 hours
Different data repositories- Data warehouse- Data warehouse architecture: Multitiered
Architecture-Data warehouse models - Extraction, Transformation, and Loading- Metadata
repository - Data warehouse modeling: Data cube and OLAP-Data warehouse design and usage
Module:2 Introduction to Data Mining 3 hours
Introduction to data mining-Data mining functionalities-Steps in data mining process-
Classification of data mining systems-Major issues in data mining
Module:3 Data Wrangling and Preprocessing 5 hours
Data Preprocessing: An overview-Data cleaning-Data integration-Data reduction-Data
transformation and Data discretization
Module:4 Predictive Modeling 6 hours
General approach to classification-Decision tree induction- Bayes classification methods-
advanced classification methods: Bayesian belief networks- -
Support Vector Machines-Lazy learners
Module:5 Descriptive Modeling 8 hours
Types of data in cluster analysis-Partitioning methods- Hierarchical methods-Advanced cluster
analysis: Probabilistic model-based clustering- Clustering high-dimensional data-Outlier analysis
Module:6 Discovering Patterns and Rules 7 hours
Frequent Pattern Mining: Basic Concepts and a Road Map - Efficient and scalable frequent item
set mining methods: Apriori algorithm, FP-Growth algorithm- Mining frequent itemsets using
vertical data format- Mining closed and max patterns- Advanced Pattern Mining: Pattern Mining
in Multilevel, Multidimensional Space
Module:7 Data Mining Trends and Research Frontiers 8 hours
Other methodologies of data mining: Web mining-Temporal mining-Spatial mining-Statistical
data mining- Visual and audio data mining- Data mining applications- Data mining and society:
Ubiquitous and invisible data mining- Privacy, Security, and Social Impacts of data mining
Module:8 Recent Trends 2 hours
Total Lecture hours: 45 hours