Professional Documents
Culture Documents
DMDW
DMDW
DMDW
Practical No. 1
Aim: Perform different operations of extraction, transformation, and loading (ETL) processes on
a sample dataset using PowerBI.
Steps:
1. Start “Power BI Desktop”.
2. Click On “Get Data” On “Home Menu”.
3. Select “Excel” and “Connect”.
4. Select a File(eg.: Student.CSV) and Load a file.
5. Now, Transform the data. (Home → Edit Queries → Transform).
6. Select the row “Student Id” And do “Replace Value” (put Value to find: 1 & Replace
with: 101).
Before Replacing:-
After Replacing:-
7. Select the row “Age” And do “Remove Duplicates” (Remove Rows → Remove
Duplicate).
Before Remove:-
After Remove:-
After Extraction:
Practical No. 2
Aim: Integrate data from multiple sources by merging and transforming datasets using Python's
pandas library and data manipulation techniques.
Steps:
Step 1: Open Jupyter
Practical No. 3
Aim: Apply feature selection techniques like variance thresholding and correlation analysis
using python’s scikit-learn library to reduce dimensionality in a dataset.
Steps:
Practical No. 4
Aim: Build a decision tree classifier using python’s scikit learn library to predict customer churn
based on historical data.
Steps:
Practical No. 5
Aim: Implement Naive Bayes classifier in python using scikit learn to classify emails as spam
or non spam based on their content.
Steps:
Practical No. 6
Aim: Implement a linear regression method to make predictions based on the sample data set
using python.
Steps:
Practical No.7
Aim: Implement logistic regression method to make prediction based on the sample data set
using python
Steps:
Practical No.8
Aim: Implement K-means clustering algorithm in python using scikit learn to group customers
based on their purchasing behaviour.
Steps:
Practical No.9
Aim: Implement the Apriori algorithm in Python to mine frequent itemset from a retail
transaction dataset and extract association rules.
Steps: