University of Science and Technology

School of computational science and artificial intelligence

Data governance

Quality control sheet

1. Write code to identify and remove duplicate rows from a dataset.
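
One possible approach, sketched with pandas. The sample frame and its column names are made up for illustration; only `duplicated()` and `drop_duplicates()` matter here.

```python
import pandas as pd

# Hypothetical sample data: the last row repeats the second row exactly.
df = pd.DataFrame({
    "id": [1, 2, 3, 2],
    "city": ["Cairo", "Giza", "Luxor", "Giza"],
})

# duplicated() flags every row that repeats an earlier row.
duplicate_mask = df.duplicated()
print(df[duplicate_mask])

# drop_duplicates() keeps the first occurrence of each row by default.
deduped = df.drop_duplicates().reset_index(drop=True)
print(deduped)
```

Passing `subset=[...]` to either call restricts the comparison to selected columns, which may be closer to what a given dataset needs.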

2. Write code to convert the data type of “column 1” in a dataset from strings to numeric
values.
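
A minimal sketch using `pd.to_numeric`. The sample values are hypothetical, and coercing unparseable strings to NaN is an assumption about how bad values should be handled.

```python
import pandas as pd

# Hypothetical data: numbers stored as strings, plus one bad value.
df = pd.DataFrame({"column 1": ["10", "20.5", "abc", "40"]})

# errors="coerce" turns strings that cannot be parsed into NaN
# instead of raising an exception.
df["column 1"] = pd.to_numeric(df["column 1"], errors="coerce")

print(df.dtypes)
print(df)
```

If every value is known to be clean, `df["column 1"].astype(float)` is a stricter alternative that raises on bad input.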

3. You have a dataset with a “color” column whose categorical values are encoded as
strings. You want to convert these categorical values into numeric values using one-hot
encoding. Write a code snippet using sklearn to perform one-hot encoding on the dataset.

4. You have a dataset with a “color” column whose categorical values are encoded as
strings. You want to convert these categorical values into numeric values using label encoding.
Write a code snippet using sklearn to perform label encoding on the dataset.
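
A sketch with scikit-learn's `LabelEncoder` on the same kind of made-up “color” column. The output column name `color_encoded` is an illustrative choice.

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

# Hypothetical sample data.
df = pd.DataFrame({"color": ["red", "green", "blue", "green"]})

encoder = LabelEncoder()
# Each distinct string is mapped to an integer; classes are sorted alphabetically.
df["color_encoded"] = encoder.fit_transform(df["color"])

print(df)
print(list(encoder.classes_))
```

Note that label encoding imposes an arbitrary order on the categories, which some models may interpret as meaningful.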

5. Write code to scale data to a common range for use in a linear regression that needs
the data normalized to the range 0 to 1.
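
One way to do this is scikit-learn's `MinMaxScaler`, whose default output range is 0 to 1. The feature names and values below are hypothetical.

```python
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

# Hypothetical numeric features on different scales.
df = pd.DataFrame({
    "feature_a": [10, 20, 30, 40],
    "feature_b": [1.0, 2.0, 3.0, 5.0],
})

scaler = MinMaxScaler()  # default feature_range is (0, 1)
scaled_df = pd.DataFrame(
    scaler.fit_transform(df),
    columns=df.columns,
    index=df.index,
)
print(scaled_df)
```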

6. Write code to scale data to a common range for use in a linear regression that needs
the data centered around 0.
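
A sketch using scikit-learn's `StandardScaler`, which rescales each column to zero mean and unit variance. Reading "normalized around 0" as standardization is an assumption, and the sample data is made up.

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Hypothetical numeric features on different scales.
df = pd.DataFrame({
    "feature_a": [10, 20, 30, 40],
    "feature_b": [1.0, 2.0, 3.0, 5.0],
})

scaler = StandardScaler()
scaled_df = pd.DataFrame(
    scaler.fit_transform(df),
    columns=df.columns,
    index=df.index,
)

# Each column now has mean ~0 and (population) standard deviation ~1.
print(scaled_df.mean())
print(scaled_df.std(ddof=0))
```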

7. Write code to scale data that includes outliers to a common range using a
suitable scaler.
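
scikit-learn's `RobustScaler` is one candidate: it centers on the median and divides by the interquartile range, so a few extreme values have less influence on the scaling than they would with min-max or standard scaling. The data below is hypothetical.

```python
import pandas as pd
from sklearn.preprocessing import RobustScaler

# Hypothetical data with one obvious outlier (100).
df = pd.DataFrame({"value": [1, 2, 3, 4, 100]})

scaler = RobustScaler()  # uses the median and the 25th-75th percentile range
scaled_df = pd.DataFrame(
    scaler.fit_transform(df),
    columns=df.columns,
    index=df.index,
)
print(scaled_df)
```

The outlier is still present after scaling; it simply no longer compresses the other values into a narrow band.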

8. Write Python code that inserts a new column into a dataframe, categorizing people based on
the "age" column as follows: age less than 11 is categorized as "child", age between 11 and 20
(inclusive) is categorized as "teenager", and age greater than 20 is categorized as "adult".

Sample data:

data = {'Name': ['John', 'Emma', 'Ryan', 'Sophia'],
        'Age': [8, 15, 28, 35]}
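
A sketch using `numpy.select` on the sample data. The sample column is spelled 'Age', so the code uses that spelling; the new column name `Category` is an illustrative choice.

```python
import numpy as np
import pandas as pd

data = {'Name': ['John', 'Emma', 'Ryan', 'Sophia'],
        'Age': [8, 15, 28, 35]}
df = pd.DataFrame(data)

# Conditions are evaluated in order; the first match wins.
conditions = [
    df['Age'] < 11,   # child
    df['Age'] <= 20,  # teenager (11 to 20 inclusive, since < 11 matched above)
]
labels = ['child', 'teenager']

# Anything that matches neither condition is older than 20.
df['Category'] = np.select(conditions, labels, default='adult')
print(df)
```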

9. Write Python code that adds a column to a dataset containing the sum of the values of
three columns plus 20% tax.
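
A sketch under the assumption that the tax applies to the sum of the three columns. The column names (`price`, `shipping`, `service_fee`, `total_with_tax`) and values are hypothetical, since the question does not name them.

```python
import pandas as pd

# Hypothetical dataset with three numeric columns.
df = pd.DataFrame({
    "price": [100.0, 250.0],
    "shipping": [10.0, 20.0],
    "service_fee": [5.0, 30.0],
})

TAX_RATE = 0.20  # 20% tax

# Sum the three columns row by row, then add the tax on top.
subtotal = df[["price", "shipping", "service_fee"]].sum(axis=1)
df["total_with_tax"] = subtotal * (1 + TAX_RATE)
print(df)
```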

10. Write Python code to merge the “data1” and “data2” dataframes.

data1 = {'Name': ['John', 'Jane', 'Mike'],
         'Age': [25, 30, 35]}

data2 = {'Name': ['Sarah', 'David'],
         'Age': [28, 32]}
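
Both dictionaries have the same columns and describe different people, so this sketch reads "merge" as stacking the rows with `pd.concat`. That reading is an assumption; a key-based join with `pd.merge` would apply if the frames shared rows to match on.

```python
import pandas as pd

data1 = {'Name': ['John', 'Jane', 'Mike'],
         'Age': [25, 30, 35]}
data2 = {'Name': ['Sarah', 'David'],
         'Age': [28, 32]}

df1 = pd.DataFrame(data1)
df2 = pd.DataFrame(data2)

# ignore_index=True renumbers the rows 0..4 instead of repeating 0, 1, 2, 0, 1.
combined = pd.concat([df1, df2], ignore_index=True)
print(combined)
```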

11. Consider a scenario where you have a large data frame that needs to be validated against a
schema using Pandera in Python. Explain how you can leverage the lazy=True parameter in
Pandera to optimize the validation process.
Provide an example code snippet demonstrating the usage of lazy=True in conjunction with
DataFrame schema validation.
Assume the data frame is loaded as follows:
data = pd.read_csv('data.csv')

12. You have been given a dataset containing customer information, including their names,
ages, and email addresses.
Write a Python code snippet to perform data validation and profiling on this dataset using
Pandas and Pandera.
Data Validation:
a. Validate that the 'name' column contains only string values and does not have any missing or
null values.
b. Validate that the 'age' column contains only integer values and falls within a specific range
(18 to 65).
Data Profiling:
a. Calculate and display basic statistics for the 'age' column, such as minimum, maximum,
mean, and standard deviation.
b. Count and display the number of unique values in the 'name' column.

13. You are working with a dataset containing information about customers' preferences for
outdoor activities. The dataset includes columns for 'Customer ID', 'Age', 'Activity Type', and
'Rating'. Your task is to use Great Expectations to validate the data and ensure that the 'Rating'
column values are within a specific range based on the 'Activity Type'. Write Python code to
load the dataset, define custom expectations for rating ranges based on activity types, apply
these expectations, and print the validation result.
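
The Great Expectations API differs considerably between releases, so the sketch below does not call it. It only illustrates, in plain pandas, the per-activity range rule that a custom expectation would need to encode. The activity names, rating ranges, and sample rows are all hypothetical.

```python
import pandas as pd

# Hypothetical dataset with the columns named in the question.
df = pd.DataFrame({
    "Customer ID": [1, 2, 3, 4],
    "Age": [23, 35, 41, 29],
    "Activity Type": ["Hiking", "Camping", "Hiking", "Fishing"],
    "Rating": [4, 9, 7, 3],
})

# Hypothetical allowed [min, max] rating for each activity type.
rating_ranges = {"Hiking": [1, 5], "Camping": [1, 10], "Fishing": [1, 5]}
ranges = pd.DataFrame.from_dict(
    rating_ranges, orient="index", columns=["min_rating", "max_rating"]
)

# Attach each row's allowed range, then test the rating against it.
checked = df.join(ranges, on="Activity Type")
checked["rating_ok"] = checked["Rating"].between(
    checked["min_rating"], checked["max_rating"]
)

print(checked[["Activity Type", "Rating", "rating_ok"]])
print("All ratings valid:", bool(checked["rating_ok"].all()))
```

In Great Expectations itself, the same rule could plausibly be expressed as one range expectation per activity type, applied to the rows of that type.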
