Welcome to Scribd!

Eda 70 Marks Set 2 Exampaper

Uploaded by

0% found this document useful (0 votes)

85 views3 pages

This document provides instructions and questions for an exploratory data analysis examination consisting of 3 sections worth a total of 70 marks. Section A is worth 20 marks and includes questions to clean and manipulate various datasets. Section B is also worth 20 marks and focuses on outlier detection and data cleaning. Section C is the largest section worth 30 marks, asking questions about relationships in datasets and transformations.

Original Description:

Original Title

EDA_70_MARKS_SET_2_EXAMPAPER

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

85 views3 pages

Eda 70 Marks Set 2 Exampaper

Uploaded by

Roshan Kumar

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 3

Search inside document

EXPLORATORY DATA ANALYSIS

TOTAL MARKS:70 DURATION: 3 HOURS

Instructions
1. Candidates should answer all the questions in the same order provided in the question paper.
2. Any activity that compromises the integrity of the examination will not be permitted.
3. Candidates should complete the examination within the provided timeline.
4. Candidates are expected to check and ensure that the correct answer file (in. ipynb format) is
uploaded in LMS.

SECTION A: 20 MARKS

Q1. Read the file 'Automobile_data.csv' and answer the following questions: (5Marks)
A. For the Dataset given below. Write a code to remove Hyphen (-) and change the datatype of the
column as numeric? (2 Marks)

B. For the Dataset given below. Write a code to Convert 'N' Category as 0 and 'P' category as 1 for the
Shortlisted Column? (1 Mark)
EXPLORATORY DATA ANALYSIS

C. For the Dataset given below. Create a calculated field Male Ratio which calculates the ratio of Male
Population to the total population? (2 Marks)

Q2. Read the dataset (German Credit Data.csv) and answer the questions below (5 Marks)
A. Draw the Count Plot for the 'Status' Column? (1 Marks)
B. Split the Dataset into Train and Test. Also give us the reason behind your split (2 Marks)
C. Is the Data imbalanced? If so what types of sampling methods can be used and write the code for
any one type of sampling (No need to execute)? (2 Marks)
Q3. Read the dataset (German Credit Data.csv) answer the questions below (5 Marks)
A. Draw the Count Plot for 'Checkin_acc' Column? (1 Marks)
EXPLORATORY DATA ANALYSIS

B. How does the distribution of 'Age' column look like and perform the test of Normality? (2 Marks)
C. How do you handle object variables? Write down the code for encoding? (2 Marks)
Q4. Read the dataset(bank.csv) answer the questions below (5 Marks)
A. Check for Null Values? (1 Marks)
B. Treat the Null values and also the reason for the method used (2 Marks)
C. Check the spellings in the dataframe and treat them accordingly? (2 Marks)
SECTION B: 20 MARKS
Q5. Read the dataset(beer.csv) answer the questions below (10 marks)
A. Check for outliers and how to treat them? (5 Marks)
B. Check the spelling of the brands by removing the alphanumeric value? (5 Marks)

Q6. Read the dataset (IPL.csv) answer the questions below (10 marks)
A. Which player got the maximum premium (Price) on the base price and What is the average SOLD
PRICE for each 'age' category? (5 Marks)
B. What are the outliers in Sold Price? Filter out the outliers and display the Name of the player, sold
price and their Playing role and Who are the highest sold players? (5 Marks)

SECTION C: 30 MARKS
Q7. Read the dataset(bollywood.csv) answer the questions below. (15 marks)
A. Is there any relationship between Genre and Release time? (5 Marks)
B. Which movie got the highest profit and which genre of movie has the highest budget? (5 Marks)
C. Which year has the highest box office collection (5 Marks)

Q8. Read the dataset (GLAXO.csv) answer the questions below. (15 marks)
A. Create new columns by splitting the date column into Day, Month and Year? (5 Marks)
B. What was the highest daily swing in the price? (5 Marks) Hint: Price High - Price Low = Daily Swing
C. Check the distribution of the close price? What type of transformation can be applied? (5 Marks)

How to Reach the 9.0 in IELTS Academic Writing
From Everand
How to Reach the 9.0 in IELTS Academic Writing
IELTS Medical
Rating: 4 out of 5 stars
4/5 (9)
The ASQ Certified Six Sigma Yellow Belt Study Guide
From Everand
The ASQ Certified Six Sigma Yellow Belt Study Guide
Erica L. Farmer
No ratings yet
MAST 6474 Introduction To Data Analysis I MAST 6478 Data Analytics
Document4 pages
MAST 6474 Introduction To Data Analysis I MAST 6478 Data Analytics
Mygen
No ratings yet
Data Management Grade 7 Test
Document4 pages
Data Management Grade 7 Test
S P
100% (2)
Cqe Sample Exam
Document16 pages
Cqe Sample Exam
Alireza Radfarma
100% (2)
Olympiad Sample Paper 2: Useful for Olympiad conducted at School, National & International levels
From Everand
Olympiad Sample Paper 2: Useful for Olympiad conducted at School, National & International levels
Editorial Board
Rating: 5 out of 5 stars
5/5 (4)
LSAT PrepTest 81 Unlocked: Exclusive Data, Analysis & Explanations for the June 2017 LSAT
From Everand
LSAT PrepTest 81 Unlocked: Exclusive Data, Analysis & Explanations for the June 2017 LSAT
Kaplan Test Prep
No ratings yet
NPV 70 Marks Set 2
Document4 pages
NPV 70 Marks Set 2
Roshan Kumar
No ratings yet
Password Policy
Document4 pages
Password Policy
noorulathar
100% (2)
Abe 412-Irrigation and Drainage Engineering Laboratory Exercise No. 5
Document12 pages
Abe 412-Irrigation and Drainage Engineering Laboratory Exercise No. 5
Jan James Graza
No ratings yet
201 1ST Ass With Answers
Document19 pages
201 1ST Ass With Answers
Lyn Abuda
No ratings yet
Worksheet No. 5
Document2 pages
Worksheet No. 5
Xie Zhen Wu
No ratings yet
Usl 70 Marks Set 1
Document2 pages
Usl 70 Marks Set 1
Roshan Kumar
No ratings yet
Sas (Bas303)
Document5 pages
Sas (Bas303)
Apurva Narayan
No ratings yet
HANDOUTrm
Document9 pages
HANDOUTrm
deepti_sahoo_3
No ratings yet
Pcalc II-i Naveed Resit-2 Sem 1 2019-2020 2
Document6 pages
Pcalc II-i Naveed Resit-2 Sem 1 2019-2020 2
Mojahed Yahya
No ratings yet
ST Nov19
Document5 pages
ST Nov19
amaan shaikh
No ratings yet
AI Question Bank
Document4 pages
AI Question Bank
kiran
No ratings yet
Practice Paper 3
Document37 pages
Practice Paper 3
omaricmt
No ratings yet
ST Nov 18
Document4 pages
ST Nov 18
amaan shaikh
No ratings yet
2017A FE AM Question
Document30 pages
2017A FE AM Question
Shah Alam
No ratings yet
Om
Document7 pages
Om
Bishnu Ram Ghimire
No ratings yet
BCA 3rd Sem. Ass.2018-19
Document9 pages
BCA 3rd Sem. Ass.2018-19
Kumar Info
No ratings yet
Iseb SWT2
Document56 pages
Iseb SWT2
bmcurran
No ratings yet
Foundation Sample Questionsbyk Value
Document6 pages
Foundation Sample Questionsbyk Value
suhamadi
No ratings yet
ST April19
Document4 pages
ST April19
amaan shaikh
No ratings yet
IMS Third Assignment PHD Coursework
Document7 pages
IMS Third Assignment PHD Coursework
ragvij
No ratings yet
Big Data Analysis On ML Main Points
Document5 pages
Big Data Analysis On ML Main Points
Thomas Wondwosen
No ratings yet
M. Tech. Semester - I: Data Mining and Data Warehousing (MCSSE 104)
Document10 pages
M. Tech. Semester - I: Data Mining and Data Warehousing (MCSSE 104)
saurabh1116
No ratings yet
Individual Assignment (50 Marks) : STA104/QMT181 Introduction To Statistics
Document2 pages
Individual Assignment (50 Marks) : STA104/QMT181 Introduction To Statistics
Farah Husna
No ratings yet
University Examinations Examination For January/April 2015/2016 For Diploma in Computer Science
Document2 pages
University Examinations Examination For January/April 2015/2016 For Diploma in Computer Science
Collin Sambu
No ratings yet
Mca 5th Sem Assign
Document16 pages
Mca 5th Sem Assign
Vikas Gupta
No ratings yet
Cog 645
Document25 pages
Cog 645
Zin Maung
No ratings yet
Exam1 - Sample Questions
Document5 pages
Exam1 - Sample Questions
Yusuf Çubuk
No ratings yet
Stat Homework
Document4 pages
Stat Homework
Eph
No ratings yet
Class 10 - Computer Applications (FIT) - Mock Test I - 25.11.23
Document4 pages
Class 10 - Computer Applications (FIT) - Mock Test I - 25.11.23
Saptak Roy
No ratings yet
Question of Assignment
Document17 pages
Question of Assignment
Vishal Gupta
No ratings yet
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
Management Programme: Term-End Examination 1-June, 2015 Ms-5: Management of Machines and Materials
Document2 pages
Management Programme: Term-End Examination 1-June, 2015 Ms-5: Management of Machines and Materials
debaditya_hit326634
No ratings yet
CSE Final Exam Solutions
Document11 pages
CSE Final Exam Solutions
Josh Wein
No ratings yet
14 PHDRM
Document1 page
14 PHDRM
Arunkuma81
100% (1)
Week 1 Checkpoint Answer Template: Chapters 8 and 9.1 - 9.2
Document24 pages
Week 1 Checkpoint Answer Template: Chapters 8 and 9.1 - 9.2
branmuffin482
No ratings yet
SAMPLE PAPER-1 (Solved) Class - XI General Instructions
Document3 pages
SAMPLE PAPER-1 (Solved) Class - XI General Instructions
Sanjay Stark
No ratings yet
Ics 2210 System Analysis and Design 2
Document3 pages
Ics 2210 System Analysis and Design 2
123 321
No ratings yet
Syllabus of Apicet - 2016 Entrance Test
Document7 pages
Syllabus of Apicet - 2016 Entrance Test
anon_495268615
No ratings yet
B.SC., (IT) Examinations, May/June 2008 (Scheme: CBCS) Bridge Course
Document53 pages
B.SC., (IT) Examinations, May/June 2008 (Scheme: CBCS) Bridge Course
Ashwani Dayal
No ratings yet
(Fall 2011) CS-402 Data Mining - Final Exam-SUB - v03
Document6 pages
(Fall 2011) CS-402 Data Mining - Final Exam-SUB - v03
taaloos
No ratings yet
Bca Rs 3rd Sem July 2013 CRC Assignments
Document12 pages
Bca Rs 3rd Sem July 2013 CRC Assignments
Md Quasim
No ratings yet
(WWW - Entrance-Exam - Net) - PTU MCA 3rd Semester Sample Paper 18
Document2 pages
(WWW - Entrance-Exam - Net) - PTU MCA 3rd Semester Sample Paper 18
Mangesh Malvankar
No ratings yet
SLC 70 Marks Set 1
Document3 pages
SLC 70 Marks Set 1
Roshan Kumar
No ratings yet
Raw Material Requirements Per Unit of Given Model I II III
Document4 pages
Raw Material Requirements Per Unit of Given Model I II III
manveen kaur
No ratings yet
Artificial Inteligence & Expert Systems
Document4 pages
Artificial Inteligence & Expert Systems
Miraculous Miracle
No ratings yet
7402E002 - Essentials of IT
Document36 pages
7402E002 - Essentials of IT
APURV UPADHYAY
No ratings yet
Time Allowed: Three Hours 5 January 2018, 9AM-12PM: Instructions To Candidates
Document3 pages
Time Allowed: Three Hours 5 January 2018, 9AM-12PM: Instructions To Candidates
muthu rangi
No ratings yet
ADL 10 Marketing Research V4
Document22 pages
ADL 10 Marketing Research V4
solvedcare
No ratings yet
Master of Computer Applications (MCA) : Assignments JANUARY 2012
Document14 pages
Master of Computer Applications (MCA) : Assignments JANUARY 2012
Subramanyam Pillalamarri
No ratings yet
4:15pm On 28 OCT 2020 (Malaysia Time)
Document5 pages
4:15pm On 28 OCT 2020 (Malaysia Time)
anderson
No ratings yet
Higher Data Structures and Algorithms This Is A Sample Only Time Allowed: 3 Hours Total Marks: 100 Number of Parts: 5
Document9 pages
Higher Data Structures and Algorithms This Is A Sample Only Time Allowed: 3 Hours Total Marks: 100 Number of Parts: 5
BilboBagginses
No ratings yet
BUSI2045 Midterm Questions 2024 Spring
Document10 pages
BUSI2045 Midterm Questions 2024 Spring
rinniechan630
No ratings yet
Diploma Quiz 2 PYQ 4 ?
Document224 pages
Diploma Quiz 2 PYQ 4 ?
Soumyak Dutta
No ratings yet
Tutorial Questions - Msu07401
Document2 pages
Tutorial Questions - Msu07401
Omari Mauga
No ratings yet
SQL Ass
Document4 pages
SQL Ass
YAP JIA LING
No ratings yet
PMP Question Bank
From Everand
PMP Question Bank
Mohammad Usmani
Rating: 4 out of 5 stars
4/5 (34)
AP Statistics Flashcards, Fifth Edition: Up-to-Date Practice
From Everand
AP Statistics Flashcards, Fifth Edition: Up-to-Date Practice
Martin Sternstein
No ratings yet
PMI-ACP Exam Insights: Q&A with Explanations
From Everand
PMI-ACP Exam Insights: Q&A with Explanations
SUJAN
No ratings yet
United India Insurance Company Limited
Document11 pages
United India Insurance Company Limited
Roshan Kumar
No ratings yet
Statement of Account: Date Narration Chq./Ref - No. Value DT Withdrawal Amt. Deposit Amt. Closing Balance
Document5 pages
Statement of Account: Date Narration Chq./Ref - No. Value DT Withdrawal Amt. Deposit Amt. Closing Balance
Roshan Kumar
No ratings yet
Interim Report Group 01 PDF
Document20 pages
Interim Report Group 01 PDF
Roshan Kumar
No ratings yet
Usl 70 Marks Set 1
Document2 pages
Usl 70 Marks Set 1
Roshan Kumar
No ratings yet
SLC 70 Marks Set 1
Document3 pages
SLC 70 Marks Set 1
Roshan Kumar
No ratings yet
Substantive Proceduress
Document8 pages
Substantive Proceduress
ayyazm
No ratings yet
Lecture 3
Document44 pages
Lecture 3
Quynh Trang Dinh
No ratings yet
Control Your Advantages With Unimac Washers: Features Unilinc™ T-Series M30 M9 M4 Ux P-Series
Document2 pages
Control Your Advantages With Unimac Washers: Features Unilinc™ T-Series M30 M9 M4 Ux P-Series
AdrianaMtzR
No ratings yet
06 US V ANG TANG HO Digest With Full Case
Document14 pages
06 US V ANG TANG HO Digest With Full Case
Thirdy Demonteverde
No ratings yet
CRW85218 CRW85218 Malaysia English OIC-EH Oilfield 1016978
Document13 pages
CRW85218 CRW85218 Malaysia English OIC-EH Oilfield 1016978
Yong Lin Albon Tiong
No ratings yet
Carboguard 890 PDS
Document2 pages
Carboguard 890 PDS
Khemaraj Path
No ratings yet
Email Etiquette
Document27 pages
Email Etiquette
Sangesh Nattamai
No ratings yet
Thermocompressor Efficiency & Performance
Document4 pages
Thermocompressor Efficiency & Performance
rifqizafril
No ratings yet
Share Go Director Raftaar Training Module Jan 2023
Document64 pages
Share Go Director Raftaar Training Module Jan 2023
Shravan Khilledar
100% (1)
My Personal Learning Goals
Document2 pages
My Personal Learning Goals
Ava Halloran
No ratings yet
At-The-Restaurant A2
Document4 pages
At-The-Restaurant A2
Mjn Abbasi
No ratings yet
English Ki Tooti Hui Tang
Document2 pages
English Ki Tooti Hui Tang
Gaurav Pandey
No ratings yet
BH120F - Royal Cruiser
Document6 pages
BH120F - Royal Cruiser
Philippine Bus Enthusiasts Society
No ratings yet
Certificate of Annual Inspection
Document1 page
Certificate of Annual Inspection
Jc Jüsäyän
No ratings yet
Ijert Ijert: Design of 6 Bit Vedic Multiplier Using Vedic Sutra
Document8 pages
Ijert Ijert: Design of 6 Bit Vedic Multiplier Using Vedic Sutra
erparveenkaur86
No ratings yet
B737 B2 TNA Jul Revised
Document12 pages
B737 B2 TNA Jul Revised
uspaulrussel
No ratings yet
Drift Analysis For Lateral Stability PDF
Document5 pages
Drift Analysis For Lateral Stability PDF
adriano4850
No ratings yet
The Path Is Easier: With Heat Solutions From Kalsec
Document21 pages
The Path Is Easier: With Heat Solutions From Kalsec
Andres Ruiz
No ratings yet
Preparing Your Pathway To Australia
Document14 pages
Preparing Your Pathway To Australia
KRISTIANTO
No ratings yet
EEN 443: Power Distribution Research Assignment: Mohammed Riad Elchadli Mohammed Shiful Maged Al Sharafi
Document4 pages
EEN 443: Power Distribution Research Assignment: Mohammed Riad Elchadli Mohammed Shiful Maged Al Sharafi
Mohammed Shiful
No ratings yet
Ali CV
Document1 page
Ali CV
Ali Eid
No ratings yet
Software Verification: Methodology
Document25 pages
Software Verification: Methodology
andrew_hm925635
No ratings yet
GK Class 8
Document2 pages
GK Class 8
HARSH RANJAN
No ratings yet
Quiz 20 Past Quizzes
Document23 pages
Quiz 20 Past Quizzes
Sarah Joy Corpuz-Cabasag
100% (1)
Platinum Weekly - 03 March 2023 - Rustenburg Newspaper
Document56 pages
Platinum Weekly - 03 March 2023 - Rustenburg Newspaper
Sarah Lombard
No ratings yet
Astm A335 PDF
Document11 pages
Astm A335 PDF
JACILDO SOARES CAVALCANTE
100% (1)