Professional Documents
Culture Documents
Evidence Sheet: Chapter 2: Data & Analysis
Evidence Sheet: Chapter 2: Data & Analysis
Write the Python code to load the data from “palmerpenguins” package. Copy and paste your code in the space
below and a screenshot of the output
GC2.3 Activity 2.4.1
Research and Identify the purpose of each of the following libraries used in Python
Library Purpose
Pandas is mainly used for data analysis. Pandas allows importing data from
Pandas
various file formats such as CSV and Microsoft Excel. Pandas allows various data
manipulation operations such as merging, reshaping, selecting, as well as data
cleaning, and data wrangling features
NumPy is the fundamental package for scientific computing in Python. It is a library that
NumPy provides a multidimensional array object, various derived objects (such as masked
arrays and matrices), and an assortment of routines for fast operations on arrays,
including mathematical, logical, shape manipulation, sorting, selecting, I/O, basic linear
algebra, basic statistical operations, random simulation and much more
GC2.3 Activity 2.4.1
Research and Identify the purpose of each of the following libraries used in Python
Library Purpose
Scikit-learn is probably the most useful library for machine learning in Python.
Sickit Learn The sklearn library contains a lot of efficient tools for machine learning and
statistical modeling including classification, regression, clustering and
dimensionality reduction
Matplotlib
Matplotlib is a comprehensive library for creating static, animated, and interactive
visualizations in Python. Matplotlib is a cross-platform, data visualization and
Seaborn graphical plotting library for Python and its numerical extension NumPy
Use the dataset to build and train KNN mode to predict if the patient is suffering from a heart disease or not.