Python For Data Science


Python for data science

Santhosh kumar K P
Assistant Professor
Dept. of Computer Science
Sacred Heart College, Kochi
Course contents
• Why Python for data analysis
• IPython basics and Jupyter Notebooks
• NumPy Basics, Universal Functions, Data
Processing Using Arrays: Mathematical and
Statistical Methods
• Introduction to pandas Data Structures,
Essential Functionality, Summarizing and
Computing Descriptive Statistics
• Introduction to deep learning
Why Python for data analysis
• Python is an open-source, interpreted, high-level,
object-oriented programming language.
• Data scientists use Python for a wide range of data science
projects and applications.
• Python provides strong support for mathematics,
statistics, and scientific computing.
• It offers mature libraries for data science
applications.
• Its simple syntax and ease of use make it easy
to adopt for people who do not have an engineering
background.
• Code runs anywhere, including Windows, Mac OS X,
UNIX, and Linux.
Most commonly used libraries for
data science
NumPy:
▫ NumPy stands for Numerical Python.
▫ NumPy is a Python library that provides mathematical functions for
handling large multidimensional arrays.
▫ It provides functions and methods for arrays, matrices, and linear
algebra.
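The points above can be illustrated with a minimal sketch. The array values are made up for illustration; only standard NumPy calls are used.

```python
import numpy as np

# A 2x3 array (matrix); the numbers are illustrative only.
a = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

# Element-wise arithmetic applies to the whole array without an explicit loop.
doubled = a * 2

# Statistical methods can reduce the whole array or work along one axis.
column_means = a.mean(axis=0)  # mean of each column

# Linear algebra: matrix product of a (2x3) with its transpose (3x2).
gram = a @ a.T

print(doubled)
print(column_means)
print(gram)
```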
Pandas:
▫ A popular Python library for data manipulation and analysis.
▫ Pandas provides useful functions for manipulating large amounts of
structured data.
▫ Pandas provides convenient methods for performing analysis.
▫ Pandas is designed for quick and easy data manipulation, aggregation,
and visualization.
▫ There are two main data structures in pandas:
Series: stores and handles one-dimensional data.
DataFrame: stores and handles two-dimensional data.
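A short sketch of the two data structures follows. The names, departments, and scores are invented sample data, not taken from the course material.

```python
import pandas as pd

# Series: one-dimensional labelled data.
s = pd.Series([10, 20, 30], index=["a", "b", "c"])

# DataFrame: two-dimensional data with named columns (sample values only).
df = pd.DataFrame({
    "name": ["Ann", "Bob", "Cara"],
    "dept": ["CS", "CS", "Math"],
    "score": [82, 91, 77],
})

# Aggregation: average score per department.
mean_by_dept = df.groupby("dept")["score"].mean()

print(s["b"])
print(df.describe())  # summary statistics for the numeric column
print(mean_by_dept)
```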
Matplotlib:
▫ A useful Python library for data visualization.
▫ Provides various methods to visualize data and make line
graphs, pie charts, histograms, and other professional-grade
figures.
▫ Matplotlib has interactive features such as zooming and
panning, and can save a graph in common graphics formats.
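As a minimal sketch of the plotting and saving features described above: the data is generated on the spot, the output file name `example_plot.png` is an arbitrary choice, and the non-interactive `Agg` backend is selected only so the script also runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; omit this line for interactive use
import matplotlib.pyplot as plt
import numpy as np

# Illustrative data: one period of a sine wave.
x = np.linspace(0, 2 * np.pi, 100)
y = np.sin(x)

# Two side-by-side plots: a line graph and a histogram of the same values.
fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(8, 3))
ax1.plot(x, y)
ax1.set_title("Line graph")
ax2.hist(y, bins=10)
ax2.set_title("Histogram")

# Save the figure in a graphics format (PNG here; the file name is arbitrary).
fig.savefig("example_plot.png")
```

In an interactive session, `plt.show()` opens a window with the zoom and pan controls.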
SciPy:
▫ SciPy provides functionality for scientific and mathematical
computing.
▫ SciPy contains sub-modules for optimization, linear
algebra, integration, interpolation, special functions, FFT,
signal and image processing, ODE solvers, statistics, and
other tasks common in science and engineering.
SciPy sub-modules
• scipy.integrate: numerical integration routines and differential equation
solvers
• scipy.linalg: linear algebra routines and matrix decompositions extending
beyond those provided in numpy.linalg.
• scipy.optimize: function optimizers (minimizers) and root finding
algorithms
• scipy.signal: signal processing tools
• scipy.sparse: sparse matrices and sparse linear system solvers
• scipy.special: wrapper around SPECFUN, a Fortran library implementing
many common mathematical functions, such as the gamma function
• scipy.stats: standard continuous and discrete probability distributions
(density functions, samplers, cumulative distribution functions), various
statistical tests, and more descriptive statistics
• scipy.weave: tool for using inline C++ code to accelerate array computations
(Python 2 only; no longer included in current SciPy releases)
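A brief sketch touching three of the sub-modules listed above. The functions being integrated and minimized are arbitrary examples chosen because their exact answers are known.

```python
from scipy import integrate, optimize, stats

# scipy.integrate: numerically integrate x^2 from 0 to 3 (exact value is 9).
area, abs_error = integrate.quad(lambda x: x ** 2, 0, 3)

# scipy.optimize: find the minimizer of (x - 2)^2 (exact minimizer is x = 2).
result = optimize.minimize_scalar(lambda x: (x - 2) ** 2)

# scipy.stats: cumulative distribution function of the standard normal at 0.
p = stats.norm.cdf(0)

print(area)
print(result.x)
print(p)
```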
IPython:
• IPython (Interactive Python) is a component of the standard scientific
Python toolset.
• It is an enhanced Python shell designed to accelerate the writing,
testing, and debugging of Python code.
• It is particularly useful for interactively working with data and
visualizing data with matplotlib.
• It also provides an infrastructure for interactive parallel and
distributed computing.
Scikit-learn:
• Scikit-learn (imported as sklearn) provides various algorithms and
functions that are used in machine learning.
• Scikit-learn is built on NumPy, SciPy, and matplotlib.
• Scikit-learn provides simple, easy-to-use tools for data mining and data
analysis.
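A minimal sketch of a typical scikit-learn workflow, assuming the bundled Iris dataset and a k-nearest-neighbours classifier. Both are illustrative choices, not something the course material prescribes.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Load a small built-in dataset: 150 flowers, 4 measurements each.
X, y = load_iris(return_X_y=True)

# Hold out a quarter of the data for testing; random_state fixes the split.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Fit a k-nearest-neighbours classifier and measure accuracy on unseen data.
model = KNeighborsClassifier(n_neighbors=5)
model.fit(X_train, y_train)
accuracy = model.score(X_test, y_test)

print(accuracy)
```

Most scikit-learn estimators follow the same `fit` / `predict` / `score` pattern, so swapping in another algorithm usually changes only the line that constructs `model`.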
Prerequisites
• Basic understanding of Python
• Python variables, expressions, operations
• Control structures
• Strings and lists
• Functions, importing packages
• File operations (open, read, write)
• Classes and objects in Python
