DataCamp Python for Spreadsheet Users

PYTHON FOR SPREADSHEET USERS

Welcome to Python!

Chris Cardillo
Data Scientist at DataCamp

What is Python?
General purpose programming language
Has code packages for data science
Object-oriented

Object-oriented spreadsheets

Object-oriented Python
IN PYTHON             IN SPREADSHEETS
math.fsum([2, 3])     =SUM(2, 3)

The power of the dot

math.fsum()
math is our package

fsum is our function in the package

The dot (.) lets us access the function inside the package



Scripts

A Recipe for Buttered Toast

1. Turn on toaster

2. Put bread in toaster for 1 minute

3. Take bread out of toaster

4. Put butter on toast



Importing packages
import math
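
As a small sketch of what an import makes possible: once math is imported, the dot reaches the functions inside it. Function names in Python's math package are lowercase. The variable names root and total are only illustrative.

```python
import math

# The dot reaches into the package and picks out one function.
root = math.sqrt(49)        # square root, like =SQRT(49) in a spreadsheet
total = math.fsum([2, 3])   # adds up a list of numbers, like =SUM(2, 3)

print(root)
print(total)
```

Without the import line, Python would not know what math refers to and the script would stop with an error.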

Importing pandas
import pandas

Loading Excel data


import pandas

pandas.read_excel('fruit.xlsx')

fruit.xlsx
(the kind of data pandas likes)

pandas as pd

before

import pandas

pandas.read_excel('fruit.xlsx')

after

import pandas as pd

pd.read_excel('fruit.xlsx')
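
One way to convince yourself that the alias changes nothing but the name is the quick check below. It is only a sketch; same_package is an illustrative variable name.

```python
import pandas
import pandas as pd

# Both names point at the same package; pd is only a shorter label.
same_package = pd is pandas

print(same_package)
```

Because the two names refer to one package, pd.read_excel and pandas.read_excel are the same function.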

Variable assignment
import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

fruit.xlsx

Looking at data with print()


import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit)
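
If fruit.xlsx is not at hand, the same steps can be tried on a small table built in code. This is a sketch with stand-in data: the rows are invented for illustration, and the column names name, price_usd, and quantity are assumed to match the file.

```python
import pandas as pd

# Hypothetical stand-in for fruit.xlsx: these rows are made up.
fruit = pd.DataFrame({
    'name': ['Banana', 'Apple', 'Cherry', 'Mango'],
    'price_usd': [0.25, 1.5, 3.0, 0.75],
    'quantity': [12, 6, 40, 3],
})

# The variable name on the left of = now refers to the whole table.
print(fruit)
```

Assignment stores the table under the name fruit, and print() displays whatever that name currently holds.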

Script and console panes in DataCamp


SCRIPT.PY is the script

IPYTHON SHELL is the console


Your turn!
DataFrames and their methods


Where we left off


import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit)

Anatomy of a pandas DataFrame



DataFrame methods
.head()

.info()

.describe()

.sort_values()
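
A method is called with the same dot used for packages, but on the DataFrame itself. The sketch below tries three of the methods listed above on a stand-in table; the rows are invented for illustration and the column names are assumptions.

```python
import pandas as pd

# Hypothetical stand-in for fruit.xlsx: these rows are made up.
fruit = pd.DataFrame({
    'name': ['Banana', 'Apple', 'Cherry', 'Mango'],
    'price_usd': [0.25, 1.5, 3.0, 0.75],
    'quantity': [12, 6, 40, 3],
})

first_two = fruit.head(2)     # first 2 rows (the default is 5)
summary = fruit.describe()    # count, mean, min, max, etc. for numeric columns

fruit.info()                  # prints column names, types, and non-null counts
print(first_two)
print(summary)
```

Note that .describe() leaves out the text column name by default and summarizes only the numeric columns.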

The .head() method


import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit.head())

The .head() method


import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit.head(2))

The .info() method


import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

fruit.info()  # .info() prints its own summary and returns None, so it needs no print()

The .describe() method


import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit.describe())

The .sort_values() method


import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

fruit = fruit.sort_values('name')
fruit = fruit.reset_index(drop=True)

print(fruit)

The .sort_values() method


import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

fruit = fruit.sort_values('price_usd', ascending=False)
fruit = fruit.reset_index(drop=True)

print(fruit.head(3))
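
To see why .reset_index(drop=True) follows the sort, it helps to look at the row labels before and after. The sketch below uses stand-in rows invented for illustration; old_labels and new_labels are illustrative names.

```python
import pandas as pd

# Hypothetical stand-in for fruit.xlsx: these rows are made up.
fruit = pd.DataFrame({
    'name': ['Banana', 'Apple', 'Cherry', 'Mango'],
    'price_usd': [0.25, 1.5, 3.0, 0.75],
    'quantity': [12, 6, 40, 3],
})

# Sorting moves the rows, but each row keeps its original label.
fruit = fruit.sort_values('price_usd', ascending=False)
old_labels = list(fruit.index)

# reset_index(drop=True) renumbers the rows from 0 and discards the old labels.
fruit = fruit.reset_index(drop=True)
new_labels = list(fruit.index)

print(old_labels)
print(new_labels)
print(fruit.head(3))
```

With drop=True the old labels are thrown away; without it, pandas would keep them as an extra column named index.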
Your turn!
Filtering rows and creating columns


Filtering in spreadsheets

Accessing a single column


fruit['name']

Comparison operators
== (equal to)

!= (not equal to)

< (less than)

> (greater than)

<= (less than or equal to)

>= (greater than or equal to)



Comparisons
fruit['name']
fruit['name'] == "Apple"
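
Comparing a column to a value checks every row at once and returns one True or False per row. Here is a minimal sketch on stand-in data; the rows are invented, and is_apple and over_a_dollar are illustrative names.

```python
import pandas as pd

# Hypothetical stand-in for fruit.xlsx: these rows are made up.
fruit = pd.DataFrame({
    'name': ['Banana', 'Apple', 'Cherry', 'Mango'],
    'price_usd': [0.25, 1.5, 3.0, 0.75],
    'quantity': [12, 6, 40, 3],
})

# Each comparison produces a column of True/False values, one per row.
is_apple = fruit['name'] == "Apple"
over_a_dollar = fruit['price_usd'] > 1

print(is_apple)
print(over_a_dollar)
```

Note the double == for "equal to"; a single = is reserved for assigning a value to a variable.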

Filtering
fruit[fruit['name'] == "Apple"]
fruit[fruit['price_usd'] > 1]
fruit[fruit['price_usd'] > 1].reset_index(drop=True)

Basic filtering pattern


df[df['this_column'] == 'that value']
IN PYTHON IN SPREADSHEETS
fruit[fruit['name'] == "Apple"]
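
The filtering pattern can be read as two steps: build a True/False column, then hand it to the DataFrame in square brackets so only the True rows are kept. The sketch below assumes stand-in rows invented for illustration; apples and pricey are illustrative names.

```python
import pandas as pd

# Hypothetical stand-in for fruit.xlsx: these rows are made up.
fruit = pd.DataFrame({
    'name': ['Banana', 'Apple', 'Cherry', 'Mango'],
    'price_usd': [0.25, 1.5, 3.0, 0.75],
    'quantity': [12, 6, 40, 3],
})

# Keep only the rows where the comparison is True.
apples = fruit[fruit['name'] == "Apple"]

# Filtered rows keep their original labels unless the index is reset.
pricey = fruit[fruit['price_usd'] > 1].reset_index(drop=True)

print(apples)
print(pricey)
```

Filtering does not change fruit itself; it returns a new DataFrame, which is why the result is stored under a new name here.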

Creating a new column



Mathematical operators
+ (addition)

- (subtraction)

* (multiplication)

/ (division)
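
The four operators behave on plain numbers much as they do in a spreadsheet cell. Here is a short sketch, using two arbitrary numbers chosen for illustration.

```python
a = 6
b = 4

added = a + b        # like =6+4
subtracted = a - b   # like =6-4
multiplied = a * b   # like =6*4
divided = a / b      # like =6/4; division always gives a decimal number

print(added, subtracted, multiplied, divided)
```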

Creating a new column


fruit['cost'] = fruit['price_usd'] * 2
fruit['cost'] = fruit['price_usd'] * fruit['quantity']
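
Multiplying two columns works row by row, much like filling a formula down a spreadsheet column. The sketch below assumes stand-in rows invented for illustration.

```python
import pandas as pd

# Hypothetical stand-in for fruit.xlsx: these rows are made up.
fruit = pd.DataFrame({
    'name': ['Banana', 'Apple', 'Cherry', 'Mango'],
    'price_usd': [0.25, 1.5, 3.0, 0.75],
    'quantity': [12, 6, 40, 3],
})

# Each row's cost is that row's price times that row's quantity.
fruit['cost'] = fruit['price_usd'] * fruit['quantity']

print(fruit)
```

Assigning to a column name that does not exist yet creates the column; assigning to an existing name overwrites it.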
Your turn!
