Professional Documents
Culture Documents
Ids 10
Ids 10
Science
Pandas
Generate Series
2
Series Index and Slicing
3
Manipulate Series
• dict_1 = {"Nama": "Ani", "Umur": 21, "WNI": True, "Gender": "Female", "GPA": 3.5}
• dict_2 = {"Nama": "Budi", "Umur": 20, "WNI": True, "Gender": "Male", "GPA": 3.3}
• dict_3 = {"Nama": "Charlie", "Umur": 23, "WNI": False, "Gender": "Male", "GPA": 3.2}
• dict_4 = {"Nama": "Devi", "Umur": 20, "WNI": True, "Gender": "Female", "GPA": 3.7}
• bio_ani = pd.Series(dict_1)
• bio_budi = pd.Series(dict_2)
• bio_charlie = pd.Series(dict_3)
• bio_devi = pd.Series(dict_4)
• df_bimbingan = pd.DataFrame([bio_ani,bio_budi,bio_charlie,bio_devi])
5
Generate DataFrame (2)
6
Indexing DataFrame
• df_bimbingan['Nama’]
• df_bimbingan[['Nama', 'Umur’]]
• df_bimbingan.iloc[1]
• df_bimbingan.iloc[1:3]
• df_bimbingan.iloc[1:4,2:5]
7
Query DataFrame
• df_bimbingan[df_bimbingan["Gender"]=="Male"]
• df_bimbingan[df_bimbingan["GPA"]>=3.5]
• df_bimbingan[df_bimbingan["GPA"]>=3.5 &
df_bimbingan["Gender"]=="Male"]]
8
DataFrame Manipulation
• df_bimbingan.info()
• df_bimbingan.Gender.unique()
• df_bimbingan.Gender.value_counts()
• df_bimbingan.Umur.max()
• df_bimbingan.GPA.mean()
• df_bimbingan.GPA.describe()
10
DataFrame Pivot
• df_bimbingan.pivot(columns='Gender', values='Umur’)
• df_bimbingan.pivot(columns='Gender',
values='Umur').mean()
11
DataFrame Group by
• df_bimbingan.groupby('Gender').mean(numeric_only=True)
12
DataFrame Join
13
Read and Write Data
• df_bimbingan.to_csv('df_bimbingan.csv')
• df_bimbingan_edited = pd.read_csv('df_bimbingan.csv')
14
Exercise (1)
• url =
'https://raw.githubusercontent.com/datasciencedojo/datasets/mast
er/titanic.csv'
• df = pd.read_csv(url, index_col=0)
• df.head()
15
Exercise (2)
16