Professional Documents
Culture Documents
Ip Project Complete
Ip Project Complete
Ip Project Complete
The work and conduct of Preeti Dhakarwal during the completion of her
project was excellent.
Signature Signature
Principal Signature
ACKNOWLEDGEMENT
I Preeti Dhakarwal of Class XII ‘Science’ would sincerely like to thank
Mr.Mohit Jain Sir my Infomatics Practices teacher, who has been a
Constant source of inspiration and guidance throughout the project Work.
I am highly indebted to my teacher for his guidance and constant
supervision as well as for providing necessary information regarding the
project and also for his support in completing the project.
I would also like to thank my Principal Dr. Shweta Vinchurkar who has
encouraged in General the Science students to be more analytical in
solving the Problems.
With Thanks
XII- ‘Science’
Signature:-
FRONT
END
What is python?
Python is an interpreted,object-oriented,high-level
andgeneral-purpose programming language developed by
Guido van Rossum.It was originally released on 20 February
1991.Designed to be easy aswell as fun,the name “python”
is a nod to theBritish comedy group Monty Python.
Python is commomly used for developing websites and
software,task automation,data analysis,and data
visualisation.Since it’s relatively easy to learn,Python has
been adopted by many non-programmers,such as
accountants and scientists,for a variety of everyday
tasks,like organizing finances.
WHAT IS NUMPY?
NumPy is a Python library used for working with arrays. It
also has functions for working in domain of linear algebra,
fourier transform, and matrices. NumPy was created in 2005
by Travis Oliphant. It is an opem source project and you can
use it freely.
NumPy stands for Numerical Python.
The array object in NumPy is called ndarray, it provides a lot
of supporting functions that make working with ndarray
very easy. Arrays are very frequently used in data science.
BACK
END
CSV
Comma-separated values (CSV) is a text file format
that uses commas to separate values. A CSV file
stores tabular data in plain text, where each line of
the file typically represents one data record. Each
record consists of the same number of fields, and
these are separated by commas in the CSV file.CSV
files are commonly used for Data analysis because
they can be easily imported into various software
tools and platforms, such as Microsoft Excel or
Google Sheets. This makes it easy to analyze and
manipulate large data sets, such as sales, surveys,
or financial data.
SOURCE
CODE
import numpy as np # numerical
computing
import pandas as pd # data
processing, CSV file I/O (e.g.
pd.read_csv)
import matplotlib.pyplot as plt
#visualization
import seaborn as sns #modern
visualization
plt.rcParams['figure.figsize'] =
(14, 8)
#sns.set_style("darkgrid")
df = pd.read_csv("ipl1.csv")
print('-------------------------
--------------------------------
------------------')
print('-------------------------
--------------------------------
------------------')
print(df.info())
print('-------------------------
--------------------------------
------------------')
print('-------------------------
--------------------------------
------------------')
print('Total Matches are
' ,df['id'].max())
print('-------------------------
--------------------------------
------------------')
print('-------------------------
--------------------------------
------------------')
print('How many seasons data we
have got in the dataset? ')
print(df['season'].unique())
print('-------------------------
--------------------------------
------------------')
print('-------------------------
--------------------------------
------------------')
print('Which Team had won by
maximum runs? ')
print(df.iloc[df['win_by_runs'].
idxmax()])
print('-------------------------
--------------------------------
------------------')
print('-------------------------
--------------------------------
------------------')
print('Which Team had won by
maximum wickets? ')
print(df.iloc[df['win_by_wickets
'].idxmax()]['winner'])
print('-------------------------
--------------------------------
------------------')
print('-------------------------
--------------------------------
------------------')
print('Which Team had won by
(closest margin) minimum runs?
')
print(df.iloc[df[df['win_by_runs
'].ge(1)].win_by_runs.idxmin()]
['winner'])
print('-------------------------
--------------------------------
------------------')
print('-------------------------
--------------------------------
------------------')
print('Which Team had won by
minimum wicket? ')
print(df.iloc[df[df['win_by_wick
ets'].ge(1)].win_by_wickets.idxm
in()])
print('-------------------------
--------------------------------
------------------')
print('-------------------------
--------------------------------
------------------')
print('Which season had most
number of matches?')
sns.countplot(x='season',
data=df)
plt.show()
print()
print('-------------------------
--------------------------------
------------------')
print('-------------------------
--------------------------------
------------------')
print('The Most Successful IPL
Team is:::')
data = df.winner.value_counts()
sns.barplot(y = data.index, x =
data, orient='h')
print()
print('-------------------------
--------------------------------
------------------')
print('-------------------------
--------------------------------
------------------')
print('The Players who got
maximum times Man of the Match
are:::')
top_players =
df.player_of_match.value_counts(
)[:10]
sns.barplot(x="day",
y="total_bill")
fig, ax = plt.subplots()
ax.set_ylim([0,20])
ax.set_ylabel("Count")
ax.set_title("Top player of the
match Winners")
top_players.plot.bar()
sns.barplot(x =
top_players.index, y =
top_players,
orient='v',palette="Blues")
plt.show()
OUTPUT
SCREENSHOTS
Table Information
Total No. of Matches