Professional Documents
Culture Documents
Python Practive Code
Python Practive Code
csv"
names = ['preg', 'plas', 'pres', 'skin', 'test', 'mass', 'pedi', 'age', 'class']
data = read_csv(filename, names=names)
data.plot(kind='density', subplots=True, figsize=(15,17),
layout=(3,3),sharex=False, sharey=False)
plt.show()
filename = "train_university_admission_data.csv"
names = ['admit','gre','gpa','prestige']
dfTrain = pd.read_csv(filename, names=names)
dfTest = pd.read_csv(filename, names=names)
filename = "train_university_admission_data.csv"
names = ['admit','gre','gpa','prestige']
dfTrain = pd.read_csv(filename, names=names)
#dfTest = pd.read_csv(filename, names=names)
filename = "train_university_admission_data.csv"
names = ['admit','gre','gpa','prestige']
dfTrain = pd.read_csv(filename, names=names)
#dfTest = pd.read_csv(filename, names=names)
dfTrain.groupby('prestige').size()
dfTrain.groupby('admit').mean()
https://wipro365.sharepoint.com/sites/ku-practice-4044/ANALYTICS-
COMPETENCY/Recordings/Forms/AllItems.aspx?id=%2Fsites%2Fku%2Dpractice
%2D4044%2FANALYTICS%2DCOMPETENCY%2FRecordings%2FNon%20Academy%2FAIML%2DPython%20for
%20DS%5FL1L2%2DGuru%2DMay20&viewid=aa63c8dc%2D8568%2D4868%2Daad3%2Def4ed0b40a5f
dfTrain.groupby('prestige').mean()[['gre','gpa']]
pd.crosstab(dfTrain['admit'], dfTrain['prestige'],
rownames=['Admit']).plot(kind='bar')
pd.crosstab(dfTrain['admit'], dfTrain['prestige'],
rownames=['Admit']).plot(kind='bar')
pd.crosstab(dfTrain['admit'],dfTrain['prestige'],rownames=['Admit'])
pd.groupby('dftrain['admiy'].sum()
unstack() function?
from Shivansh Chaudhri to everyone:
dfTrain[dfTrain['admit']==0].sum()
from Gururajan Narasimhan to everyone:
pd.crosstab(dfTrain['admit'], dfTrain['prestige'],
rownames=['Admit']).plot(kind='bar')
doing normalization:
----------------------
X = array[:,0:8]
Y = array[:,8]
scaler = Normalizer().fit(X)
normalizedX = scaler.transform(X)
# summarize transformed data
set_printoptions(precision=3)
print(normalizedX[0:7,:])
doing standardization:
----------------------
X = array[:,0:8]
Y = array[:,8]
scaler = StandardScaler().fit(X)
rescaledX = scaler.transform(X)
# summarize transformed data
set_printoptions(precision=3)
print(rescaledX[1:6,:])
X = array[:,0:8]
Y = array[:,8]
scaler = MinMaxScaler(feature_range=(10, 30))
rescaledX = scaler.fit_transform(X)
# summarize transformed data
set_printoptions(precision=4)
print(rescaledX[0:5,:])
X = array[:,0:8]
Y = array[:,8]
scaler = MinMaxScaler(feature_range=(10, 30))
rescaledX = scaler.fit_transform(X)
# summarize transformed data
set_printoptions(precision=4)
print(rescaledX[0:5,:])
peaor son is usied for the normal disctribution -- mostly used one
kinder and used for ranked distribution
peaor man -- is used for ranked and normal disctributed