Professional Documents
Culture Documents
EDA - Visualization - Ipynb - Colab
EDA - Visualization - Ipynb - Colab
EDA - Visualization - Ipynb - Colab
ipynb - Colab
https://colab.research.google.com/drive/1bPKboF2NkJBP5w7twnNxYEMTkWFwYXSY?authuser=0#scrollTo=faCn9ZSa821P&uniqifier=1&printM… 1/8
5/13/24, 6:38 PM EDA_Visualization.ipynb - Colab
1 # Calculating mean
2 mean = df.mean()
3
4 # Calculating median
5 median = df.median()
6
7 # Calculating mode
8 mode = df.mode().iloc[0]
9
10 print("Mean:")
11 print(mean)
12 print("\nMedian:")
13 print(median)
14 print("\nMode:")
15 print(mode)
Mean:
Parent Qualification 1.923414
DGIF_1 2.382932
DGIF_2 0.829322
DGIF_4 2.562363
Background 0.691466
BIF_1 (Who influence your decision in subject selection) 3.133479
BIF_2 (Factor influencing decision) 2.586433
BIF_4 (Placements) 2.332604
SGIF_1 (Dependency) 2.708972
SGIF_2 (Frequency of Support) 2.507659
SGIF_3 (Confedence Level) 1.702407
IAIF_2 1.728665
IAIF_3 1.838074
ESIF_1 2.019694
ESIF_2 (What Decide Future Goal) 1.590810
Overall 1.715536
dtype: float64
Median:
Parent Qualification 2.0
DGIF_1 3.0
DGIF_2 1.0
DGIF_4 3.0
Background 1.0
BIF_1 (Who influence your decision in subject selection) 4.0
BIF_2 (Factor influencing decision) 3.0
BIF_4 (Placements) 2.0
SGIF_1 (Dependency) 3.0
SGIF_2 (Frequency of Support) 3.0
SGIF_3 (Confedence Level) 1.0
IAIF_2 2.0
IAIF_3 1.0
ESIF_1 1.0
ESIF_2 (What Decide Future Goal) 2.0
Overall 1.0
dtype: float64
Mode:
Parent Qualification 1
https://colab.research.google.com/drive/1bPKboF2NkJBP5w7twnNxYEMTkWFwYXSY?authuser=0#scrollTo=faCn9ZSa821P&uniqifier=1&printM… 2/8
5/13/24, 6:38 PM EDA_Visualization.ipynb - Colab
DGIF_1 3
DGIF_2 1
DGIF_4 3
Background 1
BIF_1 (Who influence your decision in subject selection) 4
BIF_2 (Factor influencing decision) 4
BIF_4 (Placements) 2
SGIF_1 (Dependency) 4
SGIF_2 (Frequency of Support) 4
SGIF_3 (Confedence Level) 1
IAIF_2 1
IAIF_3 1
ESIF_1 1
ESIF_2 (What Decide Future Goal) 0
Overall 0
Name: 0, dtype: int64
https://colab.research.google.com/drive/1bPKboF2NkJBP5w7twnNxYEMTkWFwYXSY?authuser=0#scrollTo=faCn9ZSa821P&uniqifier=1&printM… 4/8
5/13/24, 6:38 PM EDA_Visualization.ipynb - Colab
https://colab.research.google.com/drive/1bPKboF2NkJBP5w7twnNxYEMTkWFwYXSY?authuser=0#scrollTo=faCn9ZSa821P&uniqifier=1&printM… 5/8
5/13/24, 6:38 PM EDA_Visualization.ipynb - Colab
1 # Ploting Scatterplot
2 sns.pairplot(df)
3 plt.title('Scatter Plots')
4 plt.show()
https://colab.research.google.com/drive/1bPKboF2NkJBP5w7twnNxYEMTkWFwYXSY?authuser=0#scrollTo=faCn9ZSa821P&uniqifier=1&printM… 6/8
5/13/24, 6:38 PM EDA_Visualization.ipynb - Colab
1 data = pd.read_excel('/content/encoded_data.xlsx')
2 # Extract the numeric columns
3 numeric_columns = data.select_dtypes(include=['number'])
4 # Plot Histogram
5 plt.figure(figsize=(10, 6))
6 for col in numeric_columns.columns:
7 sns.histplot(data[col], kde=True, bins=20, alpha=0.5, label=col)
8 plt.title('Histogram')
9 plt.xlabel('Value')
10 plt.ylabel('Frequency')
11 plt.legend()
12 plt.show()
1 plt.figure(figsize=(10, 6))
2 sns.boxplot(data=numeric_columns, orient='h')
3 plt.title('Box Plot')
4 plt.xlabel('Value')
5 plt.show()
https://colab.research.google.com/drive/1bPKboF2NkJBP5w7twnNxYEMTkWFwYXSY?authuser=0#scrollTo=faCn9ZSa821P&uniqifier=1&printM… 8/8