Professional Documents
Culture Documents
Phase 3
Phase 3
PRESENTED BY
V.MATHAN KUMAR
II Yr BE.CSE
Roll NO:911722104069
INTRODUCTION:
Phase 3 of our fraud detection in online transactions project pivots towards data
visualization, a pivotal facet of data analysis and interpretation. Proficient data visualization
techniques are instrumental in conveying insights, trends, and patterns within the vast
datasets inherent to online transaction monitoring. By leveraging visual representations,
stakeholders gain a clearer understanding of intricate relationships, empowering them to
make informed decisions critical for the advancement and security of online financial
operations.
OBJECTIVES:
Create informative and visually appealing visualizations to explore and communicate key
insights from the dataset.
Utilize various visualization techniques to effectively represent different types of data.
Enhance user engagement and understanding through interactive visualizations.
Document the data visualization process comprehensively for transparency and
reproducibility.
DATA DESCRIPTION:
The dataset consists of transaction data captured from an online platform, representing
various attributes related to user interactions and financial activities. Key data points include
transaction amount, timestamp, user ID, location, payment method, and transaction status
(e.g., approved or declined). Each row in the dataset details individual transaction
characteristics, providing a comprehensive view of user behavior and transaction patterns.
This dataset enables the analysis of potential fraudulent activities and the identification of
trends and anomalies in online transactions.
DATA VISUALIZATION TECHNIQUES:
1. UNIVARIATE VISUALIZATIONS:
CODE:
CODE:
plt.bar(data['category'].value_counts().index,
data['category'].value_counts().values)
plt.xlabel('Category')
plt.ylabel('Frequency')
plt.title('Bar Chart of Category Column')
plt.show()
OUTPUT:
2.BIVARIATE VISUALIZATIONS:
CODE:
plt.scatter(data['gender'], data['cc_num'])
plt.xlabel('gender')
plt.ylabel('cc_num')
plt.title('Scatter Plot of Gender vs cc_num')
plt.show()
OUTPUT:
CODE:
import seaborn as sns
sns.boxplot(x='category', y='cc_num', data=data)
plt.xlabel('Category')
plt.ylabel('cc_num')
plt.title('Box Plot of cc_num by Category')
plt.show()
OUTPUT:
3. MULTIVARIATE VISUALIZATIONS
CODE:
sns.pairplot(data)
plt.title('Pair Plot of Numerical Variables')
plt.show()
OUTPUT:
4.INTERACTIVE VISUALIZATIONS:
CODE:
import plotly.express as px
fig = px.scatter(data, x='street', y='city', hover_data=['cc_num'])
fig.show()
OUTPUT:
CODE:
import pandas as pd
import dash
from dash import dcc
from dash import html
import plotly.express as px
data['trans_date_trans_time'] = pd.to_datetime(data['trans_date_trans_time'])
app = dash.Dash(__name__)
app.layout = html.Div([
dcc.Graph(
id='interactive-plot',
figure=fig
)
])
if __name__ == '__main__':
app.run_server(debug=True)
OUTPUT:
ASSUMED SCENARIO:
CONCLUSION:
THANK YOU