Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 24

BUSINESS ANALYTICS LAB TEST-1 MBA222C6

LAB ASSESSMENT-1
REPORT
Course: BUSINESS Subject Code: MBA222C6
ANALYTICS

STUDENT NAME PREETHI JOHN STUDENT USN 1MS22BA081

TABLE OF CONTENTS

Contents
SECTION-A..............................................................................................................................2
QUESTION-1.........................................................................................................................2
QUESTION-2.........................................................................................................................3
QUESTION-3.........................................................................................................................4
SECTION-B..............................................................................................................................5
QUESTION-1.........................................................................................................................5
QUESTION-2.........................................................................................................................6
QUESTION-3.........................................................................................................................8

1|Page
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

SECTION-A

QUESTION-1
1.1. Export the data into Microsoft Access. Write all the steps which you would
follow in order to correctly import the data

Answer:
Follow the steps to import the excel data into Microsoft Access:
a. Open Microsoft Access and click on “Blank Database”
b. Create the blank database
c. Import the data from three sheets
i. From “External Data” menu selects “New Data
Source”—From file --- Excel
ii. Firstly select “CUSTOMERS DATA” and then next
iii. Select “First row contains columns headings”
iv. Import spreadsheet wizard – select “first row contains
column headings” and NEXT
v. Check the datatype of each column (Note: all the columns
must be identified as short text)
vi. Choose the primary key as “INVESTOR ID”
vii. Finish
viii. Repeat the step for next worksheets “INVESTMENT
DATA”, STATUS, PURPOSE, CUSTOMER VALUE.
Ensure to use “INVESTOR ID” as primary key

1.2. Take screenshots (Windows key + PrtScr key) at every stage and paste the
same after each step followed. (do this procedure only for one worksheet and not for
all 5 sheets)

 Step-1 Go to Access, click on blank document

 Step-2 Create Blank DATABASE, and create blank Data

2|Page
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

Step-3 Go to external data and import the file you require from excel
1. Click on External data tab
2. Click on import from excel

3. Browse the file ( FINANCE CASE) you require

4. Make sure all the data variables are in short text format

3|Page
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

5. Choose your own primary key, wherein I have used INVESTOR ID as the
Primary key

4|Page
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

6. Select the first row as headings


7. Import the file

8. Then click on CLOSE


9. Repeat the same steps to get the remaining sheets gets imported

5|Page
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

1.3. Which is the primary key considered? Why was it chosen?

I have chosen Investor ID as the primary Key. Because it is a unique data and a
linking key of all the sheets
1. Uniqueness: A primary key must be unique for each record in the table. This uniqueness
ensures that each record can be uniquely identified and distinguished from others.
2. Data Integrity: The primary key helps maintain data integrity by preventing duplicate
records from being entered into the table. It enforces the integrity of the data and ensures
accuracy.
3. Efficient Retrieval: The primary key is used to index the table, making data retrieval faster
and more efficient. It allows the database management system to quickly locate and access
specific records.
4. Relationships: Primary keys are often used to establish relationships between tables in a
relational database. Foreign keys in other tables reference the primary key of a table, creating
links between related data.
5. Joins: Primary keys are commonly used in join operations to combine data from multiple
tables. When joining tables, primary keys help identify the corresponding records to be
merged.
6. Clustered Indexing: In many database systems, the primary key determines the physical
order of data storage on disk. This can enhance query performance for certain types of
queries.
7. Simplicity: A primary key should be simple, stable, and unlikely to change. Using an
existing attribute, such as a unique identifier or a natural key, can simplify database design
and maintenance.
8. Autonumbering: Often, databases use auto-incrementing integers as primary keys. This
ensures a new unique value is generated automatically for each new record added to the table.

6|Page
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

QUESTION-2
2.1. Create relationship diagram by creating one-to-one relationship between 5
tables provided (Note: first worksheet should be linked with all the rest) – Take
screenshot of the relationship diagram and explain the steps followed.
Step-1 Click the database tools tab
Step-2 Click the relationship button

Step-3 Click the show table button

Step-4 Click on the tables you want to add

7|Page
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

Step-5 Click Close


Step-6 Click the related field of the first table and second table

Step-7 Click the enforce referential Integrity Option


Step-8 Click on Create

8|Page
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

2.2. Using query command, retrieve the variables considered important for
conducting summary analysis. (Note: The variables important for the case will be
provided)

Once relationships are maintained one can generate customised dataset using querying. The
steps are as followed;
Step-1 Go to “create”
Step-2 Query design within queries

Step-3 Select the columns you want to select

9|Page
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

Step-4 Select “RUN” and save the file obtained as “case study”

Step-5 Export the data as “txt”/”xslx” file for further analysis

10 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

QUESTION-3
3.1. Export the data variables important for analysis and save the data in excel
format. Take a screenshot of the excel sheet with headings

SECTION-B
QUESTION-1
1.1. Based on the variables exported from Microsoft access, answer the following
questions

SL.N VARIABLE Identify the data types Identify the scale of


O NAME of each variable measurement of each variable

1) Investor id ------------------ ---------------------


2) Gender Qualitative Nominal
3) Annual income Quantitative Continuous
4) Education level Qualitative Nominal
5) Investment type Qualitative Nominal

11 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

6) Investment return Quantitative Continuous


7) Investment risk Qualitative Ordinal
8) Investment goal Qualitative Nominal
9) Investment Qualitative Ordinal
knowledge
10)

SL.NO VARIABLE Identify the type of Identify the visualization


NAME univariate analysis methods which can be followed
which can be done for for each variable
each variable

1) Investor id ---------------------------- ------------------------------------


-
2) Gender Frequency table Pie chart, bar graph, sun burst,
Tree Map
3) Annual income Descriptive analysis Box plot, histogram, line plot
4) Education level Frequency table Pie chart, bar graph, sun burst,
Tree Map
5) Investment type Frequency table Pie chart, bar graph, sun burst,
Tree Map
6) Investment return Descriptive analysis Box plot, histogram, line plot
7) Investment risk Frequency table Pie chart, bar graph, sun burst,
Tree Map
8) Investment goal Frequency table Pie chart, bar graph, sun burst,
Tree Map
9) Investment Frequency table Pie chart, bar graph, sun burst,
knowledge Tree Map
10)

QUESTION-2
2.1. Conduct visualization for each variable provided. (you can choose pivot tables
or charts directly).

A. Variable 1 (Obtain atleast two chart for each variable)


GENDE
R FREQ
Female 493
Male 507

12 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

GENDER
Female Male

INFERENCE : Here the Gender is nearly equally distributed, Female are 496 in number
and Male are 507, where number of male investors are more than female investors.

B. Variable 2 (Obtain atleast two chart for each variable)

Char t Title 499649


4366.7982127402 2617743751000
261774.375 2 261228 138090.28434611
19068926630.791 48910210547
100% 6 2
90%
80%
70%
60% - -
50% 1.1242722392370
0.0182982855645
40% 9 688
30%
20%
10%
0% 0
r t
m
e
ea
n
rro i an od
e
tio
n
nc
e sis ss ng
e
um um m un
co M E ed M iv a ir a rto ne a im im Su Co
lIn rd M Va Ku ew R in ax
a da De Sk M M
nnu a n rd p l e
A St da m
tan Sa
S

13 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

Chart Title

100%

90%

80%

70%

60%

50%

40%

30%

20%

10%

0%

INFERENCE : The dataset shows moderate mean (261,774.375) with relatively low
variability (standard deviation of 138,090.2843) and the distribution is slightly skewed and
exhibits negative kurtosis.
C. Variable 3 (Obtain atleast two chart for each variable)

Education Level FREQ


master 262
bachelor 396
high school 236
Ph.D 106

EDUCATION LEVEL EDUCATION LEVEL


400
350
300
250 11%
26%
200 master
24%
150
100 bachelor
50 40% high school
0
master bachelor high Ph.D Ph.D
school

INFERENCE : Here, we can conclude that maximum number of investors are bachelor
degree holders and 26% of them have masters, where as remaining 34% of the investors are
high school and Ph.D holders.

14 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

D. Variable 4 (Obtain atleast two chart for each variable)


Investment
Type FREQ
Mutual funds 273
Real estate 96
Stocks 233
Bonds 398

Investment Type
Mutual funds
Real estate
27% Stocks
40%
Bonds
10%
23%

Inves tm ent Type


400
350
300
250 398
200
273
150 233
100 96
50
0
Mutual funds Real estate Stocks Bonds

INFERENCE : Stocks and bonds together contribute 63% of the investments, whereas
investors have least interest in real estate (10%).

E. Variable 5 (Obtain atleast two chart for each variable)

15 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

Investment Return
100%
90%
80%
70%
60%
50%
40%
30%
20%
10%
0%
r t
ur
n
ea
n ro ia
n
od
e
tio
n
nc
e sis ss ng
e
um um m un
et Er ed ia rto ne Su Co
R M
rd
M via r u w Ra im im
t M
De Va K
Sk
e in ax
en nda d le M M
s tm St
a
da
r
m
p
ve an Sa
In St

16 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

INFERENCE : The dataset's central tendency is near the median (0.8) and mean (0.6974),
exhibiting low variability (standard deviation 3.191808403). The distribution appears slightly
negatively skewed with negative kurtosis, while the range spans from -5 to 6.
F. Variable 6 (Obtain atleast two chart for each variable)
InvestmentRis
k FREQ
1 207
2 211
3 183
4 215
5 183

InvestmentRisk

7% 1
13% 2
33% 3
4
5
20%

27%

Investment Risk
215
211
207

183

183

1 2 3 4 5

17 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

INFERENCE : In this distribution, The frequency table represents a dataset with


investment risk ranging from 1 to 5. The values occur as follows: 1 level risk has 207 times, 2
level risk has 211 times, 3 level risk has 183 times, 4 level risk has 215 times, and 5 level risk
has 183 times.

G. Variable 7 (Obtain atleast two chart for each variable)


Investment
Goals FREQ
Wealth 403
Retirement 260
Education 337

Investment Goal Investment Goal


Wealth Education
34% Retirement 34% Wealth
40% Education 40%
Re-
tire-
ment
26%
26%

INFERENCE : The frequency table illustrates the distribution of investment goals.


Among the options, "Wealth" has the highest frequency (403), followed by "Education"
(337), and "Retirement" (260).

H. Variable 8 (Obtain atleast two chart for each variable)


Investment
Knowledge FREQ
1 171
2 185
3 217
4 201
5 224

18 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

INFERENCE : The frequency table depicts levels of "Investment Knowledge." The


distribution shows that 5 holds the highest frequency (224), indicating relatively higher
investment knowledge, followed by 3 (217), 4 (201), 2 (185), and 1 (171) with ascending
frequencies.

QUESTION-3

3.1. Conduct univariate analysis of each qualitative variable separately and draw inference

GENDE FRE
R Q
Female 493
Male 507

19 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

Here the Gender is nearly equally distributed, Female are 496 in number and Male are 507,
where number of male investors are more than female investors.
INFERENCE: The given frequency table indicates a nearly balanced distribution between
genders. With 507 males and 493 females, there is no significant gender disparity in the
dataset, suggesting a relatively equitable representation of both genders in the analyzed
context.
EducationLev
el FREQ
master 262
bachelor 396
high school 236
Ph.D 106

INFERENCE : Here, we can conclude that maximum number of investors are bachelor
degree holders and 26% of them have masters, where as remaining 34% of the investors are
high school and Ph.D holders.
The frequency table outlines the education levels within the dataset. Among the categories,
"bachelor" has the highest frequency (396), followed by "master" (262), "high school" (236),
and "Ph.D" (106). This suggests that a significant portion of the dataset holds a bachelor's
degree, followed by master's and high school education levels. The relatively lower frequency
of "Ph.D" indicates a smaller representation of individuals with doctoral degrees in the
dataset.

Investment
Type FREQ
Mutual funds 273
Real estate 96
Stocks 233
Bonds 398

20 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

INFERENCE: The frequency table presents data on various investment types. "Bonds" have
the highest frequency (398), indicating that they are the most common investment type in the
dataset. "Mutual funds" follow with a frequency of 273, while "Stocks" have 233 instances.
"Real estate" appears the least frequent with 96 occurrences. This suggests that bonds are the
preferred investment choice among the given options, followed by mutual funds and stocks.
Real estate seems to be the least favored investment type in this dataset.

InvestmentRisk FREQ
1 207
2 211
3 183
4 215
5 183

INFERENCE: The frequency table shows how "InvestmentRisk" levels are distributed
throughout the sample. Each risk level appears almost equally often in the distribution,
indicating a generally balanced pattern. Levels 1 and 2 occur 207 and 211 times, respectively,
while levels 3 and 5 do so 183 times apiece. The Level 4 frequency is 215, which is the
highest. This indicates that there is no obvious dominance of any one risk level, and that the
dataset includes a wide variety of investment risk preferences.

InvestmentGoa
ls FREQ
Wealth 403
Retirement 260
Education 337

INFERENCE: The provided data displays the distribution of investment goals among a
group. "Wealth" is the most common goal, selected by 403 individuals. "Education" follows
with 337 individuals, and "Retirement" is chosen by 260. This suggests that a majority aim
for wealth accumulation, while a significant number prioritize education and retirement
planning.

Investment
Knowledge FREQ
1 171
2 185
3 217

21 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

4 201
5 224

INFERENCE: People's investment knowledge varies across different levels. The majority
have moderate knowledge (level 3: 217), while others range from low (level 1: 171) to high
(level 5: 224) understanding.
The frequency distribution represents levels of "Investment Knowledge." Among the options,
level 5 has the highest frequency (224), indicating a relatively higher investment knowledge
for those individuals. Level 3 follows closely with 217 instances, suggesting a strong
understanding of investment concepts

3.2. Conduct univariate analysis of each quantitative variable and draw inference

INFERENCE: The mean return for this dataset is approximately 0.525, indicating a relatively
small average return. The standard deviation of about 3.455 suggests a medium level of risk
in the returns. This means that the individual returns deviate from the mean by 3.455,
indicating a certain level of volatility in the dataset.
The provided data represents a series of investment returns. The returns vary widely,
including positive and negative values. The highest return is 6, and the lowest is -5. This data
suggests a range of investment outcomes, with both profitable and loss-making returns.
Further analysis, such as calculating the mean and standard deviation, can provide more
insights into the overall performance and risk associated with these investments.

22 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

INFERENCE:
Minimum income: 9,104
Maximum income: 499,649
Range : 490,545
The data represents a wide range of annual incomes, with the lowest income being 9,104 and
the highest income being 499,649.
Mean The average annual income is around $235,573.
Median : The middle annual income value is between the 250th and 251st values when
sorted.
Standard Deviation: Incomes vary from the average, showing how spread out they are around
$235,573. Higher deviation means more income variability.

23 | P a g e
BUSINESS ANALYTICS LAB TEST-1 MBA222C6

WRITE SUMMARY OF THE RESULTS OBTAINED. WHAT DECISIONS CAN BE


MADE TO HELP THE COMPANY (200 WORDS) BASED ON YOUR ANALYSIS OF
9-10 VARIABLES
The analysis of the provided dataset shows the valuable insights that can help in decision
making to optimize investment strategies, minimize risks, and help investors achieve their
financial objectives efficiently. The dataset is well-balanced in terms of gender
representation, indicating equitable inclusion of both males and females.
In terms of education levels, a significant portion of the dataset holds bachelor's
degrees, followed by master's and high school education. However, the lower frequency of
Ph.D. holders suggests a smaller representation of individuals with doctoral degrees.
Bonds emerge as the most common investment type, followed by mutual funds and stocks,
while real estate appears less invested. This suggests a demand for relatively stable and
conservative investment options.
The distribution of investment risk levels is relatively even, implying a wide variety of
risk preferences among investors. Similarly, investment goals are diverse, with wealth
accumulation being the most common, followed by education and retirement planning.
The dataset showcases varying levels of investment knowledge, with higher
frequencies in levels 5 and 3, indicating a solid understanding of investment concepts.
Regarding income, the dataset spans a wide range, from a minimum of $9,104 to a maximum
of $499,649, with an average income of approximately $235,573. The standard deviation
suggests moderate income variability.
To capitalize on these findings, the company could tailor investment
recommendations based on the prevalent education levels, risk preferences, and investment
goals. Strategies could focus on diversifying investment portfolios, promoting conservative
options for risk-averse investors, and considering personalized guidance for those with higher
investment knowledge. Furthermore, the company could offer specialized services for
individuals with Ph.D. degrees and explore opportunities to bridge the gender gap in financial
planning.
In conclusion, leveraging these insights would enable the company to provide more effective
financial advice, enhance customer satisfaction, and establish enduring relationships with
investors by aligning investment strategies with individual profiles and goals.

24 | P a g e

You might also like