Professional Documents
Culture Documents
Lab Manual On Excel Revised
Lab Manual On Excel Revised
The following is the format you have to maintain in all the experiments
1) Aim: Write the aim of the experiment. For example, the aim of this experiment is
toe understand the basic functions of excel
2) Techniques used: Function such as Sum, filter
3) Procedure followed: Write the steps followed in each of the exercises in this
experiment
4) Results: Paste the results one below the other under each exercise
5) Post it in MOODLE
Microsoft Excel is considered the industry standard piece of software in data analysis. There are
two ways of calculation: using 1) Formulas and 2) Functions
Formulas are expressions that operate on values in range of cells or a cell. For example, we have
values in column A as follows:
1) We wish to find the sum of the range of the values from A1 to A10. Then the formula is
=A1+A2+A3 …… +A10
Functions in excel are predefined formulas. These functions eliminate laborious tasks involving
manual entry of formulas while giving them commonly used jargons. For example, to find the
sum of the weight of the students given from values A1 through A10, the function
=SUM(A1:A10). The <SUM> function sums up all the values from A1 to A10.
In order to execute this function, choose a cell and insert the function =sum(B2:B11)
Qn1: Find the sum of the weight of the students given in cells B2 through B11. Use the file
named: Basics of excel circulated.
Step 4: Press OK
2) Similarly, you may add values through rows. Functions =SUM(B13:K13) B13 through
K13 will add values lying in row 13 and columns from B to K.
Qn2: Find the sum of the weight of the students given in cells B13 through K13. Use the file
named: Basics of excel circulated.
Step 4: Press OK
Qn3: Add 4 rows and 5 columns in the excel you have created in questions 1 and 2. Use the file
named: Basics of excel circulated.
4) Using filters
In large data, you may not require to look or analyse or examine every single row values
at the same time. In other words, you may be interested in looking at only selected
values based on a criteria. In this case, filters allow you to pare down your data and
allow you to see only those data that is of interest to you. Filters can be added to each
column in each data.
Filters are added by clicking the Data tab and selecting “Filter”. A down-arrow pops in
each column. Clicking the arrow next to the column headers allow you to organise the
data in ascending or descending order as well as which specific rows you want to show.
Or you want to organise the data based on a specific criteria.
Qn4. Please organise the data in ascending order using ‘name’. Use the file named: BBA RIT
Name list 2020-21.
Organise it using “male” as the filter and then “female” as the filter
If you wish to transpose the data presented into rows and columns (i.e., rows -> columns
and columns -> rows), doing it manually would consume lot to copy and paste each
individual header. Excel allows us to do this in a simple way using the transpose option.
Screenshot 1
Screenshot 2
Qn. 6. Transpose Programm, Register No. and name into rows. Use the file named: BBA RIT
Name list 2020-21.
You may be interested in splitting up data in one cell into two different cells. Or you
want to pull out someone’s company name through their email address. Or separate
someone’s full name into a first and last name for your email marketing
8) Using formulas
We can add, subtract, multiply and divide using the ‘=’ function. Parenthesis is used to
ensure certain calculations are done first.
If we wish to add the values in columns B2 through B11 and C2 through C11
= (B2:B11+C2:C11)
Qn. 9: Find the average of the values in columns B, C, D, E, F and G using the excel
function “average”. Use the file named: Basics of excel circulated
Conditional formatting allows you to change the colour of cell values based on
information within the cell. For example, you may be interested in flagging students who
are above 70 kgs in each section.
Step1: Highlight the group of cells you want to use conditional formatting
Step2: Choose ‘conditional formatting’ from the Home menu
Step3: Select your logic from the dropdown menu. You may also create your own rule if
the logic is different from the default set of items in the menu
Step4: A window will popup prompting you to provide more information on the ’rule’
Step5: Select OK
Qn10: Select the students who are above 70 kgs in each section. Use the file named:
Basics of excel circulated
If we wish to brand those who are more than 75 kgs as obese, then use the logical
function, IF THEN
Qn.11: Declare all the students as “obese” if they weigh more than 75kgs and others as
“not obese”. Use the file named: Basics of excel circulated
Qn12: Count the number of obese and non obese students in all the sections. Use the file
named: Basics of excel circulated
Use the functions MAX and MIN to list the maximum and minimum values in the
dataset. This is used only with numerical data
The following is the format you have to maintain in all the experiments
1) Aim: Write the aim of the experiment. For example, the aim of this experiment is to
understand how descriptive statistics is run using Excel
2) Techniques used: Function such as Frequency distribution, measures of central
tendency and measures of dispersion
3) Procedure followed: Write the steps followed in each of the exercises in this
experiment
4) Results: Paste the results one below the other under each exercise.
5) Post it in MOODLE the excel file with the steps involved in doing each technique
and the output.
As you learnt in the theory, frequency distribution tables gives you a snapshot of how the
data is spread out in the distribution. It is a summary table that shows the frequency of
each value or the number of times each value is repeated in a distribution or data.
The aim is to get the frequency of values in a data set. There are different ways of
constructing the frequency distribution table using excel. Let us construct using the
COUNTIFS function. This function is executed as follows in order to get the frequency
table
Step1: Use the data in the second sheet of the excel file named, “Basics of excel
circulated
Step2: Set the class intervals as seen from columns H5 to H11
Step2: Insert =COUNTIFS($A$5:$A$14,">=45",$A$5:$A$14,"<50"). Define the range
of the values to be counted. For example, against 45 to 50, enter in double quotes >=45
as the lower range and <50 as the upper range. Similarly, you may enter the specified
ranges in each class intervals
Step 3: Highlight the frequency distribution table > right click > Choose format > Click
border > Choose the desired border
(Note: The dollar sign is inserted so that the cell value does not change even if the drag
through the rows down. You may be aware the ‘row’ position changes if we drag down
the rows).
2) Descriptive Statistics
Step 1: Open the data labeled Fund Returns in the third worksheet named, ‘Experiemnt 2
– Fund returns’
Step2: From the menu choose Data > Data Analysis > Descriptive Statistics > Ok
(If you don’t see Data Analysis under Data, you must Add-in the analysis Toolpak
option)
Step3: In the Descriptive Statistics dialog box, click on the box next to Input Range, then select
the data. If you included the fund names when you highlighted the data, make sure you click on
the option next to Labels in First Row. Click the box in front of Summary Statistics. Then click
OK
Step4: The output table is difficult to read. Highlight the data and choose
Home>Format>Column>Autofit selection. Note that Excel provides numerous descriptive
statistics. I have put the measures of CT in boldface.
Excel offers a basket of charts and graphs to visually present the data. While the number
of charges and graphs include a huge library, it is important to select a chart type that best
fits with your objective.
Line chart
To create line chart, follow the steps on fund returns data
Step1: Select the range B3:C13 that contains data on returns on metals and income
Step2: on the insert tab, in the charts group, click the Line symbol
Step3: Click line with markers. Use this chart when we intend to show trends over time
such years, months and days or categories.
Step4: A new menu design will appear after the line chart is created.
Step12. Modify the titles of the axis by double clicking inside the boxes
Step13: You may also try changing the chart layouts
Bar chart
Using the steps you followed for line charts, please construct Bar chart.
Area chart
Using the steps you followed for line charts, please construct Area chart.
Pie chart
Using the steps you followed for line charts, please construct Pie chart.
Experiment 4
Hypothesis testing
Aim: To test for significant difference between a sample mean and an assumed and hypothesized mean
Techniques used: One sample t test. (Please write why you use one sample t test)
Write the null (H0) and Alternate Hypothesis (H1)
Tool used:Excel
Steps involved:
It was reported that Indians on an average spent 5.33 hours of screen time before the pandemic
related lockdown. However, lockdown had mandated Work-from-home and Study-from-home
thus increasing smartphone screen time. In order to test this, a survey was conducted among 30
households and the data are furnished from cells A2 to A31. Using excel, find if there is any
signficanct difference between the sample mean and hypthesised mean
Step 2: Select Formulas > Insert Function > Z.Test. This command returns the p-value associated with a
right-tailed test
Step 3: Supply the following three arguments in the dialog box:
f) Also conclude if the smartphone screen time during the pandemic lockdown period has
increased compared to the pre-lockdown period
Experiment 5
Hypothesis testing 2
Aim: To test for significant difference between two independent sample means.
Technique used: Two independent sample test. Please write why you use independent sample
‘z’ or ‘t’ test
Problem statement: A survey was conducted among 14 males and 16 females on the number of
hours of smartphone scree time spent. The data on males is given through A2 and A15 and
females in B2:B16. Test if the screen time spent is the same or different between males and
females using p-value approach.
Steps involved
(In this case, you have been asked to test for differences using the p-value approach.
Please examine the value given as P(T<=t) one tail. If this values is greater than the
significance value (in our problem it is 0.5), then accept the null hypothesis. If it is lower
than the 0.05, the reject the null hypothesis.)
Experiment 6
Hypothesis testing 3
Aim: To test for significant difference between two dependent sample means.
Technique used: Two dependent sample test. Please write why you use dependent sample ‘z’ or
‘t’ test
Problem statement: The nutritionist wants to use the data from the 40 Starbucks cardholder in
order to determinne if the posting of caloric information has reduced the intake of average food
calories. Please test using p-value approach at 5% significance level.
Steps involved
Step1:
(In this case, you have been asked to test for differences using the p-value approach.
Please examine the value given as P(T<=t) one tail. If this values is greater than the
significance value (in our problem it is 0.5), then accept the null hypothesis. If it is lower
than the 0.05, the reject the null hypothesis.)
Experiment 7
Hypothesis testing 4
Aim: To test for significant difference between more than independent sample means.
Technique used: One way Analysis of Variance. Please write the assumptions behind the use of
One Way ANOVA
Problem statement: An economic association wanted to determine if the salary drawn by those
who graduated in Economics, Medicine and History are the same. The data were collected from
9 economics graduates, 7 medicine graduates and 9 history graduates. Please test using p-value
approach at 5% significance level.
Steps involved
Step1:
(In this case, you have been asked to test for differences using the p-value approach.
Excel produces the ‘p’ value as one of the outputs. You may notice that you get two
tables: 1) Summary which contains the descriptive statistics and 2) ANOVA. The p-
value is shown under ANOVA table. If the ‘p’ value is less than the test significance
level, in our case is 0.05, then null hypothesis is rejected. If this values is greater than the
significance value (in our problem it is 0.05), then accept the null hypothesis.
8) Write the conclusion if the salary difference across the three streams or not
Experiment 8
Correlation
Aim: To test for relationship between two variables measured using numeric data
Technique used: Karl Pearson Correlation. (Please write the assumptions behind the use of
Karl Pearson Correlation)
Steps involved
Step1:
Regression
Aim: To test for independent-dependent relationship between two variables measured using
numeric data
Technique used: Simple Regression. (Please write the assumptions behind the use of Simple
Regression)
Steps involved
Step1:
Technique used: Chisquare. (Please write the assumptions behind the use of Simple
Regression)
Problem statement: A company has 10,000 pieces of furniture. About one tenth of them were
distributed over four halls. Find out if the distribution of the furniture across the halls are same
or different.
Steps involved
Step1:
1) Open the data labeled Furniture_hall. This file contains the observed values
2) Determine the expected values using the formula (Column total * Row Total)/Total
sample size for each observed value
3) Use the function “=chisquare.test” to determine the chi-square value. Enter the “actual
range’ and the ‘expected range’
4) Click ok