
What is statistics?

Collecting, analyzing, interpreting, and presenting data.

What is data?
Any collection of info.

All data have two components: individuals and variables.

Individuals: the people or objects that are the focus of our data study. Who/what we are gathering data on.
Variables: the characteristics of the individuals that are being measured or recorded for the study. If the individuals are who the study is about, then the variables represent what about those individuals we are interested in studying.

Ex: conduct a study where we examine every building on the USF campus and record the year it was built.
Individuals: the buildings on the USF campus
Variable: the year built

Data can be any collection of info, but variables in a dataset come in two broad types: categorical variables and quantitative variables.
The type of variable we have in our dataset opens and closes doors to the various analytic techniques we can use.

Categorical variables: serve as names, labels, or categories for objects and things. They represent non-numerical data; when the data is numerical, the numbers themselves act as labels and arithmetic cannot be performed in a meaningful way.

Quantitative variables: numbers that represent quantities. These are typically measurements with clearly defined units, and arithmetic can be performed and makes sense.

Ex: we record the following for each student in this classroom.
City of birth: categorical
Height: quantitative (numerical)
Age: quantitative, defined unit, represents a quantity (years)
Zip code: categorical, simple labels for places
Rate the movie: categorical, the numbers are simply labels
Time woke up: quantitative (time)
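The classroom example can be written down as a small Python sketch to show why the variable type matters. This is only an illustration: the function name `arithmetic_is_meaningful` and the sample zip codes and ages are hypothetical and not part of the notes.

```python
# Variable types from the classroom example, as classified in the notes.
variable_types = {
    "city of birth": "categorical",
    "height": "quantitative",
    "age": "quantitative",
    "zip code": "categorical",
    "rate the movie": "categorical",
    "time woke up": "quantitative",
}


def arithmetic_is_meaningful(variable: str) -> bool:
    """Arithmetic only makes sense for quantitative variables."""
    return variable_types[variable] == "quantitative"


# Hypothetical sample values (assumptions, not data from the notes).
zip_codes = ["33620", "33612", "33647"]  # digits, but they only act as labels
ages = [19, 20, 22]                      # years, a real quantity

# Averaging ages is meaningful; averaging zip codes would not be.
mean_age = sum(ages) / len(ages)

for name in ("age", "zip code"):
    print(name, "-> arithmetic meaningful:", arithmetic_is_meaningful(name))
```

Storing the zip codes as strings is a deliberate choice in this sketch: it keeps anyone from averaging them by accident.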
Majors
Accounting Examining categories variables. It is easier because their non-numerical nature he
Accounting
Accounting imgaine we worked for the math department at USF. The chair of the math depart
Accounting who take math 106 so we can better design the course to meet their needs. So he
Accounting we examine the majors of past math 106 stduents and record them in the data
Communications
Computer Science What sort of analysis can we perform to beter understand the majors of 106 math
Computer Science
Computer Science one thing we always do with dataset is to sort it.
Economics
Entrepreneur Sorting a dataset:
Entrepreneur 1. select the entire dataset including headers
Entrepreneur 2. click "data" tab up above
Entrepreneur 3. select "sort"
Entrepreneur 4. select specific options we want
Entrepreneur
Environmental Studies by sorting the data, more clear which categories appear a lot or a little.
Environmental Studies we can go further however and compute how many times a specific category appe
Finance
Finance Frequency: of a categories dataset represents how many times that category appe
Finance
Finance example: compute the frequency of the category 'Marketing'
Finance '17 since it appear 17 times
Finance
Finance there are few other typoes of frequncy that can shed a diff light on how many time
Finance these are percent frequency and relative frequency
Hospitality
Hospitality percent frequency: represent what perecentages of the total dataset that the parti
Hospitality to compute it, we simply take the frequency of the category
International Bus the answer will be in %
International Bus
International Bus realtive frequency: what the ratio of the total of the total dataset a particular cate
International Bus it is exactly the same as %, only this value should always be b
International Bus
International Bus ex: compute the frequency, presetn frequency, and relative frequency of 'm
International Bus
International Bus Frequency= 17
International Bus Precent frequency= 16.19% 17/105
International Bus Relative Frequency= 0.161905
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Management
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Marketing
Media Studies
Psychology
Psychology
Psychology
Undeclared Art
Undeclared Art
Undeclared Business
Undeclared Business
Undeclared Business
Undeclared Business
Undeclared Business
Undeclared Business
Undeclared Business
Undeclared Business
Undeclared Business
Undeclared Business
Undeclared Science
Undeclared Science
Undeclared Science
Undeclared Science
eir non-numerical nature heavy limits the amount of analysis tech we have avaliable.

The chair of the math department wants to learn more about the majora of the students
e to meet their needs. So he has asked us woith learning more about this topic.
d record them in the data

tand the majors of 106 math students?

ar a lot or a little.
mes a specific category appear in the dataset: frequency

any times that category appears

a diff light on how many times cate appear in a dataset

he total dataset that the particular categories made up.


e frequency of the category and divide by the total amount of data in the datset.

otal dataset a particular categories makes up.


this value should always be be decimal, not %

, and relative frequency of 'marketing'
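The sorting and frequency steps can also be sketched outside the spreadsheet. The following is a minimal Python version, not the spreadsheet procedure from the notes: it swaps the "Data" > "Sort" menu for Python's built-in `sorted` and tallies with `collections.Counter`. The per-category counts are tallied from the Majors column; the names `counts`, `majors`, and `summarize` are hypothetical.

```python
from collections import Counter

# Per-category counts tallied from the Majors column (105 entries in total).
counts = {
    "Accounting": 5, "Communications": 1, "Computer Science": 3,
    "Economics": 1, "Entrepreneur": 6, "Environmental Studies": 2,
    "Finance": 8, "Hospitality": 3, "International Bus": 10,
    "Management": 29, "Marketing": 17, "Media Studies": 1,
    "Psychology": 3, "Undeclared Art": 2, "Undeclared Business": 10,
    "Undeclared Science": 4,
}

# Rebuild the raw column (one entry per student) and sort it alphabetically.
majors = sorted(major for major, n in counts.items() for _ in range(n))

# Frequency: how many times each category appears.
frequency = Counter(majors)
total = len(majors)


def summarize(category):
    """Return (frequency, percent frequency, relative frequency)."""
    freq = frequency[category]
    relative = freq / total      # decimal ratio of the whole dataset
    percent = relative * 100     # same ratio, expressed as a %
    return freq, percent, relative


freq, percent, relative = summarize("Marketing")
print(f"Frequency = {freq}")                      # Frequency = 17
print(f"Percent frequency = {percent:.2f}%")      # Percent frequency = 16.19%
print(f"Relative frequency = {relative:.6f}")     # Relative frequency = 0.161905
```

Calling `summarize` on any other category from the column gives its three frequencies in the same way.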
