Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

Key

Name: ______________________________________________________________________________ Date:__________ Period:_________


OnRamps Computer Science
Assignment: Big Data Concepts
b 1.
____ The difference between useful data and usable data is that
a. useful data can be collected, while usable data cannot.
b. usable data is anything that can be collected, while useful data is that data which can be analyzed
to solve a problem.
c. useful data may not serve a purpose, while usable data always serves a purpose.
d. usable data is used in school, and useful data is data used in real-life situations.
b 2.
____ A radar gun that produces seemingly random readings is producing
a. no data c. useful data
b. usable data d. extra revenue for the city
a 3.
____ A word cloud is an example of which type of data analysis?
a. summarization c. regression
b. cluster d. outlier detection
c 4.
____ What is the term used to describe the process of taking all of the metric values in a particular field
and proportionally remapping them onto a scale of 0 to 1?
a. data scrubbing c. normalizing data
b. data storage d. data persistence
b 5.
____ Once something goes online, it never really goes away. We refer to this as
a. data consistency c. digital exhaust
b. data persistence d. "What happens in Vegas stays in Vegas."
c 6.
____ Looking for relationships in data is called
a. data scrubbing c. data mining
b. data storage d. online dating
a 7.
____ The Rolling Stones observed that "you can't always get what you want." However, sometimes "you get
what you want" by giving something in return. We often must consider whether to surrender
personal data to get what we want. This is a choice between __________ and _________.
a. privacy and utility c. scraping and spidering
b. freeware and paid software d. wearing clothes and being naked in the sunlight

c 8.
____ Going through a database to fix formatting issues, correct inaccurate entries, etc., is called.
a. data persistence c. data scrubbing
b. data exhaust d. falsifying data
f 9.
____ (T/F) Usable data is always useful data.
t 10.
____ (T/F) Useful data is always usable data.

#s 12-21: Matching
d 11. digital exhaust
____ a. a quantitative (numeric) measure of data
f 12. deepfake
____ b. a label used to describe and categorize metrics ; non-numeric
____
a 13. metric c. crowdsourcing gets you close to the right answer
c 14. central-limit theorem
____ d. data left behind as we use the internet
i 15. cluster analysis
____ e. data point that is very different from most other data points
b 16. dimension
____ f. use artificial intelligence to create visual or audio of fake events
g 17. filter bubble
____ g. search algorithm only lists what it thinks you agree with
e 18. outlier
____ h. an artistic visual presentation of data
h 19. viz
____ i. finding groups of data that are similar
j 20. regression
____ j. using data trends to predict how one factor affects another
Name: ______________________________________________________________________________ Date:__________ Period:_________
OnRamps Computer Science
Assignment: Big Data Concepts

volume
21. Big Data sets are defined by high _____________________, velocity
high ________________________, variety
and high _____________________.

Machine learning
22. _________________________________________ is when a computer uses data to craft its own behavior. Two types:

Supervised learning
a. _______________________________________________ is when the computer is given inputs (such as baseball
signs) and outputs and it develops an algorithm to make predictions (such as when a runner will try
to steal a base).
Unsupervised learning
b. _______________________________________________ is when the outputs are unknown so the computer is
given data and asked to discover patterns and relationships in the data.
23. Circle all of the following which are file types used to store large amounts of data:

.doc .xls .jpg .json csv .html .odt

24. When using a search engine, we often simply type in terms we want included in the search.
" " [quotes]
a. What symbolism is used to indicate the search engine should find an exact term? _____________________
b. What symbol is used to indicate the search engine should exclude a term? _________________________
- [dash or minus sign]
25. Identify the different types of data analysis or visualization represented below.
Cluster analysis
Linear regression Jitter

Heat map
Outlier

Heat map Radar chart Automated summarization

Note: This review is only meant to cover vocabulary and basic concepts. You may also see questions from the
readings, practice quizzes, and presentations included in the Big Data unit.

You might also like