Professional Documents
Culture Documents
1-Intro To Data Literacy
1-Intro To Data Literacy
Every day there are millions of opportunities to improve people's lives by making better use of data. Are you interested
in disease research, education patterns, industrial efficiency, patient care, or government spending? The opportunities
are endless!
To begin making better use of data, consider an important aspect of data literacy, exploration. The key to successful data
exploration is the formulation of good questions.
Asking why…
When you begin to explore the answer to any question, you will likely find yourself asking follow-up questions. Usually it
is not enough to stop after answering your original question. In the example above, if it turns out that health outcomes
among people with chronic illnesses in the United States vary with dog ownership, the next question to ask is "why."
The "5 Whys" technique, originally developed by Toyota Motors founder Sakichi Toyoda, proposes asking "why" of a
problem that's been identified, and then continuing to ask "why" for each answer or explanation given. While the
primary goal of the technique is to determine the root cause of a defect so as to be able to fix it, it can be used to dig
down into the causes of any outcome.
The prominent information technologist Stephen Few identified a list of traits that help
people to work effectively with data, traits that he calls "aptitudes and attitudes."
Interest, Curiosity, and Imagination –
Are you genuinely passionate about a topic? Does this passion engage your mind? These are examples of INTEREST.
Do you enjoy figuring things out, wonder how things work, and crave information? These are examples of CURIOSITY.
Do you have a knack for coming up with new things to try, over and over, until you achieve your goal? This is an example
of IMAGINATION.
Self-Motivation –
Are you driven to explore and to understand? Do you pursue new questions without hesitation? Do you go beyond the
scope of what's expected without waiting to be told what to do? These are examples of SELF-MOTIVATION.
Are you willing to accept what you find? Are you willing to admit when you're wrong? These are examples of OPEN-
MINDEDNESS AND FLEXIBILITY.
Do you know how to prioritize questions in order to see the difference between a worthwhile pursuits versus one that
would require a great deal of time but yield few results? This is an example of an AWARENESS of what's worthwhile.
Can you spot patterns that are meaningful and ignore ones that are not? This is an example of PATTERN SPOTTING.
Even when you're confident about an answer or a result, do you look at it from a different perspective to possibly learn
something new? This is an example of having HEALTHY SKEPTICISM.
Methodicalness –
Do you learn an efficient set of steps and then repeat them according to a proven method? This is an example of being
METHODICAL.
Are you able to see "the big picture" by putting a collection of parts together to form a whole? That's an example of an
ABILITY TO SYNTHESIZE.
Even if you're new to working with data, do you make sure you become familiar with the facts so you don't jump to
conclusions that may be inaccurate? That's an example of developing FAMILIARITY WITH THE DATA.
The skills of data analysis can be developed through training, experience, and lots of practice.
Data fundamentals…
You know that data literacy is the ability to explore, understand, and communicate with data. But what exactly is "data"?
A collection of data is a collection of facts. Even more specifically, consider this expanded definition. Jeffrey Leek, a data
scientist working as a professor at Johns Hopkins Bloomberg School of Public Health, adapted this expanded definition
from Wikipedia:
Set of items –
Sometimes called the population, this is the group of objects you are interested in.
Variable –
A measurement, property, or characteristic of an item that may vary or change. (This is opposed to a constant
measurement, such as pi, that does not vary.)
Qualitative variable –
A qualitative variable describes qualities or characteristics, such as country of origin, gender, name, or hair color.
Quantitative variable –