Professional Documents
Culture Documents
Data Lab Student Guide Part 1
Data Lab Student Guide Part 1
Date:
ACTIVITY 1
2. In your group, find the privacy policy for your given site and identify two or three pieces of
information that are collected for users of the site.
don't be a jerk and don't try to take other people work and we are good,
for youtube
File Types
3. What does the word delimited mean? Why is this necessary when talking about data files?
fixed boundaries
4. Given a data file, how can you determine the type of data that might be contained in a specific
column?
you could tell because some of them tells you
Posing Questions
5. On your own, identify two broad areas of interest that you have. Examples include health, sports,
etc.
music, games
6. For each of the two areas of interest you identified, determine one question to which you might
want to know the answer. These should be questions that are not easily answered with an online
search. Two examples of questions that are easy to answer with an online search are who won the
1998 Superbowl (The Denver Broncos), and what is the height of the world’s tallest building (at the
time of publication, Burj Khalifa, 2717 ft/828 m).
what are the best songs for the 2000 to now, what games have been able
to stand the test of time
7. Are there data sets that might apply to one of the two questions that you refined in question 6?
Find two different data sets that might be used to answer one of your questions. List the site and
any search criteria used to find the data sets. If not, consider revising one of your questions so that
you can identify an applicable data set.
https://www.kaggle.com/insiyeah/musicfeatures and https://www.kaggle.
com/pieca111/music-artists-popularity
8. How many records are in each of the data sets you identified? Describe one benefit of using a
larger data set with more records.
over 100 its really good because you know that they really did it instead of
just faking it
Check Your Understanding
9. Do you know how the data in the data set you identified was collected? If so, please describe. If
not, describe one way that the data might have been collected.
it was collected data was collected through the python
10. In your opinion, are there situations where the benefit provided from the data collected is worth
the risk to personal privacy? Why or why not?
nope because that mess with my freedom, what I do in my free time is
only for me and me alone to know
Name:
Date:
ACTIVITY 2
Designing and
Implementing a Custom
Class
You have likely created a class from a given description or specification several times. This is an
important skill, but equally important is the ability to determine essential information to include
when creating a class. What is “essential” can vary based on perspective, or can be determined
by a question that is being asked or a problem that is attempted to be solved. This activity will
give you an opportunity to practice making this type of determination. Consider the following
selection.
https://www.kaggle.com/crawford/80-cereals/version/2#cereal.csv
1. Each row of the table represents an instance of an object. What is the best name for that
object?
Cereal
2. Your teacher will provide you with a question to answer related to the above table. Write the
question here:
What is the average rating of cereals with a sodium level below 150 mg
3. You will now design a class to describe that object and answer your given question. Write the
class header:
public class Cereal {
5. List the data types and names you will use for the fields.
public String name;
public int sodium;
public double rating;
6. Create a new Java file named Cereal.java and implement the class described above. Your
class should contain all necessary instance variables, constructors, accessors methods, and a
toString method.
7. Write a main method to test your Cereal class by implementing multiple instances of
Cereal objects. This program should include lines instantiating Cereal objects.
9. Identify one additional question that can be answered from the given data that you are not
able to answer based on your implementation of Cereal.java.
how much calories does captain crunch haves?
10. What modification could you make in order to answer this question?
public calories;