4.11 Big Data Questions
i. independent parts of the code can be run in parallel across different devices
Other Questions
1.
2.
3.
4.
5.
a. The data has a large volume, a high velocity, or a wide variety
b. Functional programming
c. It is easy to see which parts of the code are independent of each other, and
independent code can be run in parallel on separate machines
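A minimal Python sketch of the idea in (c), offered as an illustration rather than a model answer. The function names and sample lines are hypothetical. A thread pool stands in for the separate machines a real big data system would use; the point is only that a pure function can be applied to each item independently.

```python
from concurrent.futures import ThreadPoolExecutor


def word_count(line):
    """Pure function: the result depends only on the argument, no shared state."""
    return len(line.split())


def total_words(lines):
    # Each call to word_count is independent of the others, so the calls
    # can be handed to separate workers in any order.
    with ThreadPoolExecutor(max_workers=4) as pool:
        counts = list(pool.map(word_count, lines))
    # Combining the partial results is the only step that needs all of them.
    return sum(counts)


lines = ["big data has volume", "velocity", "and variety"]
print(total_words(lines))
```

Because word_count never reads or writes anything outside its argument, running the calls in parallel gives the same total as running them one after another.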
6.
a. CREATE TABLE Booking (
BookingID INT,
ActName VARCHAR(50),
StageName VARCHAR(50),
Day VARCHAR(10),
StartTime TIME,
PRIMARY KEY (BookingID)
);
b. BookingID
c. ActName; it references the primary key Name in the Act table
d. There is no redundant data, each table has a primary key, all data is atomic, and
there are no partial dependencies, no many-to-many relationships and no non-key dependencies
e. One-to-many from Stage to Booking
GROUP BY Day
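The full query that this GROUP BY clause belongs to is not shown here. As a hedged illustration only, the sketch below uses Python's built-in sqlite3 module to count bookings per day. The COUNT(*) aggregate, the sample rows and the SQLite column types are all assumptions made for the example.

```python
import sqlite3

# In-memory database so the example leaves nothing behind.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE Booking (
        BookingID INTEGER PRIMARY KEY,
        ActName   TEXT,
        StageName TEXT,
        Day       TEXT,
        StartTime TEXT
    )
""")

# Hypothetical sample rows, invented for this illustration.
rows = [
    (1, "Act A", "Main", "Friday", "18:00"),
    (2, "Act B", "Main", "Friday", "20:00"),
    (3, "Act C", "Tent", "Saturday", "19:00"),
]
conn.executemany("INSERT INTO Booking VALUES (?, ?, ?, ?, ?)", rows)

# GROUP BY Day collapses the rows into one result row per distinct Day value.
per_day = conn.execute(
    "SELECT Day, COUNT(*) FROM Booking GROUP BY Day ORDER BY Day"
).fetchall()
print(per_day)  # [('Friday', 2), ('Saturday', 1)]
```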
The data has a wide variety, appearing as different types in structured, unstructured
and semi-structured formats
d. Immutable data cannot change once it has been defined. It is important that the data
stays the same so that it can be grouped and analysed more easily, so a paradigm that
uses immutable data is very useful
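A minimal Python sketch of immutable data, assuming a hypothetical Reading record. A frozen dataclass rejects any attempt to modify it, so an "update" has to produce a new value and leave the original untouched.

```python
from dataclasses import FrozenInstanceError, dataclass, replace


@dataclass(frozen=True)
class Reading:
    """Hypothetical record; frozen=True makes instances immutable."""
    sensor: str
    value: float


original = Reading("s1", 20.5)

try:
    original.value = 21.0  # assigning to a field of a frozen instance fails
    mutated = True
except FrozenInstanceError:
    mutated = False

# replace() builds a new Reading instead of changing the existing one.
updated = replace(original, value=21.0)
print(mutated, original.value, updated.value)  # False 20.5 21.0
```

Since no worker can alter a Reading after it is created, copies of the data can be shared between parallel tasks without one task changing what another sees.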