Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

COLLAGE OF INFORMATICS

DEPARTRMENT OF INFORMATION TECHNOLOGY

Course Name:Big Data Analytic

Course code:5112

Prepared by

1.Roba Halkano

Submitted to Dr.Kula.K(PHD)

Bule Hora:Ethiopia

Aprl 6, 2022
Define structured data and unstructured data
Structured data
 Structured data is data that adheres to a pre-defined data model and is therefore
straightforward to analyse. Structured data conforms to a tabular format with
relationship between the different rows and columns. Common examples of structured
data are Excel files or SQL databases.
 Think of data that fits neatly within fixed fields and columns in relational
databases and spreadsheets. Examples of structured data include names, dates,
addresses, credit card numbers, stock information, geolocation, and more.
Structured data is highly organized and easily understood by machine language
 Structured data consists of information already managed by the organization in
databases and spreadsheets; it is frequently numeric in nature.
 The most common format for structured data is text and numbers.
 Structured data has been defined beforehand in a data model.
 Structured data is clearly defined types of data in a structure.

Unstructured data
 Unstructured data is information that is unorganized and does not fall into a
predetermined model or format.
 Unstructured simply means that it is datasets (typical large collections of files) that
aren't stored in a structured database format. Unstructured data has an internal
structure, but it's not predefined through data models. It might be human generated, or
machine generated in a textual or a non-textual format.
 In the modern world of big data, unstructured data is the most abundant. It’s so prolific
because unstructured data could be anything: media, imaging, audio, sensor data, text
data, and much more. Unstructured simply means that it is datasets (typical large
collections of files) that aren’t stored in a structured database format. Unstructured data
has an internal structure, but it’s not predefined through data models. It might be human
generated, or machine generated in a textual or a non-textual format.
A.Structured vs Unstructured Data: Differences

 Structured data is clearly defined and searchable types of data, while unstructured data is
usually stored in its native format.

 Structured data is quantitative, while unstructured data is qualitative.

 Structured data is often stored in data warehouses, while unstructured data is stored in
data lakes.

 Structured data is easy to search and analyze, while unstructured data requires more work
to process and understand.

 Structured data exists in predefined formats, while unstructured data is in a variety of


formats.

 Structured data is clearly defined and searchable types of data, while unstructured data is
usually stored in its native format.

 Structured data is quantitative, while unstructured data is qualitative.

 Structured data is often stored in data warehouses, while unstructured data is stored in
data lakes.

 Structured data has a pre-defined data model,Unstructured data does not have a pre-
defined data model

 Structured data is highly specific and is stored in a predefined format, where unstructured
data is a conglomeration of many varied types of data that are stored in their native
formats.

Structured data use case examples

Online booking. Different hotel booking and ticket reservation services leverage the advantages
of the pre-defined data model as all booking data such as dates, prices, destinations, etc. fit into a
standard data structure with rows and columns.

ATMs. Any ATM is a great example of how relational databases and structured data work. All
the actions a user can do follow a pre-defined model.
Inventory control systems. There are lots of variants of inventory control systems companies
use, but they all rely on a highly organized environment of relational databases.

Banking and accounting. Different companies and banks must process and record huge amounts
of financial transactions. Consequently, they make use of traditional database management
systems to keep structured data in place.

Unstructured data use case examples


Sound recognition. Call centers use speech recognition to identify customers and collect
information about their queries and emotions.

Image recognition. Online retailers take advantage of image recognition so that customers can
shop from their phones by posting a photo of the desired item.

Text analytics. Manufacturers make use of advanced text analytics to examine warranty claims
from customers and dealers and elicit specific items of important information for further
clustering and processing.

Chatbots. Using natural language processing (NLP) for text analysis, chatbots help different
companies boost customer satisfaction from their services. Depending on the question input,
customers are routed to the corresponding representatives that would provide comprehensive
answers.
B.Structured and Unstructured Data Similarities

Structured and unstructured data are similar in some ways.Let,s take a look at some
of them.

 They both support text and number,though structured data tends to be number-
heavier while unstructured data contains more text.

 Structured and unstructured data system are Scalable.

 Both support Cloud deployment.

 They offer valuable insights to businesses.

 Structured and unstructured data both have human-generated data and


computer-generated data as data sources.

 Both can exist n HTML formats.web traffic data like Gooogle analytic exists
online as an HTML files, while social media posts also exist in the same format.

You might also like