Data Processing Task

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 18

DATAPROCESSING TASK

- the process of converting data (itself is meaningless data) into useful


information.
- it is done manually.

For example:
When data is collected and translated into usable information, data processing occurs. Usually performed
by a data scientist or team of data scientists, it is important for data processing to be done correctly as not to
negatively affect the end product, or data output.
Data processing starts with data in its raw form and converts it into a more readable format (graphs, documents,
etc.), giving it the form and context necessary to be interpreted by computers and utilized by employees
throughout an organization.
6 STAGE OF DATA PROCESSING
1. Data collection
Collecting data is the first step in data processing. It is a stage of gathering raw facts
from the environment and preparing it for the input process.

- This picture is an example of manual data


collection, the input data is entered by the operator
manually using traditional input devices like mouse,
keyboard, touch screen, graphic tablets, tracker balls
for typing in the data as text and figures using
particular software applications like Word, Excel, and
PowerPoint.
2.Data preparation
Once the data is collected, it then enters the data preparation stage. Data preparation, often referred to as “pre-
processing” is the stage at which raw data is cleaned up and organized for the following stage of data processing. It is a
process of constructing a dataset of data from different sources for future use in processing step of cycle.

- The picture shows the steps in data preparation. The first step
of a data preparation pipeline is to gather data from various
sources and locations. Before any processing is done, we wish
to discover what the data is about. At this stage, we understand
the data within the context of business goals. Visualization of
the data is also helpful here. The next stage is to cleanse the
data of missing values and invalid values. We also reformat data
to standard forms. Next we transform the data for a specific
outcome or audience. We can enrich data by merging different
datasets to enable richer insights. Finally, we store the data or
directly send it out for analytics.
3. Data input

It refers to supply of data for processing. It is  the stage where verified data is coded or converted into
machine readable form so that it can be processed through an application.  Most data need to follow a
formal and strict syntax since a great deal of processing power is required to breakdown the complex data
at this stage.

- It shows devices use in this stage which is data


input. Data entry is done through the use of a
keyboard, scanner, or data entry from an existing
source. 
4.PROCESSING

- it is stage that the data is subjected to various means and methods of powerful technical
manipulations using Machine Learning and Artificial Intelligence algorithms to generate an output or
interpretation about the data. The process may be made up of multiple threads of execution that
simultaneously execute instructions, depending on the type of data.

- CPU or Central Processing Unit plays a key


role at this stage. CPU collects all of the raw
data that you’ve inputted into the computer
and it converts the data into useful
information by carrying out software
instructions.
5. Data output
The output stage is where non-data scientists can finally use the data. It is translated, readable, and frequently
takes the shape of graphs, videos, photos, plain text, and other formats. Output need to be interpreted so that it can
provide meaningful information that will guide future decisions of the company.

- This picture shows output devices . The most commonly


used output devices are the screen, which is also called a
monitor or VDU and printer.
6. Data storage
Storage is the ultimate stage of data processing. After the data has been analyzed, it is saved for future use.
While some information will be useful right away, most of it will be useful later. Furthermore, correctly kept data
is required to comply with data protection legislation such as GDPR. When data is correctly saved, individuals of
the organization may access it quickly and readily when they need it.

- These are examples of storage devices. Where we


can store data, instruction and information in
permanent memory for future reference.
EXAMPLE OF QUANTITATIVE DATA

1. DISTANCE IN KILOMETERS

kilometre (km), also spelled kilometer, unit of length equal


to 1,000 metres and the equivalent of 0.6214 mile (see metric
system).
2.Number of weeks in a year

A year has 52 weeks plus 1 day.

One calendar year has 365 days, which are divided


into 7-day weeks. To know how many weeks there
are in a year, you just divide the number of days
there are in a year (365) by how many days there are
in a week (7).

The result will be the average number of weeks


there are in a year, which is approximately 52.143,
or 52 weeks plus 1 day.
3.Revenue in dollars
Sales dollars and revenue dollars are comparative
terms associated with a business. They are used to
indicate profit from business transactions. All sales
dollars are revenue, but not all revenues are
necessarily from sales.
4.Weight in kilograms

There are approximately 2.2 pounds in 1


kilogram. To convert kilograms to pounds,
multiply your figure by 2.2. If you wish to
convert pounds to kilograms, divide by 2.2.
5.Length in centimeters
To measure centimeters, use a ruler with the side
marked either cm or mm. Align the edge of the object
with the first centimeter line on the ruler, then find the
length in whole centimeters, or the larger numbers on
the ruler.
EXAMPLE OF QUALITATIVE DATA

1.DOCUMENTS
Documents can come in a variety of forms, whether
they are public records (handbooks, policy outlines,
plans, curriculums), personal documents (calendars,
emails, logs), or physical artifacts (handbooks, flyers,
posters, agendas)
2.CASE STUDIES
Case studies are studies or investigations of a particular person, or
group of people. 

3.AUDIO RECORDING
Audio recordings can be derived from recordings of in depth
interviews, focus groups, or anything recorded during
observational studies. They can also be content such as
podcasts, newscasts, speeches, or other recorded content. 
4.PHOTOGRAPHS
Photographs are any images that are captured by camera.
These can be photographs taken while in the field, photos
taken of a research subject, or of their work or living space, or
any other artifacts related to the subject of your research. 

5.VIDEO RECORDING
Video recordings can include footage taken from in depth
interviews, focus groups, or observational studies. They can
also be derived from online video content such as YouTube
videos, films, news reports, or videos of events.
ACCURACY
Accuracy is a crucial data quality characteristic because inaccurate information can cause significant
problems with severe consequences.
Example: In the US database, dates follow the MM/DD/YYYY format, whereas, in the EU database and
other countries of the world, it's DD/MM/YYYY.

COMPLETENESS
Completeness” refers to how comprehensive the information is. When looking at data completeness, think about
whether all of the data you need is available; you might need a customer’s first and last name, but the middle initial may
be optional.
Example: a customer's first name and last name are mandatory but middle name is optional; so a record can be
considered complete even if a middle name is not available.

RELIABILITY
Reliability is a vital data quality characteristic. When pieces of information contradict themselves, you can’t trust
the data. You could make a mistake that could cost your firm money and reputational damage. 

Example: from the healthcare field; if a patient’s birthday is January 1, 1970 in one system, yet it’s June 13, 1973 in
another, the information is unreliable.. 
RELEVANCE
When you’re looking at data quality characteristics, relevance comes into play because there has to be a good
reason as to why you’re collecting this information in the first place. You must consider whether you really need this
information, or whether you’re collecting it just for the sake of it.

Example:  Your smartphone features a number of sensors: GPS, a compass (magneto meter), an
accelerometer, a microphone, a light sensor and maybe even a fingerprint scanner.

TIMELINESS
Timeliness, as the name implies, refers to how up to date information is. If it was gathered in the past hour,
then it’s timely – unless new information has come in that renders previous information useless. 

Example: timeliness is the degree to which data delivery from a source system conforms to a schedule for
delivery. In large data assets, data is made available once processing is complete.
THANK YOU

You might also like