Download as pdf or txt
Download as pdf or txt
You are on page 1of 6

Subject Specific Skills

CHAPTER – 2 AI PROJECT CYCLE

Problem Scoping
Whenever we begin a new project, we encounter a number of challenges.
In fact, we are surrounded with issues! These issues might be minor or
major; sometimes we overlook them, and other times we require immediate
attention.

To understand a problem, determine the different aspects that affect the


problem, and define the project’s goal are problem scoping.

How to Identify the Problem Scoping in AI Project


Follow the following steps to identify the problem scoping from the project –

 Understand why the project was started.


 Define the project’s primary objectives.
 Outline the project’s work statement.
 Determine the most important goals.
 Choose important milestones.
 Determine the major constraints.
 Make a list of scope exclusions.
The 4 W’s of Problem Scoping are Who, What, Where, and Why. This 4
W’s helps to identify and understand the problem in a better manner.

a. Who – The “Who” element helps us to understand and categorize who is


directly and indirectly affected by the problem, and who are known as
Stakeholders.

b. What – The “What” section aids us in analyzing and recognizing the


nature of the problem, and you may also gather evidence to establish that
the problem you’ve chosen exists under this block.

c. Where – What is the situation, and where does the problem arise.

d. Why – Refers to why we need to address the problem and what the
advantages will be for the stakeholders once the problem is solved.

Data Acquisition
The method of collecting correct and dependable data to work with is
known as data acquisition. Data can be in the form of text, video, photos,
audio, and so on, and it can be gathered from a variety of places such as
websites, journals, and newspapers.

What is Data
Data is a representation of facts or instructions about an entity that can be
processed or conveyed by a human or a machine, such as numbers, text,
pictures, audio clips, videos, and so on.

There is two type of data –

1. Structured Data
2. Unstructured Data

a. Structured Data
When data is in a standardized format, has a well-defined structure, follows
a consistent order, and is easily accessible by humans and program. This
data is in the form of numbers, characters, special characters etc.
b. Unstructured Data
Unstructured data is information that doesn’t follow traditional data models
and is therefore difficult to store and manage. Video, audio, and image
files, as well as log files, are all examples of unstructured data.

Dataset
Dataset is a collection of data in tabular format. Dataset contains numbers
or values that are related to a specific subject. For example, students’ test
scores in a class is a dataset.

The dataset is divided into two parts

a. Training dataset – Training dataset is a large dataset that teaches a


machine learning model. Machine learning algorithms are trained to make
judgments or perform a task through training datasets. Maximum part of the
dataset comes under training data (Usually 80%)

b. Test dataset – Data that has been clearly identified for use in tests,
usually of a computer program, is known as test data. 20% of data used in
test data

Acquiring Data from Reliable Sources


There are six ways to collect data.

a. Surveys
A research method for gathering data from a predetermined sample of
respondents in order to get knowledge and insights into a variety of issues.

b. Cameras
We can collect visual data with the help of cameras, this data is
unstructured data that can be analyzed via Machine learning.

c. Web Scripting
Web scribing is a technique for collecting structured data from the internet,
such as news monitoring, market research, and price tracking.
d. Observation
Some of the information we can gather through attentive observation and
monitoring.

e. Sensors
With the help of sensors also we can collect the data. A device that detects
or measures a physical property are called sensors, such as biomatrix.

f. Application program interface


An API is a software interface that enables two apps to communicate with
one another.

Data Exploration
In order to better understand the nature of the data, data analysts utilize
data visualization and statistical tools to convey dataset characterizations,
such as size, amount, and accuracy.

Data exploration is a technique used to visualize data in the form of


statistical methods or using graphs.

Why Data Exploration


Exploration helps you gain a better understanding of a dataset, making it
easier to explore and use it later. It also helps to quickly understand the
data’s trends, and patterns.

About Data Visualization Chart


Data visualization charts are graphical representations of data that use
symbols to convey a story and help people understand large volumes of
information.

a. Column Chart – A column chart is a basic Visualization chart that uses


vertical columns to represent data series. Because column lengths are
easy to compare, column charts are an effective approach to demonstrate
the changes in the data.
b. Bar Chart – A bar chart is a visual representation of category data. The
data is displayed in a bar chart with multiple bars, each representing a
different category.

Modelling
AL, ML & DL

Venn Diagram of AI
Artificial Intelligence
Artificial intelligence (AI) is the simulation of human intelligence in robots
that have been trained to think and act like humans. The term can also
refer to any machine that demonstrates, like humans, the ability to learn
and solve the problem is Artificial Intelligence.

Machine Learning
Machine learning is a part of an Artificial Intelligence application in which
we give data to the machine and allow them to learn for themselves. It’s
essentially getting a machine to accomplish something without being
specifically programmed to do so.

Deep Learning
Deep learning is a part of Artificial Intelligence that uses neural networks
with multilayer. Deep learning analyzes the data, learns the data and
solves the problem the same as a human. Deep learning requires the
machine to be educated with a large quantity of data in order to train itself.

Evaluation
After a model has been created and trained, it must be thoroughly tested in
order to determine its efficiency and performance; this is known as
evaluation.

You might also like