Download as pdf or txt
Download as pdf or txt
You are on page 1of 10

Chapter Six

6. Communicating, Perceiving, and Acting


6.1. What is Natural Language Processing (NLP)?
Natural Language Processing (NLP) is a branch of artificial intelligence (AI) that focuses on the
interaction between computers and humans through natural language. It enables computers to
understand, interpret, and generate human language in a way that is both meaningful and useful.
NLP combines computational linguistics, computer science, and AI techniques to bridge the gap
between human communication and computer understanding. Some of the key tasks in NLP
include:
 Text Parsing: Breaking down text into its constituent parts, such as words, phrases,
and sentences.
 Named Entity Recognition (NER): Identifying and categorizing named entities
mentioned in text, such as people, organizations, locations, dates, etc.
 Part-of-Speech (POS) Tagging: Assigning grammatical parts of speech to words in a
sentence (e.g., noun, verb, adjective).
 Sentiment Analysis: Determining the sentiment or opinion expressed in a piece of text,
whether it's positive, negative, or neutral.
 Machine Translation: Translating text from one language to another automatically.
 Text Generation: Creating human-like text based on input or in response to specific
prompts.
 Question Answering: Understanding questions posed in natural language and
providing accurate answers. Hence, question answering focuses on building systems
that automatically answer the questions asked by humans in a natural language.

Figure: Utilizing Natural Language Processing (NLP) for Question and Answering

1
 Text Summarization: Generating concise summaries of longer texts while preserving
key information.
 Spam detection: Used to detect unwanted e-mails getting to a user's inbox.

Figure: Spam detection using NLP

NLP has numerous applications across various domains, including customer service chatbots,
virtual assistants, information retrieval systems, and sentiment analysis in social media, language
translation services, and much more. As NLP techniques continue to advance, the possibilities for
its application are constantly expanding.

Refer to more information about the history of NLP.

Figure: The connections between NLP, Computer Science, Artificial Intelligence, and Human
Interaction

6.1.1. Types and phases of NLP


Natural Language Processing (NLP) involves various types and phases, each addressing different
aspects of language understanding and processing. Here's an overview of the types and phases of
NLP:

2
Types of NLP
 Rule-based NLP: In this approach, linguistic rules are defined manually by experts to
analyze and process text. These rules govern how the system interprets and generates
language. While effective for certain tasks, rule-based NLP can be limited in handling
ambiguity and complexity.
 Statistical NLP: Statistical methods involve using large amounts of data to train
models that can automatically learn patterns and associations within language.
Techniques such as machine learning algorithms, probabilistic models, and statistical
parsing are employed to analyze and generate text.
 Hybrid NLP: This approach combines rule-based and statistical techniques to leverage
the strengths of both methods. It can provide more robust and accurate language
processing by incorporating linguistic rules with data-driven statistical models.
 Deep Learning NLP: Deep learning techniques, particularly neural networks, have
revolutionized NLP in recent years. Models like recurrent neural networks (RNNs),
convolutional neural networks (CNNs), and transformer architectures (e.g., BERT,
GPT) are used for tasks such as text classification, language generation, and machine
translation.

6.1.2. Natural Language for Communication & Perception


"Natural Language for Communication & Perception" refers to the ability of humans and,
increasingly, machines to understand, process, and produce language to communicate effectively
and perceive the world around them. This concept encompasses both the use of language as a
means of communication between individuals and as a tool for perceiving and interpreting
information from the environment.

Communication
 Interpersonal Communication: Humans use natural language to convey thoughts, ideas,
emotions, and intentions to others. This includes spoken language in face-to-face
conversations, written language in letters and emails, and digital communication through
chat messages and social media.
 Human-Computer Interaction (HCI): Natural language interfaces enable users to interact
with computers and devices using spoken or written language, rather than traditional input

3
methods like keyboards or touchscreens. Virtual assistants, chatbots, and voice-controlled
systems are examples of HCI applications that leverage natural language processing (NLP)
technologies.
 Multilingual Communication: Natural language serves as a bridge between people who
speak different languages, facilitating communication and understanding across cultural
and linguistic barriers. Translation and interpretation services, as well as language learning
tools, play a crucial role in enabling multilingual communication.

Perception
Text Understanding: Natural language processing enables machines to analyze and interpret text
data, extracting meaning, identifying entities and relationships, and making inferences. Text
understanding technologies are used in various applications such as information retrieval,
sentiment analysis, and content categorization.

 Speech Recognition: Machines can perceive and transcribe spoken language into text
using automatic speech recognition (ASR) systems. This technology enables voice-
controlled interfaces, dictation software, and real-time transcription services.
 Language Generation: Natural language generation (NLG) techniques allow machines to
produce human-like text based on input data or predefined rules. NLG applications include
generating reports, writing summaries, and composing personalized messages.
 Contextual Understanding: Effective communication and perception require
understanding language in context, considering factors such as tone, intent, and cultural
nuances. Contextual understanding is essential for tasks like sentiment analysis,
conversational agents, and personalized recommendations.
 Visual and Multimodal Perception: Natural language can be integrated with other
modalities, such as images, videos, and sensor data, to enhance perception and
understanding. Multimodal technologies enable applications like image captioning, visual
question answering, and augmented reality interfaces.

In general, natural language serves as a powerful tool for communication and perception, enabling
humans and machines to interact with each other and the world in richer, more intuitive ways.
Advances in natural language processing and understanding continue to drive innovation across a

4
wide range of fields, from communication technology and artificial intelligence to human-
computer interaction and cognitive science.

Pros:

 Helps users to ask questions about any subject and get a direct response
within seconds.
 Helps computers to communicate with humans in their languages.
 It is very time-efficient.
 Most companies use NLP to improve the efficiency of documentation
processes, accuracy of documentation, and identify the information
from large databases.
Cons:

 NLP may not show context.


 NLP is unpredictable
 NLP may require more keystrokes.
 NLP is unable to adapt to the new domain, and it has a limited function
which is why NLP is built for a single and specific task only.

6.2. What is computer vision?


Computer Vision is an AI technology that enables robots to perceive visual data. It holds
significant importance across domains like safety, security, healthcare, accessibility, and
entertainment. Computer vision automatically extracts, analyzes, and interprets valuable
information from single or multiple images, achieved through the development of algorithms for
automated visual comprehension.
Hardware components of a Computer Vision System include:
 Power supply
 Image acquisition device (e.g., camera)
 Processor
 Software
 Display device for system monitoring
 Accessories like camera stands, cables, and connectors

5
Tasks of Computer Vision include:

1. Optical Character Recognition (OCR): In the domain of computers, Optical Character


Reader, a software to convert scanned documents into editable text, which accompanies a
scanner.
2. Face Detection: Many state-of-the-art cameras come with this feature, which enables to
read the face and take a picture of that perfect expression. It is used to let a user access the
software on the correct match.
3. Object Recognition: They are installed in supermarkets, cameras, high-end cars such as
BMW, GM, and Volvo.
4. Estimating Position: It is estimating the position of an object concerning the camera as in
the position of the tumor in the human body.
Refer to more information about image formation, simple image features, classifying
images, detecting objects, and the 3D world.

6.3. What is Robotics?


Robotics is a branch of artificial intelligence that focuses on designing and developing effective
and intelligent robots. These robots serve as artificial agents that interact within real-world
environments. Their primary objective is to manipulate objects by perceiving, selecting, moving,
and altering their physical properties. This enables robots to perform tasks that may be repetitive
or physically demanding, freeing human workers from such laborious activities while ensuring
consistency and reliability in operations.

Aspects of Robotics

 The robots have mechanical construction, form, or shape designed to accomplish a


particular task.
 They have electrical components that power and control the machinery.
 They contain some level of a computer program that determines what, when, and how a
robot does something.

6
Table: Comparison between Other AI Programs and Robotics Systems
AI Programs Robots
They usually operate in computer-stimulated They operate in the real physical world
words
The input to an AI program is in symbols and Inputs to robots is analog signals in the form of
rules speech waveforms or images
Need general-purpose computers to operate on They need special hardware with sensors and
effectors

6.3.4. Reinforcement Learning in Robotics


Reinforcement Learning (RL) in robotics involves training autonomous agents to perform tasks
through trial and error, where they learn from feedback received from the environment. Moreover,
reinforcement Learning (RL) in robotics is a subfield of artificial intelligence that focuses on
training robots to perform tasks autonomously by interacting with their environment. Unlike
traditional programming methods, where tasks are explicitly defined and programmed, RL enables
robots to learn behaviors through exploration and experimentation.

Key Components:

 Agent: The robot or autonomous system that interacts with the environment.
 Environment: The physical or virtual world in which the agent operates.
 Actions: The set of possible actions that the agent can take in the environment.
 State: The current configuration or observation of the environment.
 Rewards: Feedback signals provided by the environment to indicate the desirability of
actions taken by the agent.

Process

 Exploration: The agent explores the environment by taking actions and observing the
resulting states and rewards.
 Exploitation: Based on past experiences, the agent learns to choose actions that maximize
expected rewards.
 Learning: The agent updates its policy or strategy based on the observed rewards, aiming
to improve its performance over time.

7
6.3.6. Robotics, types of robotics
Robotics is a multidisciplinary field that involves the design, construction, operation, and use of
robots. Robots are programmable machines capable of carrying out tasks autonomously or semi-
autonomously. There are various types of robotics, each serving different purposes and operating
in different environments. Here are some common types of robotics:

1. Industrial Robotics: Used in manufacturing for tasks like assembly, welding, and material
handling.
2. Mobile Robotics: Includes robots designed for movement in various environments, such
as drones and autonomous vehicles.
3. Medical Robotics: Used in healthcare for surgeries, rehabilitation, and patient care.
4. Humanoid Robotics: Resemble human form and are used in research, entertainment, and
assistance roles.
5. Aerial Robotics (Drones): Unmanned aerial vehicles used for tasks like aerial
photography, surveillance, and delivery services.
6. Consumer Robotics: Designed for personal use, such as robot vacuum cleaners and
personal assistants.
7. Space Robotics: Used in space exploration missions for tasks like satellite servicing and
planetary exploration.
These types represent key areas of robotics with significant applications and impact across various
industries and domains.

6.3.8. Application areas, pros and cons


Robotics encompasses a wide range of applications across various industries, each with its own
set of advantages and challenges. Here are some common application areas of robotics along with
their respective pros and cons:
1. Manufacturing and Industrial Automation
 Pros: Increase efficiency, productivity, and precision in manufacturing processes. Robots
can work continuously without fatigue, handle hazardous materials, and perform repetitive
tasks with high accuracy.
 Cons: High initial investment costs for setup and maintenance. Requires specialized
programming and may lead to job displacement for certain manual labor roles.

8
2. Healthcare and Medical Robotics
 Pros: Assist in surgeries, rehabilitation, and patient care, improving precision and reducing
the risk of human error. Robotics enable minimally invasive procedures, shorter recovery
times, and enhanced patient outcomes.
 Cons: Complex regulatory approval processes and concerns about safety, reliability, and
ethical considerations. Limited accessibility due to high costs and technological
dependencies.
3. Service and Hospitality Robots
 Pros: Assist in tasks such as cleaning, delivery, customer service, and entertainment,
enhancing efficiency and customer satisfaction. Can operate 24/7, reducing labor costs and
improving service quality.
 Cons: Limited adaptability and social interaction capabilities compared to human
counterparts. Concerns about privacy, security, and potential job displacement in service
industries.
4. Agricultural Robotics
 Pros: Automate tasks like planting, harvesting, and monitoring crops, increasing yield,
reducing labor costs, and minimizing environmental impact. Precision agriculture
techniques enabled by robotics improve resource utilization and crop management.

(a)

(b) (c)

Figure: AI in Agriculture

9
 Cons: Challenges in adapting to diverse terrains, crops, and weather conditions. Initial
investment costs and the need for integration with existing agricultural practices.

10

You might also like