NLP Unit 1
Prerequisites:
language in the form of text or voice data and to ‘understand’ its full
1. Information Extraction
2. Question Answering
3. Sentiment Analysis
Speech recognition, intent classification, urgency detection, auto-correct, market intelligence, email
filtering, voice assistants and chatbots, targeted advertising, recruitment
Information Extraction (IE)
1. Working through an enormous amount of text data by hand is tedious
and time-consuming.
algorithms.
different food items like burgers, pizza, sandwiches, milkshakes, etc. They
have created a website to sell their food, and customers can now order
any food item from it and leave reviews as well, like
The first review is clearly positive: it shows that the customer was
really happy with the sandwich. The second review is negative, so the
company needs to look into its burger department. The third one does not
indicate whether the customer is happy or not, so we can treat it as a
neutral statement.
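The positive / negative / neutral split described above can be sketched with a tiny word-list classifier. This is only an illustration under stated assumptions: the word lists, the function name `classify_review`, and the three sample reviews are all made up here (the original reviews are not reproduced), and a real system would typically use a trained model or a curated lexicon instead.

```python
import re

# Hypothetical word lists, chosen only for this illustration.
POSITIVE = {"good", "great", "tasty", "delicious", "loved", "happy", "excellent"}
NEGATIVE = {"bad", "stale", "cold", "terrible", "awful", "hated", "worst"}


def classify_review(text: str) -> str:
    """Label a review by counting positive and negative words."""
    words = re.findall(r"[a-z']+", text.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"  # no sentiment words, or they cancel out


# Made-up reviews in the spirit of the food-company example.
reviews = [
    "The sandwich was delicious and I loved it",
    "The burger was stale and cold",
    "I ordered a pizza yesterday",
]
labels = [classify_review(r) for r in reviews]
print(labels)
```

Counting words like this ignores negation ("not good") and sarcasm, which is one reason practical sentiment analysis relies on learned models.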
Machine Translation
Machine Translation (MT) is the task of automatically
converting one natural language into another, preserving
the meaning of the input text, and producing fluent text in the
output language.
Most of these jobs have to be done in both source and target language.
SYSTRAN was used for the Apollo-Soyuz project (1973) and by the
● Domain-independent
Disadvantages
text corpora.
It was first introduced in 1955, but it gained interest only after 1988
Disadvantages
translation.
AI
Microsoft
system.
Advantages
Disadvantages
language.
● Lexical analysis divides the text into paragraphs, sentences, and words.
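The paragraph, sentence, and word split can be sketched with regular expressions from the Python standard library. This is a rough approximation, not a full tokenizer: it assumes paragraphs are separated by blank lines and sentences end in ".", "!" or "?", so abbreviations such as "Dr." would be split incorrectly. The function name `lexical_analysis` and the sample text are made up for illustration.

```python
import re


def lexical_analysis(text: str) -> list[list[list[str]]]:
    """Split text into paragraphs, then sentences, then words."""
    result = []
    # Assumption: a blank line marks a paragraph boundary.
    for paragraph in re.split(r"\n\s*\n", text.strip()):
        # Assumption: whitespace after '.', '!' or '?' marks a sentence boundary.
        sentences = [s for s in re.split(r"(?<=[.!?])\s+", paragraph.strip()) if s]
        # Words are runs of letters, digits, or underscores; punctuation is dropped.
        result.append([re.findall(r"\w+", s) for s in sentences])
    return result


sample = "Sara goes to Mumbai. She likes the city.\n\nNLP is fun!"
structure = lexical_analysis(sample)
print(structure)
```

The result is a nested list: one entry per paragraph, each holding one word list per sentence.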
Here “Mumbai goes to Sara”, which does not make any sense, so this
grammar.
Dependency Grammar and Part of Speech (POS) tags are the important
structure. Natural language is complex and, most of the time, the meaning of a
sequence of text depends on the prior discourse.
This concept occurs often in pragmatic ambiguity. This analysis deals with how the
immediately preceding sentence can affect the meaning and interpretation of the
next sentence. Here, context can also be analyzed at a larger scale, such as paragraph
It actually comes from the field of linguistics (as a lot of NLP does), where the
context is considered from the text.
Why is this important? Because much of a text's meaning depends on
the context in which it was said or written.
● Synonyms
● Ambiguity
● Domain-specific language
● Low-resource languages