Professional Documents
Culture Documents
2-Lecture Two - (Back Ground of NLP)
2-Lecture Two - (Back Ground of NLP)
2-Lecture Two - (Back Ground of NLP)
...interdisciplinary field...
Several fields including linguistics, psycholinguistics,
mathematics, computer science, and electrical engineering
contribute to the research and development of NLP.
01/02/23 3
NLP: Definition
...computational techniques...
Multiple models, methods and algorithms are employed to
accomplish a particular type of language analysis.
...tasks or applications...
The goal of NLP is to accomplish human-like language processing
for various tasks and applications such as machine translation,
information retrieval, question-answering, etc.
01/02/23 5
NLP: Definition
The task of NLU is equivalent to the role of reader/listener, whereas the task of Natural Language Generation (NLG) is that of the
writer/speaker.
01/02/23 6
NLP: Importance of NLP
01/02/23 7
NLP: Difficulty of NLP
Since computers are orders of magnitude faster, many find it hard to believe
that computers are not good at processing natural languages.
01/02/23 8
NLP: Difficulty of NLP
01/02/23 9
NLP: Brief History
Research and development on NLP started along with the advent of computers.
The field emerged in the US from the strong desire of having a Machine Translation
system that automatically translates texts from Russian journals into English.
However, the initial efforts to develop an accurate machine translation system were
not successful as automatic translation could not be realized just by translating words.
01/02/23 10
NLP: Brief History
01/02/23 11
NLP: Brief History
Such understandings helped many researchers and developers realize that they needed a more
adequate theory of language.
01/02/23 12
NLP: Course Coverage and
Knowledge Requirement
01/02/23 13
Levels of Linguistic Analysis:
Morphology
Morphology is the study of the componential nature of words.
At morphological level, the smallest parts of words that carry meanings and affixes are analyzed.
01/02/23 14
Levels of Linguistic Analysis: Syntax
01/02/23 15
Levels of Linguistic Analysis:
Semantics
Semantics deals with the meaning of words, phrases and sentences.
Semantic analysis requires knowledge of:
Lexical semantics : the meanings of the component words,
Compositional semantics: how components combine to form larger meanings.
01/02/23 16
Levels of Linguistic Analysis:
Discourse
Discourse level deals with the properties of the text as a whole that
convey meaning by making connections between component sentences.
Several methods are used in discourse processing, two of the most
common being:
Anaphora resolution: replacing words such as pronouns, which are
semantically vacant, with the appropriate entity to which they refer; and
Discourse/text structure recognition: determining the functions of sentences
in the text (which adds to the meaningful representation of the text).
01/02/23 17
Levels of Linguistic Analysis:
Discourse
Discourse level deals with the properties of the text as a whole that convey meaning by making connections between component sentences.
01/02/23 18
Levels of Linguistic Analysis:
Pragmatics
Pragmatics is concerned with the purposeful use of language in situations and utilizes context
over and above the contents of the text for understanding.
Pragmatics deals with world knowledge – outside the contents of the document.
01/02/23 19
Applications of NLP: Spelling Correction
and Grammar Checking
01/02/23 20
Applications of NLP: Information
Retrieval
Information Retrieval provides a list of potentially relevant documents in response to a user’s query.
01/02/23 21
Applications of NLP: Information
Extraction
Information extraction focuses on the recognition, tagging and extraction of certain key elements of information (e.g. persons,
companies, locations, organizations, etc.) from large collections of text into a structured representation.
01/02/23 22
Applications of NLP: Machine
Translation
Machine Translation is an automatic translation of text from one language to another.
01/02/23 23
Applications of NLP: Question-
Answering
Question-Answering provides the user with either just the text of the answer itself of answer-providing passages.
01/02/23 24
Applications of NLP: Dialogue
Systems
Dialogue Systems are agents that converse with human beings in a coherent structure using several modes of communication such
as text, speech, gesture, etc.
01/02/23 25
Applications of NLP: Text
Summarization
Text summarization is an application of NLP that reduces a larger text into a shorter, yet richly constituted representation of the
original document.
01/02/23 26
Related Fields: Modes of Language
Representation
01/02/23 27
Related Fields: Speech Recognition
01/02/23 28
Related Fields: Optical Character
Recognition (OCR)
Optical Character Recognition (OCR) is a computerized system
that converts non-editable text to machine-encoded text.
If the text to be converted is handwritten, the system is also
known as intelligent Character Recognition (ICR).
01/02/23 29
Morphological Analysis
Introduction: Terminologies
01/02/23 31
Introduction: Terminologies
01/02/23 32
Introduction: Terminologies
01/02/23 33
Introduction: Kinds of Morphemes
01/02/23 37
Introduction: Morphological Types
Derivational Morphology:
Derivational Morphology is a morphology concerned with the way in
which words are derived from morphemes through processes such as
affixation or compounding.
This derivation process usually changes the part-of-speech category.
01/02/23 40
Introduction: Morphological Rules
01/02/23 44
Sentences
01/02/23 45
Simple Sentences
01/02/23 46
Parsing
01/02/23 47
Parsing Strategies: Top-Down Parsing
Top-down parsing starts with the symbol S and then searches through different ways to
rewrite the symbols until the input sentence is generated.
01/02/23 48
Parsing Strategies: Top-Down Parsing
Bottom-up parsing starts with words in a sentence and uses production rules backward to
reduce the sequence of symbols until it consists solely of S.
01/02/23 49
Towards Efficient Parsing: Separating
Lexical Rules
The efficiency of parsing algorithms can be improved if lexical rules are stored separately
in a structure called lexicon, which specifies the possible categories for each word.
The following example shows the lexical rules separated from other grammatical rules.
01/02/23 50
Towards Efficient Parsing: Chunking
01/02/23 51
Towards Efficient Parsing: Chunking
01/02/23 54
Cultural Analysis and Linguistic
Semantics
Culture of the society has in impact on semantic analysis.
How do we represent the meanings of tella, injera, besso, tej, teff,
etc… so as to translate them in to a foreign language?
There is a close link between the life of the society and the lexicon
of the language spoken.
For example: ice, snow በረዶ
Politeness in Amharic, not in English
01/02/23 55
First Order Predicate Calculus:
Elements of FOPC
A predicate represents a property or relation between terms that
can be true or false.
In a given interpretation, an n-ary predicate can defined as a function
from tuples of n terms to {True, False}.
For example: Brother(Abebe, Kebede), Left-of(Square1, Square2),
GreaterThan(plus(1,1), plus(0,1)).
Connectives are used to compose complex representations.
Truth table
01/02/23 56
First Order Predicate Calculus:
Elements of FOPC
An atomic sentence is simply a predicate applied to a set of
terms.
For example: Owns(Abebe, Car1)
Sold(Abebe, Car1, Kebede)
Semantics is True or False depending on the interpretation.
The standard propositional connectives ( ∨, ¬, ∧, ⇒ ) can be
used to construct complex sentences:
For example: Owns(Abebe, Car1) ∨ Owns(Kebede, Car1)
Sold(Abebe, Car1, Kebede) ⇒ ¬Owns(Abebe, Car1)
Semantics same as in propositional logic.
01/02/23 57
First Order Predicate Calculus:
Example
Sheraton Addis is a hotel.
Hotel(SheratonAddis)
Sheraton Addis serves Ethiopian food.
Serves(SheratonAddis, EthiopianFood)
01/02/23 58
Semantic Networks
01/02/23 59
Semantic Networks
Example:
01/02/23 60
Discovering Latent Semantics
01/02/23 61
Discovering Latent Semantics
Implements the idea that the meaning of a passage is the sum of the
meanings of its words:
meaning(word1) + meaning(word2) + … + meaning(wordn) =
meaning(passage)
01/02/23 62
Vector Space Model
01/02/23 64
Thank You !!!
01/02/23 65