Professional Documents
Culture Documents
4.NLP CIC 4 PDF
4.NLP CIC 4 PDF
4.NLP CIC 4 PDF
Lexical Semantics
Semantics
Lexical semantics
Lexical semantic relations
WordNet
Word Sense Disambiguation
• Lesk algorithm
• Yarowsky’s algorithm
Lexical Semantics
Sword
Tomato
King
Extensional definitions are used when listing examples would give more applicable
information than other types of definition, and where listing the members of a set
tells the questioner enough about the nature of that set.
For example, the lexeme BANK (noun) consists of bank and banks, but not
Hypernymy/hyponymy
Synonymy
Antonymy
Homonymy
Polysemy
Metonymy
Holonymy/meronymy
Lexical Semantics
hypernym: A word with a broad meaning constituting a category into which words
with more specific meanings fall; a superordinate. For example, colour is a
hypernym of red.
Hyponym Hypernym
Snake Reptile
Mango Fruit
Synonymy
(Roughly) same meaning
Antonymy
happy sad
descendant ancestor
black white
up down
Lexical Semantics
Homonym — are words which sound alike or are spelled alike, but have different
meanings
Fluke: A fish, and a flatworm
• e.g., son vs. sun, sea vs. see, sell vs. cell, their, there
• e.g., lead (noun) vs. lead (verb) , close vs. close, minute vs. minute
Lexical Semantics
Polysemy — the coexistence of many possible meanings for a word or phrase/
Multiple related meanings.
S: (n) newspaper, paper (the physical object that is the product of a newspaper
publisher)
S: (n) newspaper, newsprint (cheap paper made from wood pulp and used for
printing newspapers)
Lexical Semantics
Polysemy — the coexistence of many possible meanings for a word or phrase/
Multiple related meanings.
Bank
a financial institution.
rely Upon is different but related, as it derives from the theme of security initiated by 1.
Lexical Semantics
S: (n) position, place (the particular portion of space occupied by something) "he put
the lamp back in its place"
S: (n) military position, position (a point occupied by troops for tactical reasons)
S: (n) position, posture, attitude (the arrangement of the body and its limbs) "he
assumed an attitude of surrender"
S: (n) status, position (the relative position or standing of things or especially persons
in a society) "he had the status of a minor"; "the novel attained the status of a
classic"; "atheists do not enjoy a favourable position in American life"
S: (n) position, post, berth, office, spot, billet, place, situation (a job in an organization)
"he occupied a post in the treasury”
Lexical Semantics
Ex: suit for business executive, or the turf for horse racing
The White House or The Oval Office - used in place of the President or White
House staff
Usage:
Computationally determining which sense of a word is activated by its use in a particular context.
—Probabilistic/Statistical models.
Hybrid Approaches
— Argument-structure of verbs.
— Description of properties of words such that meeting the selectional preference criteria
can be decided.
E.g. This flight serves the “region” between Mumbai and Delhi
Find the overlap between the features of different senses of an ambiguous word (sense bag) and
The sense which has the maximum overlap is selected as the contextually appropriate sense.
Lesk’s Algorithm
More like a family of algorithms which, in essence, choose the sense whose
dictionary definition shares the most words with the target word’s neighbourhood.
dictionary definition of Si
• Compute Overlap(B, signature(Si))
Lesk’s Algorithm
Sense Bag: contains the words in the definition of a candidate sense of the ambiguous word.
Context Bag: contains the words in the definition of each sense of each context word.
E.g. “On burning coal we get ash.”
Coal
Ash
Sense 1
Sense 1
A piece of glowing carbon or burnt wood.
Sense 2
Step 1: For each sense of the target word find the thesaurus category to which that
sense belongs.
Step 2: : Calculate the score for each sense by using the context words. A context words
will add 1 to the score of the sense if the thesaurus category of the word matches
that of the sense
Sense1: Sense2:
Money Finance 1 Location 0
Interest 1 0
Fetch 0 0
Annum 1 0
Total 3 0
Word Sense Disambiguation
Unsupervised Learning is where you only have input data (X) and no
corresponding output variables. The goal for unsupervised learning is to model the
underlying structure or distribution in the data in order to learn more about the
data
• Supervised: All data is labeled and the algorithms learn to predict the output
from the input data.
Sense A:
• plant as in a lifeform Other data
Sense B:
• plant as in a factory
Word Sense Disambiguation
Step 3 : Train a classifier (Decision-List classifier)
Nearby words provide strong and consistent clues as to the sense of a target
word.