To Information Retrieval & Question Answering: E. Jembere Based On Lecture Slides Form Kathy Mckeown Lecture Slides

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 9

Introduction

to
Information Retrieval
&
Question Answering

E. Jembere

Based on Lecture slides form Kathy


McKeown lecture slides

1
QA: Example 1
• Who won the Nobel Peace Prize in 1991?
But many foreign investors remain sceptical, and western governments
are withholding aid because of the Slorc's dismal human rights record
and the continued detention of Ms Aung San Suu Kyi, the opposition
leader who won the Nobel Peace Prize in 1991.

The military junta took power in 1988 as pro-democracy demonstrations


were sweeping the country. It held elections in 1990, but has ignored
their result. It has kept the 1991 Nobel peace prize winner, Aung San
Suu Kyi - leader of the opposition party which won a landslide victory in
the poll - under house arrest since July 1989.

The regime, which is also engaged in a battle with insurgents near its
eastern border with Thailand, ignored a 1990 election victory by an
opposition party and is detaining its leader, Ms Aung San Suu Kyi, who
was awarded the 1991 Nobel Peace Prize. According to the British Red
Cross, 5,000 or more refugees, mainly the elderly and women and
children, are crossing into Bangladesh each day.

• What is the nationality of the 1991 Nobel Prize


winner?
QA: Example 2
The Goal of QA
• QA aim to produce specific phrases, sentences
or short passages in response to a user’s
information need expressed in a Natural
Language query.
• Information Retrieval is the task of returning
documents that are relevant to a particular
natural language query.
• Hence, QA depend on Information Retrieval
Techniques.

04/15/2023 © Jurafsky and Martin, Speech and Language Processing 4


Outline
• Information retrieval
 Vector space models for IR.
 Evaluation of IR systems
• Factoid Question answering
 Question processing
 Question classification
 Passage retrieval
 Answer processing
 Evaluation of Factoid QA system

04/15/2023 5
Information Retrieval
• Basic assumption: semantics carried in a
document can be captured by analyzing
(counting) the words that occur in it.
• I see what I eat means the same thing as I
eat what I see.
 The ordering and the constituency of words are
immaterial
• This collection of words (I, see, what, eat)
is known as the bag of words approach.

6
Some key terminology
• A document refers generically to a unity
of text indexed in the system and
available for retrieval
• A collection refers to a set of documents
being used to satisfy a user query
• A term refers to a lexical entity that occurs
in a document
• A query represents a user’s information
query expressed as a set of terms

04/15/2023 © Jurafsky and Martin, Speech and Language Processing 7


Architecture of an Ad hoc IR
System

• The user poses a query, and the system returns


a possibly ordered list of potentially relevant
documents

04/15/2023 8
Ad Hoc Retrieval
1. Represent documents as a set of weights in a
vector space.
2. Convert the query to a vector using the same
vector space and weighting scheme that were
used to represent the documents.
3. Compute the similarity between the query
vector and all the candidate documents
4. Return the documents ordered according to
how similar are they to the query

You might also like