IR Model Question Paper
1.
a)
b)
c) Define tokenization.
d) _____________ is the strategy for determining a stop list to sort the terms.
e)
f) The process of keeping frequently used disk data in main memory is called _______.
g)
h)
i)
j) What are the different variants of the tf-idf function for weighting scores?
k)
l)
m)
n)
o)
p)
q)
r)
s)
t)
20 × 1 = 20 Marks
2. a)
b)
c)
3. a)
b)
c)
4. a)
b)
c)
d)
5. a)
b)
c)
6. a)
b)
c)
07-marks
Differentiate between the extended Boolean model and ranked retrieval with a suitable example.
Explain the working principles of stemming and lemmatization with a suitable example.
06-marks
What is a wildcard query? Explain the different situations in which wildcard queries are used.
Explain the steps of an algorithm for computing the edit distance between two strings s1 and s2.
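For reference, one common way to approach the edit-distance question is the Levenshtein dynamic-programming recurrence. The sketch below is only an illustration, not a prescribed answer: it assumes unit costs for insertion, deletion, and substitution, and the function name edit_distance is a hypothetical choice.

```python
def edit_distance(s1: str, s2: str) -> int:
    """Levenshtein distance between s1 and s2, assuming unit edit costs."""
    m, n = len(s1), len(s2)
    # dist[i][j] holds the distance between the first i characters of s1
    # and the first j characters of s2.
    dist = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        dist[i][0] = i  # delete all i characters of the s1 prefix
    for j in range(n + 1):
        dist[0][j] = j  # insert all j characters of the s2 prefix
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            substitution_cost = 0 if s1[i - 1] == s2[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j] + 1,                      # deletion
                dist[i][j - 1] + 1,                      # insertion
                dist[i - 1][j - 1] + substitution_cost,  # match or substitution
            )
    return dist[m][n]
```

The table has (len(s1) + 1) × (len(s2) + 1) cells, each filled in constant time, so the running time is proportional to the product of the two string lengths.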
Explain, with a diagram, how distributed indexing works.
06-marks
05-marks
How do you achieve efficient scoring and ranking of documents? Illustrate with an example.
What is cluster pruning? Explain with an example how to achieve it.
Explain how unranked retrieval sets and ranked retrieval results are evaluated.
06-marks
Explain, with a diagram, the process of simplifying an XML document using a DOM object.
What are the different challenges in XML retrieval? Explain in brief.
Mention the various probabilistic approaches to relevance feedback.
06-marks
06-marks
08-marks
07-marks
08-marks
06-marks
06-marks
05-marks
04-marks
06-marks
08-marks