Professional Documents
Culture Documents
CT2 Set A
CT2 Set A
CT2 Set A
1 CO1 3 3 - - - - - - - - - - - - -
2 CO2 3 2 - 3 - - - - - - - - - - -
3 CO3 3 2 2 3 - - - - - - - - - - -
4 CO4 3 3 2 2 - - - - - - - - - - -
5 CO5 3 2 2 2 - - - - - - - - - 2 -
Part - A
(10*1 = 10 Marks) Answer all Questions.
Q. No Questions Marks BL CO PO PI Code
1 Consider the following sentence. “Horse ran up the hill. 1 2 5 1 2.1.2
It was very steep. It soon got tired.” What type of
ambiguity is introduced due to the word “it”?
a. Syntactic
b. Pragmatics
c. cataphoric
d. Anaphoric
2 Spam email detection comes under which domain? 1 2 5 1 2.1.3
a. Text categorization
b. NER
c. Text Classification
d. Sentiment Analysis
Ans:
Vector space model:
The vector space model is an algebraic model that
represents objects (like text) as vectors. This makes it
easy to determine the similarity between words or the
relevance between a search query and a document.
Cosine similarity is often used to determine the
similarity between vectors.
Term Frequency:
Term frequency (TF) means how often a term occurs in
a document. In the context of natural language, terms
correspond to words or phrases.
1. Rule-based
2. Retrieval-based
3. Generative methods
4. Ensemble methods
5. Grounded learning
6. Interactive learning
17 Explain the vector space model of information retrieval 10 4 3 4 1.7.1
Sol:
The Vector Space Model is an algebraic model used for
Information Retrieval. It represents a natural language
document in a formal manner by the use of vectors in a
multi-dimensional space and allows decisions to be
made as to which documents are similar to each other
and to the queries fired.