Professional Documents
Culture Documents
M.S. Spring Semester Final Exam 2020 CSE 537: Text Mining Department of Computer Science and Engineering University of Dhaka
M.S. Spring Semester Final Exam 2020 CSE 537: Text Mining Department of Computer Science and Engineering University of Dhaka
b. What are the problems in using a single term as a topic in topic mining? 3
Define with example.
c. I. If we want to train a generative probabilistic topic model on 3+1
our corpus of 𝑁 documents with 𝑉 be the set of vocabulary +1
and we want to consider 𝑘 topics, then what shall be the
parameters of our model and what would they measure?
Define with small example.
II. How many parameters are there in total?
III. What constraints they must follow?
d. In the text categorization problem, when labeled data is available three 6
categories of classifiers can be used to solve the problem. Write their
names with short description on how they work.
5. a. How can a term be represented as term vector based on the words in the 10
context? How does it help to discover paradigmatic relations?
b. Mention the advantages of a neural language model. Describe the skip- 10
gram neural language model in detail.