Welcome to Scribd!

M.S. Spring Semester Final Exam 2020 CSE 537: Text Mining Department of Computer Science and Engineering University of Dhaka

Uploaded by

0% found this document useful (0 votes)

22 views2 pages

1. The document provides a sample final exam for a text mining course. It contains 5 questions with multiple parts each. Question 1 asks about conditional entropy and vector space models. Question 2 is about logistic regression and text categorization evaluation. Question 3 covers topic modeling parameters. Question 4 describes language models and sentiment analysis algorithms. Question 5 discusses term vectors and neural language models. 2. The exam tests students on key concepts in text mining including vector space models, logistic regression, text categorization evaluation, topic modeling, language models, and neural networks. Students must demonstrate understanding of algorithms, parameters, objective functions, and how to apply various methods to problems involving text and language data. 3. The questions assess both

Original Description:

Text Mining Question

Original Title

CSE-537_Text Mining

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

22 views2 pages

M.S. Spring Semester Final Exam 2020 CSE 537: Text Mining Department of Computer Science and Engineering University of Dhaka

Uploaded by

anikatahsin

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 2

Search inside document

M.S.

Spring Semester Final Exam 2020

CSE 537: Text Mining
Department of Computer Science and Engineering
University of Dhaka
Answer any 3 questions.
1. a. What are the minimum and maximum possible values of the conditional 2+2
entropy H(X|Y)? Under what situations do they occur?
b. I. Explain with example how basic vector space model can be 4+2
used for Paradigmatic relation mining. +6
II. What are the shortcomings of this method of mining
Paradigmatic relations?
III. What techniques can be adopted to make the mining method
robust against these shortcomings?
c. Among H(X|Y), H(Z|Y) and H(X|Z), which are comparable and which 4
are not. Explain why.

2. a. Given a training dataset 1+2+

𝑥 ,𝑦 , 𝑥 ,𝑦 ,⋯ 𝑥 ,𝑦 7
where, 𝑥 ∈ 𝑅 , 𝑦 ∈ 0,1 and 𝜃 ∈ 𝑅 is the parameter vector,
answer the following question:
I. If we want to use Logistic regression, then what is the range
of values for ℎ 𝑥 ? What does it actually measure (write
using probability notation)?
II. If we define ℎ 𝑥 𝑔 𝜃 𝑥 , then draw the graph of
𝑔 𝜃 𝑥 .
III. Show step by step derivation of the objective function of the
aforementioned model. Also define the intuition behind this
objective function using graphs.

b. Say in a text categorization problem your algorithm will categorize N 3+ 7

documents into k categories. Answer the following questions:
I. Why Classification Accuracy is not a good measure to
evaluate performance of your algorithm?
II. How can you measure the average performance of your
algorithm over all categories and over all documents?
Describe with dummy examples.
3. a. Give a training dataset 5+1
𝑥 ,𝑦 , 𝑥 ,𝑦 ,⋯ 𝑥 ,𝑦
𝑤ℎ𝑒𝑟𝑒, 𝑥 ∈ 𝑅 𝑖𝑠𝑎𝑠𝑒𝑛𝑡𝑒𝑛𝑐𝑒 ∧ 𝑦 ∈
1,2, ⋯ , 𝑘 𝑖𝑠𝑡ℎ𝑒𝑟𝑎𝑡𝑖𝑛𝑔𝑜𝑓𝑡ℎ𝑒𝑠𝑒𝑛𝑡𝑒𝑛𝑐𝑒,
answer the following question:
I. Design a multiple logistic regression classifier for rating
prediction of any sentence.
II. How many parameters are there in your model?

b. What are the problems in using a single term as a topic in topic mining? 3
Define with example.
c. I. If we want to train a generative probabilistic topic model on 3+1
our corpus of 𝑁 documents with 𝑉 be the set of vocabulary +1
and we want to consider 𝑘 topics, then what shall be the
parameters of our model and what would they measure?
Define with small example.
II. How many parameters are there in total?
III. What constraints they must follow?
d. In the text categorization problem, when labeled data is available three 6
categories of classifiers can be used to solve the problem. Write their
names with short description on how they work.

4. a. Describe the unigram language model for discovering one topic 10

b. Explain the multilevel logistic regression algorithm for sentiment 10
analysis

5. a. How can a term be represented as term vector based on the words in the 10
context? How does it help to discover paradigmatic relations?
b. Mention the advantages of a neural language model. Describe the skip- 10
gram neural language model in detail.

Study Notes To Ace Your Data Science Interview
Document7 pages
Study Notes To Ace Your Data Science Interview
Dănuț Daniel
No ratings yet
B6301-Advanced Ai Concepts
Document1 page
B6301-Advanced Ai Concepts
SRINIVASA RAO GANTA
No ratings yet
Page 1 of 4
Document4 pages
Page 1 of 4
ABDUL WARIS AFTAB
No ratings yet
ML QP
Document6 pages
ML QP
Ashok
No ratings yet
ML Question Bank
Document7 pages
ML Question Bank
arunwaghmare5
No ratings yet
HT TP: //qpa Pe R.W But .Ac .In: Pattern Recognition
Document4 pages
HT TP: //qpa Pe R.W But .Ac .In: Pattern Recognition
Duma Dumai
No ratings yet
Analysis and Design of Algorithm
Document2 pages
Analysis and Design of Algorithm
Adhikari Sushil
No ratings yet
Further Mathematics/Mathematics (Elective) Aims of The Syllabus
Document18 pages
Further Mathematics/Mathematics (Elective) Aims of The Syllabus
Hajara Abubakari Sadiq
No ratings yet
M.Tech I Semester Supplementary Examinations August/September 2018
Document1 page
M.Tech I Semester Supplementary Examinations August/September 2018
Naveen Kumar
No ratings yet
The Figures in The Margin Indicate Full Marks. Candidates Are Required To Write Their Answers in Their Own Words As Far As Practicable
Document3 pages
The Figures in The Margin Indicate Full Marks. Candidates Are Required To Write Their Answers in Their Own Words As Far As Practicable
gwgwgwg wtwt
No ratings yet
Computer Programming Using C & Numerical Methods
Document1 page
Computer Programming Using C & Numerical Methods
Sanjana Xavier
No ratings yet
SAMPLE PAPER-I Class XII (Informatics Practices) QP With MS & BP
Document7 pages
SAMPLE PAPER-I Class XII (Informatics Practices) QP With MS & BP
Sarathi Sarathi
No ratings yet
MYP1 - Add and Subtract Unit Fractions (BC)
Document7 pages
MYP1 - Add and Subtract Unit Fractions (BC)
Daniel Tulula
No ratings yet
Object Oriented Programming in C++
Document4 pages
Object Oriented Programming in C++
Arun Sharma
No ratings yet
Question Bank Module-1: Department of Computer Applications 18mca53 - Machine Learning
Document7 pages
Question Bank Module-1: Department of Computer Applications 18mca53 - Machine Learning
Shiva Shankara
No ratings yet
AI Unitwise Imp Questions
Document3 pages
AI Unitwise Imp Questions
kshivanyk22
No ratings yet
OOP MSBTE Question Paper Winter 2007
Document3 pages
OOP MSBTE Question Paper Winter 2007
api-3728136
No ratings yet
It-3035 (NLP) - CS End May 2023
Document10 pages
It-3035 (NLP) - CS End May 2023
Rachit Srivastav
No ratings yet
Qualification Exam Question: 1 Statistical Models and Methods
Document4 pages
Qualification Exam Question: 1 Statistical Models and Methods
Almalieque
No ratings yet
ADA Previous Year Papers
Document12 pages
ADA Previous Year Papers
piloxa
No ratings yet
Punjab Engineering College (Deemed To Be University) End-Semester Examination November, 2018
Document2 pages
Punjab Engineering College (Deemed To Be University) End-Semester Examination November, 2018
Agrim Dewan
No ratings yet
Further Mathematics or Mathematics Elective
Document18 pages
Further Mathematics or Mathematics Elective
nyavimichael77
No ratings yet
Econ F241 Comprehensive
Document2 pages
Econ F241 Comprehensive
yuvraj12389
No ratings yet
Master of Computer Applications (MCA) : Assignments JANUARY 2012
Document14 pages
Master of Computer Applications (MCA) : Assignments JANUARY 2012
Subramanyam Pillalamarri
No ratings yet
156bn - Machine Learning
Document1 page
156bn - Machine Learning
sai Prasad
No ratings yet
Machine Learning With Templates: Michael Stephen Fiske Aemea Institute, San Francisco, CA, USA
Document6 pages
Machine Learning With Templates: Michael Stephen Fiske Aemea Institute, San Francisco, CA, USA
anca irina
No ratings yet
Important Instructions To Examiners:: Model Answer
Document16 pages
Important Instructions To Examiners:: Model Answer
DaNgEr001
No ratings yet
Mcqs Bank Unit 1: A) The Autonomous Acquisition of Knowledge Through The Use of Computer Programs
Document8 pages
Mcqs Bank Unit 1: A) The Autonomous Acquisition of Knowledge Through The Use of Computer Programs
varad
100% (1)
TE Comp Sem VI - AI For May 2022 Examination
Document3 pages
TE Comp Sem VI - AI For May 2022 Examination
SURABHI NAIK192074
No ratings yet
PGDCA Assignment Paper
Document13 pages
PGDCA Assignment Paper
Shopaholic
No ratings yet
Time: 3 Hours Total Marks: 100: B Tech (Sem Vii) Theory Examination 2017-18 Artificial Intelligence
Document2 pages
Time: 3 Hours Total Marks: 100: B Tech (Sem Vii) Theory Examination 2017-18 Artificial Intelligence
Mayank Pandey
No ratings yet
G8DLL Q3W2
Document5 pages
G8DLL Q3W2
NEithan Deldoc
No ratings yet
CM3060 NLP Mock Exam Oct2021
Document4 pages
CM3060 NLP Mock Exam Oct2021
currecurre
No ratings yet
Artificial Intelligence Exam
Document3 pages
Artificial Intelligence Exam
Muhammad Saud Ali
No ratings yet
Homework DL 5GI Sheet1
Document5 pages
Homework DL 5GI Sheet1
Simon Mengong
No ratings yet
Machine Learning 3rd Sem MCA 2022 QP
Document2 pages
Machine Learning 3rd Sem MCA 2022 QP
anjalidn2001
No ratings yet
Erocido Feb14 LP
Document6 pages
Erocido Feb14 LP
johnnen.magtolis
No ratings yet
Aiml Answers
Document42 pages
Aiml Answers
divyashreecr0910
No ratings yet
Adamas University: End-Semester Examination: May 2021 Name of The Program: Semester
Document1 page
Adamas University: End-Semester Examination: May 2021 Name of The Program: Semester
Srija Chakraborty
No ratings yet
Graph Clustering
Document38 pages
Graph Clustering
Alejandro Ortega de los Ríos
No ratings yet
Software Development Techniques 28th November 2019 Examination Paper
Document6 pages
Software Development Techniques 28th November 2019 Examination Paper
Yuki Ko
No ratings yet
Banasthali Vidyapith: 2nd Periodical Test, March, 2020 Class: B.Tech (CS) VI Semester
Document1 page
Banasthali Vidyapith: 2nd Periodical Test, March, 2020 Class: B.Tech (CS) VI Semester
kfrahman
No ratings yet
Er Sity: Vidyasagar University
Document3 pages
Er Sity: Vidyasagar University
Sunirmal Murmu
No ratings yet
All PDF
Document40 pages
All PDF
Charli Patil
No ratings yet
2021ISCReducedSyllabusXI-COMPUTER SCIENCE
Document5 pages
2021ISCReducedSyllabusXI-COMPUTER SCIENCE
Raj Vlogs
No ratings yet
A. Content Standards:: Other Learning Resources
Document4 pages
A. Content Standards:: Other Learning Resources
May Ann C. Payot
100% (1)
Code No.: CS211/13CS209 II B.Tech. II Sem. (RA13/RA11) Supplementary Examinations, March/April - 2019 Design & Analysis of Algorithms
Document2 pages
Code No.: CS211/13CS209 II B.Tech. II Sem. (RA13/RA11) Supplementary Examinations, March/April - 2019 Design & Analysis of Algorithms
cholleti prathyuusha
No ratings yet
Question Bank
Document4 pages
Question Bank
Aanchal Chaudhary
No ratings yet
CSC220 356 133-CSC220
Document5 pages
CSC220 356 133-CSC220
Aniket Ambekar
No ratings yet
Discrete Mathematics Syllabi
Document1 page
Discrete Mathematics Syllabi
Akkayya Naidu Bonu
No ratings yet
MYP Unit Planner: INQUIRY: Establishing The Purpose of The Inquiry
Document10 pages
MYP Unit Planner: INQUIRY: Establishing The Purpose of The Inquiry
Kelly Orosz
No ratings yet
Pamantasan NG Cabuyao College of Education, Arts and Sciences
Document11 pages
Pamantasan NG Cabuyao College of Education, Arts and Sciences
-------
No ratings yet
Artificial Intelegence Assignment
Document3 pages
Artificial Intelegence Assignment
Bhargav Kantipudi
No ratings yet
MIT6 006F11 ps4
Document5 pages
MIT6 006F11 ps4
Mo
No ratings yet
NLP Endsem 2016
Document2 pages
NLP Endsem 2016
Puneet Sangal
No ratings yet
Predicting and Evaluating The Popularity of Online News: He Ren Quan Yang
Document5 pages
Predicting and Evaluating The Popularity of Online News: He Ren Quan Yang
abhinav30
No ratings yet
AIML Syllabus
Document7 pages
AIML Syllabus
Omkar More
No ratings yet
Alarming! the Chasm Separating Education of Applications of Finite Math from It's Necessities
From Everand
Alarming! the Chasm Separating Education of Applications of Finite Math from It's Necessities
Ramune B. Adams
No ratings yet
Good Habits for Great Coding: Improving Programming Skills with Examples in Python
From Everand
Good Habits for Great Coding: Improving Programming Skills with Examples in Python
Michael Stueben
No ratings yet
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
From Everand
AP Computer Science Principles: Student-Crafted Practice Tests For Excellence
Sama Alshatali
No ratings yet
21CSFS - Cornell Notes AI's - 25.03.23
Document1 page
21CSFS - Cornell Notes AI's - 25.03.23
Felipe
No ratings yet
Schwarke Et Al. - 2023 - Curiosity-Driven Learning of Joint Locomotion and
Document17 pages
Schwarke Et Al. - 2023 - Curiosity-Driven Learning of Joint Locomotion and
Anonymous o64C7SaZWs
No ratings yet
Data Science in 2024 - What Has Changed - by Nathan Rosidi - Jan, 2024 - Medium
Document18 pages
Data Science in 2024 - What Has Changed - by Nathan Rosidi - Jan, 2024 - Medium
chengshanwu1407
No ratings yet
Structuralism and Saussure
Document13 pages
Structuralism and Saussure
16gami
No ratings yet
Comparison of Various System Identification Methods For A MISO System
Document16 pages
Comparison of Various System Identification Methods For A MISO System
Sashank Varma Jampana
No ratings yet
Dav Public School, Vasant Kunj, New Delhi: Artificial Intelligence (Subject Code: 417)
Document8 pages
Dav Public School, Vasant Kunj, New Delhi: Artificial Intelligence (Subject Code: 417)
Smritee Ray
No ratings yet
Capsule Neural Network
Document42 pages
Capsule Neural Network
Mag Creation
100% (1)
Research Paper
Document7 pages
Research Paper
Shravan thouti
No ratings yet
Arduino Computer Vision Programming Graphics Bundle
Document45 pages
Arduino Computer Vision Programming Graphics Bundle
bobmarc
No ratings yet
Decision Tree
Document28 pages
Decision Tree
Rishabh Gupta
No ratings yet
Nonverbal Communication
Document5 pages
Nonverbal Communication
FahadKhan
No ratings yet
d2l en PDF
Document1,024 pages
d2l en PDF
Khanh Tran
100% (1)
KD Kakuro 8x8 s2 b015
Document9 pages
KD Kakuro 8x8 s2 b015
Dawn Gate
No ratings yet
Elements of Communication
Document2 pages
Elements of Communication
lekz re
100% (2)
Application of Artificial Intelligence Controller For Dynamic Simulation of Induction Motor Drives
Document7 pages
Application of Artificial Intelligence Controller For Dynamic Simulation of Induction Motor Drives
DrPrashant M. Menghal
No ratings yet
Final Version
Document25 pages
Final Version
Naveen Prakash
No ratings yet
HandleAzureSQLAuditingWithEase Passsummit
Document45 pages
HandleAzureSQLAuditingWithEase Passsummit
Tebele Sebuiwa
No ratings yet
License Plate Recognition
Document2 pages
License Plate Recognition
sujalavinnyp
No ratings yet
Lecture 1: Pendidikan Awal Kanak-Kanak
Document19 pages
Lecture 1: Pendidikan Awal Kanak-Kanak
Najwa Ahmad
No ratings yet
Machine Learning Based Intrusion Detection Systems Using HGWCSO and ETSVM Techniques
Document4 pages
Machine Learning Based Intrusion Detection Systems Using HGWCSO and ETSVM Techniques
aarthi dev
No ratings yet
Bagging and Boosting
Document33 pages
Bagging and Boosting
Natali Lourenço
No ratings yet
Computerized Image Processing and Machine Vision 7.5 Credits
Document2 pages
Computerized Image Processing and Machine Vision 7.5 Credits
Md. Tuhin Khan
No ratings yet
Chapter 2 - Inteligent Agent
Document31 pages
Chapter 2 - Inteligent Agent
Dawit Andargachew
No ratings yet
Unit 1 ETI Notes
Document15 pages
Unit 1 ETI Notes
02 - CM Ankita Adam
0% (1)
FALLSEM2022-23 SWE1011 EPJ VL2022230101204 Review I Material Review 1 SWE1011 Soft Computing Project1
Document22 pages
FALLSEM2022-23 SWE1011 EPJ VL2022230101204 Review I Material Review 1 SWE1011 Soft Computing Project1
abhishek asgola
No ratings yet
CC - Unit-5
Document26 pages
CC - Unit-5
kavists20
No ratings yet
SIGNLANGUAGE PPT
Document15 pages
SIGNLANGUAGE PPT
vishnuram1436
No ratings yet
Machine Learning - CheatSheet
Document2 pages
Machine Learning - CheatSheet
lucy01123
100% (1)
Proportional Control
Document47 pages
Proportional Control
Abd Aziz
No ratings yet
Qing Guo, Dan Jiang-Nonlinear Control Techniques For Electro-Hydraulic Actuators in Robotics Engineering-CRC Press (2017)
Document159 pages
Qing Guo, Dan Jiang-Nonlinear Control Techniques For Electro-Hydraulic Actuators in Robotics Engineering-CRC Press (2017)
FedericoBetti
No ratings yet