
Diane Hu Notes on HTMM April 12, 2010

1 Notation

Let there be D documents in the corpus. In standard HMM notation, for each document d:

    z^(d) = (z_1, . . . , z_T),   where z_t ∈ {1, . . . , K}                            (1)

    x^(d) = (x_1, . . . , x_T),   where x_t ∈ {0, 1}^(1×V)                              (2)

where z^(d) is the sequence of hidden states and x^(d) is the sequence of observations. Notation is as follows:

Var        Code Name        Dim        Description

D          docs.size()      1×1        Number of documents in the corpus
T_d        docs[d].size()   1×1        Number of sentences in document d
L_t        sen.size()       1×1        Number of words in sentence t
K          topics           1×1        Number of topics
V          words            1×1        Number of words in the vocabulary
α          alpha            T×2K       α_i(t) = p(x_1, . . . , x_t, z_t = i | b, a, π)
β          beta             T×2K       β_i(t) = p(x_{t+1}, . . . , x_T | z_t = i; b, a, π)
γ          p_dwzpsi         D×T×2K     γ_i(t)^(d) = p(z_t = i | x^(d); b, a, π)
b          local            T×K        b_i(t) = p(x_t | z_t = i)
π          init_probs       1×2K       π_i = p(z_1 = i)
θ          theta            D×K        θ_dz = p(topic z | document d)
φ          phi              K×V        φ_zw = p(word w | topic z)
ε          epsilon          1×1        Binomial parameter, prior over ψ
λ          alpha            1×1        Dirichlet parameter, prior over θ
η          beta             1×1        Dirichlet parameter, prior over φ
E[C_zw]    Czw              K×V        E[C_zw] = Σ_{d=1}^{D} Σ_{t=1}^{T_d} p(z_t^(d) = z, x_t = w | x^(d))
E[C_dz]    Cdz              D×K        E[C_dz] = Σ_{t=1}^{T_d} p(z_t^(d) = z, ψ_t^(d) = 1 | x^(d))

2 Model

The generative process, following Gruber et al. [1], is as follows:

1. For each topic z ∈ {1, . . . , K}: draw φ_z ∼ Dirichlet(η)

2. For each document d ∈ {1, . . . , D}:

   (a) Draw θ ∼ Dirichlet(λ)

   (b) For each sentence x_t in d, draw the topic-transition indicator:

           ψ_t = 1,                 if t = 1
           ψ_t ∼ Binomial(ε),       if t > 1

   (c) For each sentence x_t in d:

       (i) Draw the topic:

           z_t = z_{t−1},               if ψ_t = 0
           z_t ∼ Multinomial(θ),        if ψ_t = 1

       (ii) For each word w_ℓ in sentence x_t: draw w_ℓ ∼ Multinomial(φ_{z_t})
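A minimal simulation of this generative process may help make the roles of ψ and z concrete. All dimensions, hyperparameter values, and names below are illustrative assumptions, not taken from the notes:

```python
# Toy simulation of the HTMM generative process described above.
import numpy as np

rng = np.random.default_rng(0)
K, V, D = 4, 50, 3                                  # topics, vocabulary size, documents (assumed sizes)
eta, lam, eps = 0.1, 1.0, 0.3                       # assumed hyperparameter values

phi = rng.dirichlet(np.full(V, eta), size=K)        # step 1: phi_z ~ Dirichlet(eta)

corpus = []
for d in range(D):
    theta = rng.dirichlet(np.full(K, lam))          # step 2(a): theta ~ Dirichlet(lambda)
    T_d = rng.integers(3, 8)                        # number of sentences (arbitrary)
    z_prev, sentences = None, []
    for t in range(T_d):
        psi = 1 if t == 0 else rng.binomial(1, eps) # step 2(b): psi_1 = 1, else psi_t ~ Binomial(eps)
        z = rng.choice(K, p=theta) if psi == 1 else z_prev   # step 2(c)(i): fresh topic or copy z_{t-1}
        L_t = rng.integers(4, 10)                   # words per sentence (arbitrary)
        words = rng.choice(V, size=L_t, p=phi[z])   # step 2(c)(ii): w_l ~ Multinomial(phi_{z_t})
        sentences.append(words)
        z_prev = z
    corpus.append(sentences)
```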


3 Inference

3.1 Initialization

HTMM parameters ε, θ, and φ are all initialized randomly.
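The notes do not specify the initialization scheme; the sketch below is one plausible choice (symmetric Dirichlet draws and a uniform ε), with assumed toy dimensions, not necessarily what the original implementation does:

```python
# One possible random initialization of the HTMM parameters.
import numpy as np

rng = np.random.default_rng(1)
K, V, D = 4, 50, 3                                   # toy dimensions (assumed)
eps   = rng.uniform(0.1, 0.9)                        # epsilon: topic-transition probability
theta = rng.dirichlet(np.ones(K), size=D)            # theta: D x K, one topic mixture per document
phi   = rng.dirichlet(np.ones(V), size=K)            # phi:   K x V, one word distribution per topic
```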

3.2 E-step

Update HMM-related parameters (M-step within HMM):

    b_i(t) = ∏_{j=1}^{V} φ_ij^{x_t(j)}                                                  (3)

    π_i = { θ_di,   if 1 ≤ i ≤ K
          { 0,      if i > K                                                            (4)
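A small sketch of (3)-(4) for a single document, assuming each sentence x_t is stored as a 0/1 indicator row vector of length V; the log is only there to avoid numerical underflow of the product, and the function name is illustrative:

```python
# Emission probabilities b (eq. 3) and initial state distribution pi (eq. 4).
import numpy as np

def emission_and_init(x, phi, theta_d):
    """x: (T_d, V) indicators; phi: (K, V); theta_d: (K,). Returns b (T_d, K) and pi (2K,)."""
    K = phi.shape[0]
    log_b = x @ np.log(phi).T                       # (T_d, K): sum_j x[t, j] * log phi[i, j]
    b = np.exp(log_b)                               # b[t, i] = prod_j phi[i, j] ** x[t, j]
    pi = np.concatenate([theta_d, np.zeros(K)])     # pi_i = theta_{d,i} for i <= K, else 0
    return b, pi
```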

Run the HTMM version of the forward-backward algorithm (E-step within HMM). For 1 ≤ i ≤ K:


    α_i(1)     = b_i(1) π_i,            α_{i+K}(1) = b_i(1) π_{i+K}                     (5)

    α_i(t)     = ε θ_di b_i(t)                                                          (6)
    α_{i+K}(t) = (1 − ε) [α_i(t−1) + α_{i+K}(t−1)] b_i(t)

    Normalize all α_i(t) by Σ_{j=1}^{K} [α_j(t) + α_{j+K}(t)].                          (7)

    β_i(T) = β_{i+K}(T) = 1                                                             (9)

    β_i(t)     = ε Σ_{j=1}^{K} θ_dj b_j(t+1) β_j(t+1) + (1 − ε) b_i(t+1) β_i(t+1)       (10)
    β_{i+K}(t) = β_i(t)

    Normalize all β_i(t) by Σ_{j=1}^{K} [α_j(t) + α_{j+K}(t)].                          (11)

Then, for 1 ≤ i ≤ 2K,

    γ_i(t) = α_i(t) β_i(t) / Σ_{j=1}^{2K} α_j(t) β_j(t)                                 (13)
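The sketch below mirrors equations (5)-(13) for one document, using the 2K-state chain (states 1..K: topic drawn fresh, ψ = 1; states K+1..2K: topic copied, ψ = 0). Inputs b, π, θ_d, ε follow the notation above; the function and variable names are illustrative, not taken from the original code:

```python
# Forward-backward pass for the HTMM E-step (eqs. 5-13).
import numpy as np

def forward_backward(b, pi, theta_d, eps):
    T, K = b.shape
    alpha = np.zeros((T, 2 * K))
    norms = np.zeros(T)                  # per-step normalizers, reused for the log-likelihood

    # Forward pass, eqs. (5)-(7)
    alpha[0, :K] = b[0] * pi[:K]
    alpha[0, K:] = b[0] * pi[K:]
    norms[0] = alpha[0].sum()
    alpha[0] /= norms[0]
    for t in range(1, T):
        alpha[t, :K] = eps * theta_d * b[t]                                      # psi = 1 half
        alpha[t, K:] = (1 - eps) * (alpha[t - 1, :K] + alpha[t - 1, K:]) * b[t]  # psi = 0 half
        norms[t] = alpha[t].sum()
        alpha[t] /= norms[t]

    # Backward pass, eqs. (9)-(11)
    beta = np.ones((T, 2 * K))                       # eq. (9): beta_i(T) = 1
    for t in range(T - 2, -1, -1):
        fresh = eps * np.sum(theta_d * b[t + 1] * beta[t + 1, :K])      # transition to any fresh topic
        beta[t, :K] = fresh + (1 - eps) * b[t + 1] * beta[t + 1, :K]    # or keep the same topic
        beta[t, K:] = beta[t, :K]                                       # both halves agree
        beta[t] /= norms[t]              # rescale (eq. 11); any positive scale cancels in gamma

    # State posteriors, eq. (13)
    gamma = alpha * beta
    gamma /= gamma.sum(axis=1, keepdims=True)
    return alpha, beta, gamma, norms
```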

To compute the log-likelihood of the observed words, use the α sums from (7) before normalization:

    log p(x^(d)) = Σ_{t=1}^{T} log Σ_{i=1}^{K} [α_i(t) + α_{i+K}(t)]                    (14)
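The inner sum in (14) is exactly the per-step normalizer computed in the forward pass, so under the assumptions of the sketch above the per-document log-likelihood falls out of the values already stored:

```python
# Log-likelihood of one document from the forward-pass normalizers (eq. 14).
import numpy as np

def log_likelihood(norms):
    return float(np.sum(np.log(norms)))
```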


3.3 M-step

Compute MAP estimates for HTMM parameters ε, φ, and θ:


    ε = [ Σ_{d=1}^{D} Σ_{t=2}^{T_d} Σ_{i=1}^{K} γ_i(t)^(d) ] / [ Σ_{d=1}^{D} (T_d − 1) ]        (15)

We note that

    Σ_{i=1}^{K} γ_i(t)^(d) = p(ψ_t^(d) = 1 | x^(d))                                     (16)

    Σ_{i=K+1}^{2K} γ_i(t)^(d) = p(ψ_t^(d) = 0 | x^(d))                                  (17)
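Combining (15) with the identities (16)-(17), the ε update is just the average posterior probability that a topic transition occurred over all non-initial sentences. A minimal sketch, assuming `gammas` is a list with one (T_d × 2K) posterior matrix per document as returned above:

```python
# MAP update for epsilon (eq. 15).
def update_epsilon(gammas, K):
    num = sum(g[1:, :K].sum() for g in gammas)   # sum over t >= 2 of p(psi_t = 1 | x^(d)), eq. (16)
    den = sum(g.shape[0] - 1 for g in gammas)    # sum over d of (T_d - 1)
    return num / den
```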

Let E[C_ij] denote the expected number of times word j was drawn from topic i, according to φ_ij:

    E[C_ij] = Σ_{d=1}^{D} Σ_{t=1}^{T_d} [ γ_i(t)^(d) + γ_{i+K}(t)^(d) ] x_t(j)          (18)

Then,

    φ_ij = E[C_ij] + η − 1                                                              (19)

    Normalize each φ_ij by Σ_{m=1}^{V} φ_im.                                            (20)

Let E[C_di] denote the expected number of times topic i was drawn according to θ_d in document d:

    E[C_di] = Σ_{t=1}^{T_d} γ_i(t)^(d)                                                  (21)

Then,

    θ_di = E[C_di] + λ − 1                                                              (22)

    Normalize each θ_di by Σ_{j=1}^{K} θ_dj.                                            (23)
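A sketch of the remaining M-step updates (18)-(23), under the same assumptions as the E-step sketches: `x_docs[d]` is a (T_d × V) 0/1 indicator matrix and `gammas[d]` the (T_d × 2K) posteriors; all names are illustrative:

```python
# MAP updates for phi (eqs. 18-20) and theta (eqs. 21-23).
import numpy as np

def update_phi_theta(x_docs, gammas, K, V, eta, lam):
    D = len(x_docs)
    C_w = np.zeros((K, V))                        # E[C_ij], expected word counts per topic, eq. (18)
    C_d = np.zeros((D, K))                        # E[C_di], expected topic draws per document, eq. (21)
    for d, (x, g) in enumerate(zip(x_docs, gammas)):
        topic_post = g[:, :K] + g[:, K:]          # p(z_t = i | x^(d)), combining both psi halves
        C_w += topic_post.T @ x                   # weight word indicators by topic posteriors
        C_d[d] = g[:, :K].sum(axis=0)             # only sentences where the topic was drawn (psi = 1)
    phi = C_w + eta - 1.0                         # eq. (19); assumes eta >= 1 so entries stay nonnegative
    phi /= phi.sum(axis=1, keepdims=True)         # eq. (20)
    theta = C_d + lam - 1.0                       # eq. (22); same caveat for lambda
    theta /= theta.sum(axis=1, keepdims=True)     # eq. (23)
    return phi, theta
```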

References

[1] Gruber, A., Rosen-Zvi, M., and Weiss, Y. "Hidden Topic Markov Models," Artificial Intelligence and
Statistics (AISTATS), 2007.
