Professional Documents
Culture Documents
HTMM Notes
HTMM Notes
1 Notation
Let there be D documents in the corpus. In standard HMM notation, for each document d:
2 Model
1
Diane Hu Notes on HTMM April 12, 2010
3 Inference
3.1 Initialization
3.2 E-step
V
Y (j)
bi (t) = φij xt (3)
j=1
(
θdi , if 1 ≤ i ≤ K
πi = (4)
0, if i > K
2
Diane Hu Notes on HTMM April 12, 2010
3.3 M-step
We note that
K
(d)
X
γi (t)(d) = p(ψt = 1|x(d) ) (16)
i=1
2K
(d)
X
γi (t)(d) = p(ψt = 0|x(d) ) (17)
i=K+1
Let E[Cij ] denote the expected number of times word j was drawn from topic i, according to φij :
X Td h
D X ixt(j)
(d) (d)
E[Cij ] = γi (t) + γi+K (t) (18)
d=1 t=1
Then,
Let E[Cdi ] denote the expected number of times topic i was drawn according to θd in document d:
T X
X K
E[Cdi ] = γi (t)(d) (21)
t=1 i=1
Then,
References
[1] Gruber, A., Rozen-Zvi, M., Weiss, Y. “Hidden Topic Markov Models,” Artificial Intelligence and
Statistics (AISTATS), 2007.