Professional Documents
Culture Documents
827 17
827 17
Vic hun luyn mu ny c thc hin bng cch tnh ba tham s , , t tp d liu
hun luyn tng ng. y ch l cc php ton thng k thng thng trong c tnh
bng trung bnh ca cc mu hun luyn, c tnh bng khong cch ln nht gia v cc
mu, v l s lng mu c tm trn tt c cc mu.
M hnh thng k HMM cng hay c dng lm phn t nhn dng. Mt m hnh
HMM thng c ba tham s =(A, B, ) c m t trong cc ti liu [3, 2, 4]. Ta c th tnh
lng equal(V, ) = p(V|) thng qua thut ton c lng. V ta c th lu thng tin thng k
p() nh trng hp trn. Vic hun luyn c thc hin thng qua thut ton Baum-Welch
3. Nhn dng cc chui k hiu ri rc
117
Tp ch Khoa hc & Cng ngh - S 1(45) Tp 2/Nm 2008
Ti i chi
Thi ti ch
Ta si ch
Tra cht
Thng tin ngn ng (language information) thng c lu hai dng ph bin, m
hnh ngn ng (language model) v vn phm (grammar) cng vi cc hnh thc tng ng
vn phm. M hnh ngn ng [2, 5, 6] l mt cng c thng k cho php tnh xc sut ca mt
cu ni trong ngn ng. Cc cu ni thng gp s c tn sut cao, cc cu ni sai ng php
hoc t gp s c xc sut xp x khng. M hnh ngn ng phn nh quy lut ng php, ng
ngha, ng dng di dng thng k. Vn phm [7, 8, 11] v cc dng tng ng ca n phn
nh ng php ca ngn ng. Vn phm l cc quy tc ghp k hiu chnh xc v khng th sinh
t ng nh cc quy lut thng k, do chng ta cn phi bin son cc b vn phm phn
nh thng tin ngn ng.
M hnh ngn ng thng c lu thnh m hnh bigram, trong mi t c xc sut
ng u p(W) v xc xut ng sau mt t no p(Wsau | Wtruoc) do cu ni trn c xc
nh nh sau, vi gi nh ta c ba k hiu Ttri, dsi, chci ng vi ba hnh nh cha bit. Ta s
tnh cc lng nh di y v chn ra cu c kh nng cao nht. V d ta s tnh cc lng sau
equal(Ti, Ttri) . equal(i, dsi) . equal(chi, chci) . p(Ti) . p(i | Ti) . p(chi | i)
equal (Ta, Ttri) . equal (ti, dsi) . equal(ch, chci) . p(Ta) . p(ti | Ta) . p(ch | ti)
118
Tp ch Khoa hc & Cng ngh - S 1(45) Tp 2/Nm 2008
119
Tp ch Khoa hc & Cng ngh - S 1(45) Tp 2/Nm 2008
Summary
Bayesian rule and its application to solve recognition problems
Tu Trung Hieu - { tutrunghieu@gmail.com }
Researches on recognition with stochastic approach usually use the Bayesian rule to evaluate the
probabilities of hypotheses and select the hypothesis with the maximum probability to be the recognition
result. In this paper, we would like to introduce the Bayesian rule and its application in different
recognition problems. In addition, we also introduce some recognition concepts, such as pattern space,
language model, grammar, hidden Markov model.
[1] E. T. Jaynes (2003), Probability Theory: The Logic of Science, Cambridge University Press.
[2] Steve Young, Dan Kershaw, Julian Odell, Dave Ollason, Valtcho Valtchev, Phil Woodland (2000),
The HTK Book.
[3] Lawrence R. Rabiner, A Tutorial on Hidden Markov Models and Selected Applications in Speech
Recognition. Proceedings of the IEEE, 77 (2), p. 257286, February 1989.
[4] Gernot A. Fink and Thomas Pltz (2007), Markov Models for Handwriting Recognition, ICDAR
2007 Tutorial, Curitiba, Brazil
[5] Fei Song, W. Bruce Croft (1999), A General Language Model for Information Retrieval.
[6] Jay M. Ponte, W. Bruce Croft (1998), A Language Modeling Approach to Information Retrieval,
[7] Jean-Michel Autebert, Jean Berstel, Luc Boasson ((1997), Context-Free Languages and Push-Down
Automata.
[8] J.E. Hopcroft and J.D. Ullman (1979). Introduction to Automata Theory, Languages, and
Computation, Addison-Wesley,
[9] Philippe Mclean. Nigel Horspool (1996), A Faster Earley Parser.
[10] Mark Hepple (1999), An Earley-style Predictive Chart Parsing Method for Lambek Grammars.
[11] Alon Lavie, Masaru Tomita (1993), GLR* An Efficient Noise-skipping Parsing Algorithm For
Context Free Grammars.
[12] J. C. Chappelier, M. Rajman (1998), A generalized CYK algorithm for parsing stochastic CFG.
120