Download as pdf or txt
Download as pdf or txt
You are on page 1of 19

Chng I.

Gii thiu
1.1. t vn
Ngy nay, cng vi s pht trin vt bc ca lnh vc cng ngh thng tin v
mng my tnh ton cu Internet, dn n s ra i v bng n ca ngun d
liu. Vi s lng d liu ngy mt tng, mt yu cu ln c t ra l lm
sao c th t chc v tm kim d liu mt cch hiu qu nht. V bi ton
phn loi (Classification) [1] mt tin trnh x l nhm gn nhn, xp cc mu
d liu hay cc i tng vo mt trong cc loi c nh ngha trc
v sau d liu s c trng bi tp cc thuc tnh ca cc i tng cha
trong loi l mt trong nhng gii php hp l cho yu cu trn. Nhim v
chnh ca bi ton phn loi d liu l cn xy dng m hnh phn loi khi
c mt d liu mi vo th m hnh phn loi s cho bit d liu thuc loi
no. Nhng mt thc t l khi lng d liu qu ln, vic phn loi d liu
th cng l iu khng th. Hng gii quyt l xy dng mt chng trnh t
ng cho php phn loi cc thng tin d liu [1].
C nhiu hng tip cn, k thut hc c xut cho vic xy dng
cc m hnh p dng trong cc bi ton phn loi ni trn, tiu biu bao gm
[1]: GMM (Gaussian mixture models), HMM (Hidden Markov models), mng
Bayesian, mng Neural (Neural Network), KNN (K-Nearest Neighbour - da
trn nhng im lng ging gn nht), Cc h thng m (Fuzzy Systems),
Support Vector Machine (SVM) [1, 7], etc. Hu ht cc hng tip cn cho cc
bi ton phn loi d liu u c phn vo hai hng tip cn chnh. Hng
tip cn th nht, hng tip cn tham s ha trong nhng hiu bit trc
v phn b ca tng im d liu c gi nh, i vi hng tip cn ny
mt h thng phn loi s c c trng bi cc tham s v nhim v ca qu
trnh hc l xc nh, c lng cc tham s ph hp cho m hnh da trn tp
d liu ban u nhm tng qut ha, m hnh ha nhng c im chung nht
1
ca tp d liu hc, cc k thut hc tiu biu cho hng tip cn ny bao gm:
GMM (Gaussian mixture models), HMM (Hidden Markov models), Hypothesis
testing, mng Bayesian. Hng tip cn th hai, hng tip cn khng tham s,
vi hng tip cn ny khng cn c s bit trc v phn b ca d liu. Mt
h thng hc s c xy dng da trn bn cht ca d liu, qu trnh hc l
vic xy dng v c lng cc tham s ca m hnh t tp d liu sao cho gi
tr ca hm li (t l li trn tp d liu ban u) trong m hnh t gi tr
nh nht. Vi hng tip cn ny, cc k thut hc tiu biu c cp bao
gm: Mng Neural (Neural Network) [2, 3, 4, 5], KNN (K-Nearest Neighbour)
[5], Cc h thng m (Fuzzy Systems) [5, 6] v Support Vector Machine (SVM)
[8, 9, 10]. Trong hng tip cn khng tham s, k thut hc Support Vector
Machine (SVM), mt thut ton tiu biu ca phng php kernel, l mt k
thut hc mnh m c xy dng trn nguyn tc cc tiu ha ri ro cu trc
(Structual Risk Minization) trong l thuyt my hc (Machine Learning). Vi
t tng tip cn trn, SVM c bit thch hp cho vic phn loi trn nhng
tp d liu ln v nhiu chiu, bn cnh SVM cn pht huy tc dng cho c
nhng tp d liu vi s lng b hn ch v c nhng tp d liu c cha mt
phn d liu ban u b phn loi sai [7, 8, 9]. Trong khi cc m hnh phn
loi cng hng tip cn nh nh Mng Neural (Neural Network) hay Cc h
thng m (Fuzzy Systems) hay cc m hnh thuc hng tip cn tham s ha
cho kt qu phn loi tt trn nhng tp d liu ln v c s tch bit r rng
gia cc loi d liu khc nhau nhng i vi nhng tp d liu ban u b hn
ch hoc d liu c tn ti nhiu (noisy - nhng im d liu b phn loi sai)
th chnh xc v kh nng tng qut ha ca m hnh cha t c kt qu
ti u.
Trong cc m hnh phn loi, vic phn loi mang li kt qu tt khi c
xy dng da trn mt tp d liu hc ln c gn nhn (labeled Data).
2
Tuy nhin, khi p dng vo thc t th iu kin ny s gp kh khn v cc
tp hc ban u c s lng hn ch v thng c gn nhn bi con ngi
nn i hi tn rt nhiu thi gian v cng sc. Trong khi , cc d liu cha
gn nhn (unlabeled Data) th li rt phong ph. Do vy, vic xem xt n cc
k thut hc khng cn nhiu d liu c gn nhn, c kh nng tn dng
c s lng phong ph ca cc d liu cha c gn nhn nhn c s quan
tm ca nhiu nh khoa hc trn th gii. Mt hng tip cn gii quyt bi
ton vi tn gi hc bn gim st (Semi-Supervised Learning) [13, 14] c
xut cho php s dng c ngun d liu gn nhn v ngun cha gn nhn, v
trong nhiu nghin cu ca ngnh my hc (Machine Learning) chng minh
c th tm ra c d liu cha gn nhn khi s dng vi mt s lng nh d
liu gn nhn. Cng vic thu c kt qu ca d liu gn nhn thng i hi
trnh t duy v kh nng ca con ngi, cng vic ny tn nhiu thi gian
v chi ph, do vy d liu gn nhn thng rt him v t, trong khi d liu
cha gn nhn th li rt phong ph. Trong trng hp , chng ta c th s
dng c ch hc bn gim st st (Semi-Supervised Learning) thi hnh cc
cng vic gn nhn d liu quy m ln. Hc bn gim st s dng c ngun
d liu gn nhn v ngun d liu cha c gn nhn vi mc tiu l lm
tng chnh xc cho vic hc dn n ci thin chnh xc cho qu trnh
phn loi v gn nhn d liu. Nh vy, bn cht ca phng php hc bn
gim st l phng php hc c gim st kt hp vi vic tn dng cc d liu
cha gn nhn. Trong phn b sung thm vo cho d liu gn nhn, thut ton
cung cp mt s thng tin gim st, vic ny khng cn thit cho tt c cc
mu hun luyn. Thng thng thng tin ny s c kt hp vi mt vi mu
d liu cho trc.
gii quyt vn ca vic hc bn gim st (Semi-Supervised Learning),
mt s k thut, m hnh tiu biu c pht trin v s dng [13, 14]: Cc m
3
hnh hn hp (Mixture Models), k thut Self-Training, Co-training, cc phng
php d trn th (Graph-Based Methods) v cc phng php p dng trn
k thut hc Support Vector Machine (Semi-supervised SVMs - TSVMs) nh:
SVMLight, Deterministic Annealing for Semi-supervised Kernel Machines, The
Concave-Convex Procedure for TSVMs, etc. C th ni rng, tng u tin
v s dng d liu cha gn nhn trong phn lp l thit lp Self-Training.
Self-Training l k thut hc bn gim st c s dng kh ph bin, vi mt
b phn lp (Classifier) ban u c hun luyn bng mt s lng nh cc
d liu gn nhn. Sau , s dng b phn lp ny gn nhn cc d liu
cha gn nhn. Cc d liu c gn nhn c tin cy cao (Vt trn mt
ngng no ) v nhn tng ng ca chng c a vo tp hun luyn
(Train Set). Tip , b phn lp c hc li trn tp hun luyn mi v th
tc lp tip tc. mi vng lp, b hc s chuyn mt vi cc mu c tin
cy cao nht sang tp d liu hun luyn cng vi cc d on phn lp ca
chng. Vi nhng bi ton nu cc b phn lp gim st c xy dng t
trc l phc tp v kh sa i th Self-Training s l mt la chn c u
tin. Tuy nhin k thut Seft-Training c mt khuyt im l nu d liu
trong ln hun luyn trc m b sai, th s dn n vic phn loi sai ngy
mt m rng trong nhng ln hun luyn tip theo sau v kt qu cui cng
khng nh mong i. Trong khi , nu cc c trng (Features) c s phn
chia t nhin thnh hai tp th Co-Training c la chn. Co-Training da
trn gi thit rng cc c trng (features) c th c phn chia thnh 2 tp
con. Mi tp con ph hp hun luyn mt b phn loi tt. Hai tp con
phi tho mn tnh cht c lp iu kin (Conditional Independent). Thut
ton Co-Training tri qua mt quy trnh bao gm cc giai on: Ban u, hc
hai b phn loi ring r bng d liu c gn nhn trn hai tp thuc
tnh con tng ng. Tip theo, mi b phn loi sau li phn loi cc d
4
liu cha c gn nhn (Unlabel Dataset), chng la chn ra cc d liu cha
c gn nhn (Unlabeled Dataset) cng vi nhn d on ca chng (Cc d
liu mu c tin cy cao) dy cho b phn loi khc. V cui cng, mi
b phn loi c hc li (Re-train) vi cc mu hun luyn c cho bi b
phn loi khc v tin trnh lp bt u. i vi k thut Co-Training th tp
d liu c gn nhn ban u phi ln v c tnh cht c phn thnh
hai loi ring r th kt qu mi em li hiu qu. Bn cnh cc m hnh
hn hp (Mixture Models) s cung cp mt khung mu cho vic hc bn gim
st v cc khung mu ny c th mang li hiu qu cao nu m hnh c sinh
ra l chnh xc, mang tnh tng qut cho d liu nhng iu ny rt kh bi v
d liu c gn nhn khng nhiu v nu m hnh khi to ban u sai th
dn n m hnh sau cng khng mang li hiu qu v m hnh khng chnh
xc cho phn loi d liu. Mt hng tip cn cng ang c s dng trong
c ch hc bn gim st l hng tip cn da trn th, trong hng tip
cn ny mt s lin kt gia cc im nt d liu cng loi c hnh thnh,
tuy nhin u im ca hng tip cn ny ch mang li hiu qu khi cc d liu
c kt ni vi nhau vi mt trng s ln ng ngha vi vic chng c khuynh
hng cng loi d liu, v m hnh phn loi sau cng c hnh thnh c th
mang li kt qu phn loi khng tt khi mt d liu no b phn loi sai
dn n nh hng cho cc d liu c phn loi sau . K n l cc phng
php c xy dng da trn k thut hc Support Vector Machine (SVM) vi
vic k tha nhng u im vt tri trong vic hc ca k thut hc Support
Vector Machine (SVM). Cc phng php ny cho kt qu rt tt trn nhng
tp d liu c s phn tch r rng gia cc loi d liu v c bit pht huy
u im trn c nhng tp d liu c t s lng d liu c nhn ban u, trn
c nhng tp d liu c s chiu ln (large-scale Dataset). Tuy nhin nu tp
d liu ban u c s lng nhiu cao v khng c s phn tch r rng gia
5
cc loi d liu, th cc phng php theo hng tip cn ny cng c chung
mt hn ch so vi nhng phng php theo nhng hng tip cn c nu
trn cho kt qu khng c ti u.
Hin nay, bn cnh nhng bi ton vi b d liu cn bng (Balanced Data),
trong thc t d liu ca nhiu bi ton phn loi khng cn bng m b lch
(Unbalanced Data) t l chnh lch gia cc lp d liu l rt r rng, c th
ln n hng chc ngn hoc hng triu, s ph bin ca nhng tp d liu
ny ngy mt ln v thu ht c nhiu s quan tm ca cc nh nghin cu.
V d nh trong cc bi ton d s thm nhp vo mng (Network Intrusion
Detection), d trng thi hng ca my (Machine Fault Detection)..., d liu
bnh thng (normal data packet ca ca ngi dng bnh thng cho cc dch
v nh HTTP, FTP, trng thi bnh thng ca my) chim u th so vi d
liu bt bnh thng abnormal data (Packet ca hacker, trng thi hng ca
my). V xy dng cc m hnh phn loi cho dng bi ton d liu khng
cn bng, bn cnh nhng hng tip cn, k thut hc c xut cho bi
ton phn loi d liu c cp trn nh (GMM (Gaussian mixture
models), HMM (Hidden Markov models), Hypothesis testing, mng Bayesian,
mng Neural (Neural Network), KNN (K-Nearest Neighbour - da trn nhng
im lng ging gn nht), Cc h thng m (Fuzzy Systems), Support Vector
Machine (SVM)), th hin nay cc hng tip cn c xut da trn cc
k thut Clustering (Gom cm), Rule based (da trn tp lut) v c bit
l cc bin th ca k thut hc Support Vector Machine (SVM) nhm tn
dng nhng u im ca k thut hc ny v ang c quan tm, pht
trin trong c cc phng php in hnh c xut v p dng bao
gm: One Class Support Vector Machine (OCSVM) [15], Support Vector Data
Description (SVDD) [16, 17, 18, 19, 21] c xut v thu c kt qu
phn loi rt tt khi c xy dng da trn cc dng bi ton phn loi. Trong
6
nhng dng bi ton vi nhng tp d liu khng cn bng th vic hc bn
gim st cng c thc hin da trn cc hng tip cn c cp nh:
Cc m hnh hn hp (Mixture Models), k thut Self-Training, Co-training,
cc phng php d trn th (Graph-Based Methods) v cc phng php
p dng trn k thut hc Support Vector Machine (Semi-supervised SVMs -
TSVMs) ban u c xut v cho kt qu tt trn nhng tp d liu
khng cn bng bao gm: Semisupervised One-Class Support Vector Machines
[20], Fuzzy Entropy Semi-supervised Support Vector Data Description [21].
1.2. Mc tiu ca lun vn
Da vo nhng hin trng phn tch trn cho thy rng, khi x l cc bi
ton phn loi t ng (cho c nhng bi ton d liu cn bng hay nhng
bi ton d liu khng cn bng) trong cc k thut hc, hng tip cn tiu
biu c xut bao gm (GMM (Gaussian mixture models), HMM (Hidden
Markov models), Hypothesis testing, mng Bayesian, mng Neural (Neural Net-
work), KNN (K-Nearest Neighbour - da trn nhng im lng ging gn nht),
Cc h thng m (Fuzzy Systems), Support Vector Machine (SVM) [1, 7], etc)
th hng tip cn da trn k thut hc Support Vector Machine (SVM)
mang n nhng u im v kt qu vt tri v thi gian xy dng, phn
loi cng nh chnh xc ca m hnh thch ng vi nhiu loi d liu khc
nhau. Trong , i vi nhng bi ton d liu cn bng, c rt nhiu cc bin
th ca k thut hc Support Vector Machine (SVM) c xut, tiu biu
Least Square SVM, Proximal SVM, Twin SVM, etc. i vi cc bi ton d
liu khng cn bng, nhng bin th tiu biu ca k thut hc Support Vector
Machine (SVM) gm SVDD (Support Vector Data Description), OCSVM (One
Class Support Vector Machine), etc. Trong m hnh OCSVM (One Class Sup-
port Vector Machine), mc d OCSVM c hiu sut tt cho tp d liu khng
7
cn bng, nhng hn ch ca m hnh ny l cha tn dng c cc mu d
liu thuc lp tiu s lm tng chnh xc cho m hnh hc. Do , mc
tiu ca lun vn l xut mt hng tip cn ci tin ca m hnh OCSVM,
cho php tn dng c nhng im d liu thuc lp tiu s cho vic xy dng
m hnh phn loi.
Bn cnh , nh c cp, trong cc m hnh phn loi, xy dng
c b phn loi c tin cy cao, i hi phi c mt lng ln cc mu d
liu hun luyn tc l cc d liu c gn nhn loi tng ng. Cc d liu
hun luyn ny thng rt him v t v i hi thi gian v cng sc ca con
ngi. Do vy phng php hc bm gim st ra i, mt phng php hc
khng cn nhiu d liu gn nhn v c kh nng tn dng c cc ngun d
liu cha gn nhn rt phong ph nh hin nay vi vic s dng thng tin cha
trong c d liu cha gn nhn v tp hun luyn, phng php hc ny hin
nay c s dng rt ph bin v tnh tin li ca n. Phn ln cc hng tip
cn cho bi ton phn loi vi phng php hc bn gim st c cp bn
trn (Mixture Models, Self-Training, Co-training, Graph-Based Methods) u
gp phi nhng vn khi tp d liu ban u c s lng t hoc d liu c
cha nhiu (Noisy) hoc cc im bt thng (Outliner) dn n m hnh phn
loi c xy dng cho kt qu phn loi khng cao. Tuy nhin, hng tip cn
k tha t k thut hc Support Vector Machine (SVM) nh Semi-supervised
SVMs (Transductive SVMs) khc phc c nhng vn trn, bn cnh
cc hng tip cn ny pht huy hiu qu rt tt vi nhng tp d liu ln
v bn thn d liu c nhiu chiu (c im thng nhn thy hin nay ca
d liu).
i vi bi ton phn loi d liu cn bng, cc k thut hc bn gim
st da trn k thut hc Support Vector Machine (SVM) c xut v
mang li nhng kt qu rt tt cho vic xy dng m hnh phn loi, cc k
8
thut SVM-Light Support Vector Machine, Deterministic Annealing for Semi-
supervised Kernel Machines, The Concave-Convex Procedure for TSVMs, etc.
i vi nhng bi ton phn loi d liu khng cn bng th cc k thut hc
bn gim st da trn k thut hc Support Vector Machine (SVM) cn rt
hn ch, trong mt vi nhng k thut ban u c xut cho hng
tip cn ny bao gm gm Semisupervised One-Class Support Vector Machines
v Fuzzy Entropy Semi-Supervised Support Vector Data Description. Mc tiu
ca lun vn l tip tc pht trin nhng k thut k tha nhng thnh cng
ban u t tm hiu, nghin cu v xut mt hng tip cn mi
trong m hnh hc bn gim st vi cc bi ton phn loi d liu khng cn
bng da trn nhng hng tip cn c xut v mang li thnh
cng trong m hnh hc bn gim st.
1.3. Ni dung thc hin
Nhm t c nhng mc tiu nu, v mt l thuyt, lun vn s tin hnh
nghin cu nhng hng tip cn, k thut cho vic xy dng cc m hnh phn
loi. C th hn, lun vn s tp trung i vo phn tch bn cht ca k thut
hc Support Vector Machine (SVM) cho bi ton phn loi vi nhng tp d
liu cn bng v khng cn bng. Bn cnh , lun vn cng s tp trung vo
vic tm hiu cc phng php c p dng cho hng tip cn hc bn gim
st trong bi ton phn loi. Cc cng vic c th bao gm:
Tm hiu nhng vn c bn v l thuyt hc, bi ton phn loi, bn
cnh i vo phn tch bn cht ca cc phng php Kernel (Kernel
Methods) trong lnh vc my hc (Machine Learning) cng nh nhng k
thut, hng tip cn tiu biu da trn cc phng php Kernel.
9
Tm hiu v phn tch bn cht ca k thut hc Support Vector Machine
(SVM) v cc bin th ca k thut ny cho bi ton phn loi vi tp
d liu cn bng v khng cn bng (bi ton phn loi mt lp). i vo
phn tch nhng im mnh v hn ch ca nhng phng php c xy
dng k tha t k thut hc Support Vector Machine (SVM). Nghin
cu kh nng ci tin v m rng cho cc bin th ca k thut hc hc
Support Vector Machine (SVM) trong bi ton phn loi vi tp d liu
khng cn bng.
Tm hiu v bn cht ca cc hng tip cn hc cho bi ton phn loi:
hc c gim st, hc bn gim st v hc khng gim st. K n i vo
tm hiu cc phng php c p dng cho k thut hc bn gim st.
Nghin cu kh nng s dng cc phng php p dng cho hng
tip cn hc bn gim st trn m hnh ci tin c xut.
V mt thc nghim, m hnh ci tin do lun vn xut v cc phng php
p dng trn m hnh ny cho hng tip cn hc bn gim st s c th
nghim v nh gi theo phng php offline, ngha l c th nghim trn
mt b d liu mu trong lnh vc phn loi, v d nh c xem l mt b d
liu chun ca UCI tin hnh nhng thc nghim trong lnh vc ca lun
vn. Qu trnh th nghim ca lun vn s c tin hnh theo nhng bc
chnh nh sau:
Chun b d liu.
Xy dng quy trnh th nghim.
Tin hnh th nghim theo phng php offline phn tch v nh gi.
10
1.4. Tm tt nhng ng gp ca lun vn
Lun vn thnh cng trong vic ci tin m hnh One Class Support Vector
Machine (OCSVM) mt bin th ca k thut hc Support Vector Machine
(SVM) cho bi ton phn loi vi nhng tp d liu khng cn bng. Trong
m hnh ci tin vi tn gi Large Margin One Class Support Vector Machine
(LM-OCSVM), vic xy dng m hnh phn loi tn dng c nhng im
d liu thuc lp tiu s cho vic tng chnh xc ca m hnh hc. Bn
cnh , lun vn cng tch hp c cc phng php hc bn gim st
tiu biu c xut cho k thut hc Support Vector Machine vo m hnh
Large Margin One Class Support Vector Machine (LM-OCSVM) gii quyt
bi ton ton hc bn gim st cho nhng tp d liu khng cn bng.
Lun vn tin hnh th nghim cc phng php hc bn gim st c
p dng cho m hnh xut cng nh phn tch kt qu nhiu kha cnh khc
nhau. Thng qua kt qu thc nghim, lun vn chng minh c nhng
th mnh ca hng tip cn c xut v cc phng php hc bn gim
st xy dng trn m hnh. ng thi lun vn cng phn tch lm r u v
khuyt im ca m hnh cng cc phng php c s dng t gi
cch chn p dng cc phng php sao cho c hiu qu nht.
Lun vn c thc hin trong khong thi gian t thng 10/2013 n thng
6/2014. Nhng kt qu chnh ca lun vn c chp nhn v trnh by ti
hi ngh khoa hc Nafosted trong nc v c chp nhn trnh by ti hi
ngh International Joint Conference on Neural Networks (IJCNN) nm 2014.
Bo co ca lun vn c trnh by thnh su chng nh sau:
Chng 1. Gii thiu v trnh by tng quan v nhng vn ca bi
ton phn loi, nu ln mc tiu, ni dung nghin cu v nhng kt qu
t c ca lun vn.
11
Chng 2. Trnh by khi qut v l thuyt hc v bi ton phn loi.
Phn tch bn cht ca nhng k thut, hng tip cn tiu biu (Support
Vector Machine v Suport Vector Data Description) c p dng cho bi
ton phn loi vi nhng phng php ht nhn (Kernel Methods) trong
lnh vc my hc (Machine Learning. Tm hiu v phn tch k thut hc
bn gim st v nhng phng php p dng trong k thut hc bn gim
st c p dng hin nay trong bi ton phn loi.
Chng 3. Gii thiu v trnh by nhng bin th ca k thut hc Support
Vector Machine khi gii quyt nhng bi ton vi d liu cn bng v
nhng bi ton vi d liu khng cn bng. K n s i vo trnh by
v phn tch cc phng php p dng trong k thut hc bn gim st
trn k thut hc Support Vector Machine.
Chng 4. Trnh by v m hnh Large Margin Once Class Support Vector
Machine, s ci tin v m rng ca m hnh Once Class Support Vector
Machince, cng nh cc phng php hc bn gim st c p dng v
th nghim trn m hnh ny.
Chng 5. Trnh by qu trnh tin hnh cng nh kt qu th nghim
ca cc phng php hc bn gim st p dng trn m hnh xut trn
cc b d liu chun ca UCI. Nhng nh gi v phn tch cng c
trnh by nhm gip cho vic la chn p dng cc phng php s dng
trn m hnh xut.
Chng 6. Phn kt lun v nu ln mt s hng pht trin trong tng
lai ca lun vn.
12
Chng II. L thuyt hc, cc phng php ht
nhn (kernel methods) v k thut hc bn gim
st (semi-supervised learning) trong bi ton phn
loi
2.1. L thuyt hc (learning theory)
2.1.1. Bi ton hc (Learning Problem)
2.1.2. Bi ton phn loi (Classification Problem)
2.2. Cc phng php ht nhn (Kernel Methods) trong
lnh vc my hc (Machine Learning)
2.2.1. tng chnh ca cc phng php ht nhn (Kernel Methods)
2.2.2. Cc hm ht nhn (Kernel Functions) thng dng
2.2.2.1. Linear Kernel
2.2.2.2. Polynomial Kernel
2.2.2.3. Gaussian Kernel
2.2.2.4. Sigmoid Kernel
13
2.2.3. Ht nhn xc nh dng (Positive Definite Kernel)
2.2.4. Bn ht nhn m phng (The Reproducing Kernel Map)
2.2.5. Bn ht nhn Mercer (The Mercer Kernel Map)
2.3. Nhng k thut, hng tip cn hc tiu biu p dng
tng phng php ht nhn (Kernel Method)
2.3.1. K thut hc Support Vector Machine
2.3.1.1. S hnh thnh k thut hc Support Vector Machine
2.3.1.2. Li gii cho k thut hc Support Vector Machine
2.3.1.3. Nhng l thuyt nn tng ca k thut hc Support Vector Machine
2.3.2. K thut hc Suport Vector Data Description
2.3.2.1. S hnh thnh k thut hc Suport Vector Data Description
2.3.2.2. Li gii cho k thut hc Suport Vector Data Description
2.3.2.3. Cc bin th ca k thut hc Suport Vector Data Description
14
2.4. K thut hc bn gim st (Semi-supervised Learning)
2.4.1. Tng quan v k thut hc bn gim st
2.4.2. Cc phng php, m hnh hc bn gim st tiu biu
2.4.2.1. Mixture Models
2.4.2.1.1. Tng quan v phng php Mixture Models
2.4.2.1.2. Nhng gi nh ca phng php Mixture Models
2.4.2.2. Co-Training
2.4.2.2.1. Tng quan v Co-Training
2.4.2.2.2. Nhng gi nh ca phng php Co-Training
2.4.2.3. Self-Training
2.4.2.3.1. Tng quan v phng php Self-Training
2.4.2.3..2. Nhng gi nh ca phng php Self-Training
2.4.2.4. Graph-Based Semi-Supervised Learning
2.4.2.4.1. Tng quan v phng php Graph-Based
15
2.4.2.4.2. Nhng gi nh ca phng php Graph-Based
2.4.2.5. Semi-Supervised Support Vector Machines
2.4.2.5.1. Tng quan v phng php Support Vector Machines
2.4.2.5.2. Nhng gi nh ca phng php Support Vector Machines
Chng III. K thut hc Support Vector Ma-
chine v cc bin th tiu biu. Nhng phng
php hc bn gim st p dng trong k thut
hc Support Vector Machine
3.1. Bin th ca k thut hc Support Vector Machine
trong bi ton phn loi d liu cn bng.
3.1.1. Least square SVM
3.1.1.1. S hnh thnh Least square Support Vector Machine
3.1.1.2. Li gii ca Least square Support Vector Machine
3.1.2. Proximal Support Vector Machine
3.1.2.1. S hnh thnh Proximal Support Vector Machine
3.1.2.2. Li gii ca Proximal Support Vector Machine
16
3.2. Bin th ca k thut hc Support Vector Machine
trong bi ton phn loi d liu khng cn bng.
3.2.1. Once Class Support Vector Machine
3.2.1.1. S hnh thnh Once Class Support Vector Machine
3.2.1.2. Li gii ca Once Class Support Vector Machine
3.3. Nhng phng php hc bn gim st tiu biu p
dng trong k thut hc Support Vector Machine
3.3.1. Semi-supervised Support Vector Machine Light (Label-switch
Retraining)
3.3.1.1. tng S3VM Light
3.3.1.2. Thut ton S3VM Light
3.3.2. Deterministic Annealing Semi-supervised Support Vector Ma-
chine
3.3.2.1. tng Deterministic Annealing S3VM
3.3.2.2. Thut ton Deterministic Annealing S3VM
17
3.3.3. Concave-convex Procedure Semi-supervised Support Vector Ma-
chine
3.3.3.1. tng Concave-convex Procedure S3VM
3.3.3.2. Thut ton Concave-convex Procedure S3VM
Chng IV. M hnh Large Margin Once Class
Support Vector Machine
4.1. Tng quan v m hnh Large Margin Once Class SVM
4.1.1. S hnh thnh ca Large Margin Once Class Support Vector Machine
4.1.2. Li gii cho ca Large Margin Once Class Support Vector Machine
4.2. Cc phng php hc bn gim st xut p dng
trn m hnh Large Margin Once Class SVM
4.2.1. Semi-supervised One-class Support Vector Machine
4.2.1.1. Bi ton ti u (Optimization Problem)
4.2.1.2. Thut ton (The Algorithm) ca Semi-supervised One-class Support
Vector Machine
18
4.2.2. Fuzzy Entropy Semi-supervised Large Margin One -Class Sup-
port Vector Machine
4.2.2.1. Bi ton ti u (Optimization Problem)
4.2.2.2. Li gii v thut ton (The Solution and Algorithm) ca FES2LM-
OCSVM
Chng V. Thc nghim
5.1. Quy trnh thc nghim
5.2. Kt qu thc nghim
Chng VI. Tng kt
6.1. Kt lun
6.2. Hng pht trin
19

You might also like