Machine Learning
Contents
Introduction
Concept learning
Decision trees
Faculty of Information Technology - Ho Chi Minh City University of Technology (Đại Học Bách Khoa Tp. HCM). Lecture notes for the course: Artificial Intelligence
Concept learning
Concept learning:
Works from a set of many elements (instances). Each instance is labeled as belonging to the concept or not (belongs: positive; does not belong: negative).
Example:
[Table: training examples 1 to 4]
Hypothesis: also understood as a concept. It is a conjunction of constraints on the attributes of an instance, and can be written as:
<Attr1=Value1, Attr2=Value2, ..., AttrN=ValueN>, or
<Value1, Value2, ..., ValueN> when the attribute order is fixed in advance.
AttrK: the K-th attribute.
ValueK: the value for the K-th attribute. It can be:
? : any value is acceptable.
∅ : no value is acceptable.
A specific value.
If X is an instance and X satisfies all the constraints of hypothesis h, then h classifies X as positive (h(X) = 1).
Example: the hypothesis that Aldo enjoys his sport on cold days with high humidity is written as:
<?, Cold, High, ?, ?, ?>
The most specific hypothesis:
<∅, ∅, ∅, ∅, ∅, ∅>
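The rule h(X) = 1 can be sketched in a few lines of Python. This is only an illustration: the names `ANY`, `EMPTY`, and `classify` are hypothetical, `None` stands in for the ∅ constraint, and the two sample instances are made up for the demonstration.

```python
ANY = "?"      # constraint that accepts any value
EMPTY = None   # stands in for the empty constraint: no value is acceptable

def classify(h, x):
    """Return 1 if instance x satisfies every constraint of hypothesis h, else 0."""
    for constraint, value in zip(h, x):
        if constraint is EMPTY:
            return 0                      # nothing satisfies the empty constraint
        if constraint != ANY and constraint != value:
            return 0                      # a specific value is required and differs
    return 1

h = (ANY, "Cold", "High", ANY, ANY, ANY)                      # hypothesis from the example
x_cold = ("Sunny", "Cold", "High", "Strong", "Warm", "Same")  # illustrative instance
x_warm = ("Sunny", "Warm", "High", "Strong", "Warm", "Same")  # illustrative instance
most_specific = (EMPTY,) * 6

print(classify(h, x_cold))              # 1
print(classify(h, x_warm))              # 0
print(classify(most_specific, x_cold))  # 0
```

The most specific hypothesis rejects every instance, which is why it is a natural starting point for a specific-to-general search.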
Set of instances
The set from which the concept is extracted. Notation: X.
In the example above: the set of instances = the set of days, each day described by 6 attributes.
Concept learning amounts to:
Searching the space of possible hypotheses.
Returning the best hypothesis that fits the training examples.
Number of possible hypotheses:
N attributes in each hypothesis.
M specific values for each attribute, plus two more values: ? and ∅.
1 + (M+1)^N
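A quick arithmetic check of this count, assuming every attribute has the same number M of specific values. The function name is illustrative. The sketch also prints the purely syntactic count (M+2)^N; the two differ because every hypothesis containing ∅ rejects all instances, so all of them collapse into a single hypothesis.

```python
def count_hypotheses(n_attributes, m_values):
    """Return (syntactic count, semantically distinct count) of hypotheses."""
    syntactic = (m_values + 2) ** n_attributes     # each slot: M values, "?" or the empty constraint
    distinct = 1 + (m_values + 1) ** n_attributes  # all hypotheses with an empty constraint count once
    return syntactic, distinct

print(count_hypotheses(6, 2))  # (4096, 730)
```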
Ordering of hypotheses (general to specific):
Example: h2 ≥g h1 (h2 is more general than or equal to h1) for the following h1 and h2:
h1 = <Sunny, ?, ?, Strong, ?, ?>
h2 = <Sunny, ?, ?, ?, ?, ?>
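The ≥g relation on this representation can be checked attribute by attribute. A minimal sketch, assuming neither hypothesis contains the ∅ constraint; the function name is hypothetical.

```python
ANY = "?"

def more_general_or_equal(h_a, h_b):
    """True if h_a >=_g h_b, i.e. every instance accepted by h_b is accepted by h_a.

    Assumes neither hypothesis contains the empty constraint.
    """
    return all(a == ANY or a == b for a, b in zip(h_a, h_b))

h1 = ("Sunny", ANY, ANY, "Strong", ANY, ANY)
h2 = ("Sunny", ANY, ANY, ANY, ANY, ANY)

print(more_general_or_equal(h2, h1))  # True
print(more_general_or_equal(h1, h2))  # False
```

The relation is only a partial order: two hypotheses such as <Sunny, ?, ...> and <?, Warm, ...> are not comparable in either direction.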
FIND-S algorithm:
1. h = the most specific hypothesis in H.
2. For each x in the training set with c(x) = 1:
     For each constraint a_i in h:
       IF a_i is satisfied by x THEN do nothing.
       ELSE replace a_i by the next more general constraint that is satisfied by x.
3. Output h.
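The steps above can be sketched in Python as follows. This is an illustration under assumptions: `find_s` is a hypothetical name, `None` stands in for ∅, and of the three training examples only the second is taken from these notes (instance 2); the other two are assumed for the demonstration.

```python
ANY = "?"
EMPTY = None  # stands in for the empty constraint

def find_s(examples, n_attributes):
    """FIND-S: start from the most specific hypothesis and generalize on positives only."""
    h = [EMPTY] * n_attributes
    for x, label in examples:
        if label != 1:
            continue                      # negative examples are ignored
        for i, value in enumerate(x):
            if h[i] is EMPTY:
                h[i] = value              # next more general constraint: the observed value
            elif h[i] != ANY and h[i] != value:
                h[i] = ANY                # conflicting values: generalize to "?"
    return tuple(h)

examples = [
    (("Sunny", "Warm", "Normal", "Strong", "Warm", "Same"), 1),  # assumed example
    (("Sunny", "Warm", "High", "Strong", "Warm", "Same"), 1),    # instance 2 of the notes
    (("Rainy", "Cold", "High", "Strong", "Warm", "Change"), 0),  # assumed example
]

print(find_s(examples, 6))
# ('Sunny', 'Warm', '?', 'Strong', 'Warm', 'Same')
```

Because negative examples never change h, the result is only guaranteed to be consistent with them when the training data is correct and the target concept is in H.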
Instance 2 (positive):
<Sunny, Warm, High, Strong , Warm , Same >
If the training examples are correct, FIND-S returns the most specific hypothesis in H that is consistent with the positive instances.
Some open questions:
What if several hypotheses are consistent with the data?
What if the training set is not correct?
Does the algorithm converge?
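List-Then-Eliminate algorithm:
VersionSpace <- a list containing every hypothesis in H.
For each <x, c(x)>: remove h from VersionSpace if h(x) ≠ c(x).
Output: the list of hypotheses h remaining in VersionSpace.
Specific boundary S:
The set of most specific members of H that are consistent with D.
S = {s ∈ H | Consistent(s, D) ∧ (¬∃s' ∈ H)[(s >g s') ∧ Consistent(s', D)]}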
Version space representation theorem:
VS_{H,D} = {h ∈ H | (∃s ∈ S)(∃g ∈ G)(g ≥g h ≥g s)}
Candidate-Elimination:
G = the set of most general hypotheses. S = the set of most specific hypotheses.
For each d in the training set:
  IF d is positive:
    Remove from G any hypothesis inconsistent with d.
    For each s ∈ S that is inconsistent with d:
      Remove s from S.
      Add to S all minimal generalizations h of s such that:
        > h is consistent with d,
        > some member of G is more general than h.
    Remove from S any hypothesis that is more general than another hypothesis in S.
  IF d is negative:
    Remove from S any hypothesis inconsistent with d.
    For each g ∈ G that is inconsistent with d:
      Remove g from G.
      Add to G all minimal specializations h of g such that:
        > h is consistent with d,
        > some member of S is more specific than h.
    Remove from G any hypothesis that is less general than another hypothesis in G.
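The Candidate-Elimination steps can be sketched for conjunctive hypotheses as below. Everything here is illustrative: the function names are hypothetical, `None` stands in for ∅, and the four training examples and the attribute domains are assumed (the training table did not survive in these notes), chosen so that the final specific boundary agrees with the S4 shown later.

```python
ANY, EMPTY = "?", None

def matches(h, x):
    # EMPTY (None) never equals a value, so a hypothesis containing it rejects everything
    return all(c == ANY or c == v for c, v in zip(h, x))

def more_general_or_equal(a, b):
    if EMPTY in b:
        return True                      # b accepts nothing, so anything is >=_g b
    return all(ca == ANY or ca == cb for ca, cb in zip(a, b))

def min_generalization(s, x):
    # the unique minimal generalization of s that covers x
    return tuple(v if c is EMPTY else (c if c == v else ANY) for c, v in zip(s, x))

def min_specializations(g, x, domains):
    # replace one "?" by any value that differs from x, so that x is excluded
    return [g[:i] + (value,) + g[i + 1:]
            for i, c in enumerate(g) if c == ANY
            for value in domains[i] if value != x[i]]

def candidate_elimination(examples, domains):
    S, G = {(EMPTY,) * len(domains)}, {(ANY,) * len(domains)}
    for x, positive in examples:
        if positive:
            G = {g for g in G if matches(g, x)}
            new_S = set()
            for s in S:
                h = s if matches(s, x) else min_generalization(s, x)
                if any(more_general_or_equal(g, h) for g in G):
                    new_S.add(h)
            S = {s for s in new_S
                 if not any(s != t and more_general_or_equal(s, t) for t in new_S)}
        else:
            S = {s for s in S if not matches(s, x)}
            new_G = set()
            for g in G:
                candidates = min_specializations(g, x, domains) if matches(g, x) else [g]
                new_G.update(h for h in candidates
                             if any(more_general_or_equal(h, s) for s in S))
            G = {g for g in new_G
                 if not any(g != t and more_general_or_equal(t, g) for t in new_G)}
    return S, G

# Assumed attribute domains and training examples (not recoverable from the notes)
domains = [("Sunny", "Cloudy", "Rainy"), ("Warm", "Cold"), ("Normal", "High"),
           ("Strong", "Weak"), ("Warm", "Cool"), ("Same", "Change")]
examples = [
    (("Sunny", "Warm", "Normal", "Strong", "Warm", "Same"), True),
    (("Sunny", "Warm", "High", "Strong", "Warm", "Same"), True),
    (("Rainy", "Cold", "High", "Strong", "Warm", "Change"), False),
    (("Sunny", "Warm", "High", "Strong", "Cool", "Change"), True),
]

S, G = candidate_elimination(examples, domains)
print(S)  # {('Sunny', 'Warm', '?', 'Strong', '?', '?')}
print(G)  # the two hypotheses <Sunny,?,?,?,?,?> and <?,Warm,?,?,?,?>, in either order
```

Under these assumptions the version space is everything between S and G in the ≥g ordering, as stated by the representation theorem.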
Initialization:
S0: {<∅, ∅, ∅, ∅, ∅, ∅>}
Version space:
S4: {<Sunny, Warm, ?, Strong, ?, ?>}
<Sunny, ?, ?, Strong, ?, ?>    <Sunny, Warm, ?, ?, ?, ?>    <?, Warm, ?, Strong, ?, ?>
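Decision tree learning
Introduction
A method for approximating discrete-valued functions.
The function is represented as a decision tree; it can also be expressed as if-then rules.
Belongs to the class of inductive inference algorithms.
Node: an attribute of the instance.
Branches from node X: the possible values of the attribute at X.
Classifying an instance with the tree:
Start at the root node, test the attribute corresponding to this node, and move down the branch whose value equals the instance's value for that attribute. Repeat with the subtree.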
Outlook
  Sunny -> Humidity
    High -> NO
    Normal -> YES
  Overcast -> YES
  Rain -> Wind
    Strong -> NO
    Weak -> YES
A decision tree representing the concept PlayTennis. The tree can classify whether a given Sunday morning is suitable for playing tennis, based on the attributes <Outlook, Humidity, Wind, ...> of that morning.
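The classification walk from the root to a leaf can be sketched as follows. The nested-dictionary encoding and the `classify` name are illustrative choices, and the sample morning is hypothetical.

```python
# PlayTennis tree encoded as {attribute: {value: subtree or leaf label}}
tree = {"Outlook": {
    "Sunny": {"Humidity": {"High": "NO", "Normal": "YES"}},
    "Overcast": "YES",
    "Rain": {"Wind": {"Strong": "NO", "Weak": "YES"}},
}}

def classify(node, instance):
    """Follow the branch matching the instance's value until a leaf label is reached."""
    while isinstance(node, dict):
        attribute, branches = next(iter(node.items()))
        node = branches[instance[attribute]]
    return node

morning = {"Outlook": "Sunny", "Temperature": "Hot", "Humidity": "High", "Wind": "Strong"}
print(classify(tree, morning))  # NO
```

Attributes that do not appear on the path taken, such as Temperature here, are never consulted.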
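For the tree above:
A morning with the attributes:
The decision tree:
Represents a disjunction (∨) of conjunctions (∧) of constraints on the attributes.
Each path from the root to a leaf corresponds to one conjunction.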
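To make the disjunction-of-conjunctions reading concrete, the sketch below collects every root-to-leaf path that ends in YES for the PlayTennis tree. The dictionary encoding and the function name are illustrative assumptions.

```python
tree = {"Outlook": {
    "Sunny": {"Humidity": {"High": "NO", "Normal": "YES"}},
    "Overcast": "YES",
    "Rain": {"Wind": {"Strong": "NO", "Weak": "YES"}},
}}

def positive_paths(node, path=()):
    """Return one tuple of (attribute, value) tests per path ending in a YES leaf."""
    if not isinstance(node, dict):
        return [path] if node == "YES" else []
    attribute, branches = next(iter(node.items()))
    paths = []
    for value, child in branches.items():
        paths += positive_paths(child, path + ((attribute, value),))
    return paths

terms = [" ^ ".join(f"{a}={v}" for a, v in p) for p in positive_paths(tree)]
print(" v ".join(f"({t})" for t in terms))
# (Outlook=Sunny ^ Humidity=Normal) v (Outlook=Overcast) v (Outlook=Rain ^ Wind=Weak)
```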
Problems suited to decision trees:
Instances are represented as <attribute, value> pairs.
The target function has discrete output values. In the example above the function has two values: YES | NO.
The problem can be expressed as a disjunction of conjunctions.
The training set may contain errors: errors in the classification of instances as well as errors in the attribute values.
The training set may contain instances with some attribute values missing.
Application areas:
Classifying patients by disease.
Classifying equipment malfunctions by cause.
Finance.
Basic learning algorithm: ID3.
Idea:
Build the tree from the root down to the leaves by answering:
Which attribute is the best one to test at the root? One subtree is created for each branch, that is, for each value of this attribute.
The process is repeated for the subtrees.
ENTROPY:
Measures the homogeneity of the training set.
Training set: S
Entropy(S) = -p+ log2(p+) - p- log2(p-)
p+: the proportion of positive instances among all instances.
p-: the proportion of negative instances among all instances.
Example:
Total number of instances: 14, of which:
9 instances are positive (they belong to the target concept).
5 instances are negative (they do not belong to the target concept).
In short notation: [9+,5-]
Entropy([9+,5-]) = -(9/14) log2(9/14) - (5/14) log2(5/14) = 0.94
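The worked example can be reproduced with a few lines of Python. The function name and its count-based signature are illustrative; a zero count is skipped, which corresponds to taking 0 log2(0) as 0.

```python
import math

def binary_entropy(pos, neg):
    """Entropy of a set with `pos` positive and `neg` negative instances."""
    total = pos + neg
    result = 0.0
    for count in (pos, neg):
        if count:                         # skip empty classes: 0 * log2(0) is taken as 0
            p = count / total
            result -= p * math.log2(p)
    return result

print(round(binary_entropy(9, 5), 3))  # 0.94
```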
Notes:
0 log2(0) = 0.
When p+ = 0 or p- = 0, Entropy = 0.
Entropy = 1 when p+ = p-.
0 <= Entropy <= 1.
c = 2: the case above. In general, for c classes:
Entropy(S) = -SUM(p_i log2(p_i)), i = 1..c
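A sketch of the general c-class formula, computed directly from a list of class labels (the function name and the label values are illustrative). It also checks the boundary cases listed above. For c classes the maximum is log2(c), so the bound of 1 applies to the two-class case.

```python
import math
from collections import Counter

def entropy(labels):
    """Entropy(S) = -SUM(p_i * log2(p_i)) over the classes present in `labels`."""
    total = len(labels)
    # classes with zero instances never appear in the Counter, so 0*log2(0) never arises
    return sum(-(n / total) * math.log2(n / total) for n in Counter(labels).values())

print(entropy(["+", "+", "+", "+"]))         # 0.0  (all instances in one class)
print(entropy(["+", "-", "+", "-"]))         # 1.0  (p+ = p-)
print(entropy(["a", "b", "c", "d"]))         # 2.0  (four equally likely classes: log2(4))
print(round(entropy(["+"] * 9 + ["-"] * 5), 3))  # 0.94
```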
Info Gain:
Measures the expected reduction in entropy.
Formula: Gain(S, A) = Entropy(S) - SUM((|Sv|/|S|) Entropy(Sv)), v ∈ Values(A)
S: the training set.
A: an attribute.
Values(A): the set of possible values of A.
Sv: the subset of S in which attribute A has value v.
|Sv|/|S|: the fraction of instances in which attribute A has value v.
Example:
See the table of training instances below.
Day  Outlook   Temperature  Humidity  Wind    PlayTennis
1    Sunny     Hot          High      Weak    No
2    Sunny     Hot          High      Strong  No
3    Overcast  Hot          High      Weak    Yes
4    Rain      Mild         High      Weak    Yes
5    Rain      Cool         Normal    Weak    Yes
6    Rain      Cool         Normal    Strong  No
7    Overcast  Cool         Normal    Strong  Yes
8    Sunny     Mild         High      Weak    No
9    Sunny     Cool         Normal    Weak    Yes
10   Rain      Mild         Normal    Weak    Yes
11   Sunny     Mild         Normal    Strong  Yes
12   Overcast  Mild         High      Strong  Yes
13   Overcast  Hot          Normal    Weak    Yes
14   Rain      Mild         High      Strong  No
Next, Info Gain is computed for two attributes: Humidity and Wind.
[Figure: splitting S = [9+,5-] (E = 0.94) on Humidity and on Wind]
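The two splits can be worked out from class counts read off the 14-day training table: Humidity gives High = [3+,4-] and Normal = [6+,1-]; Wind gives Weak = [6+,2-] and Strong = [3+,3-]. The sketch below (function names are illustrative) applies the Gain formula to these counts. Unrounded arithmetic gives about 0.152 and 0.048; values such as 0.151 come from rounding the intermediate entropies first.

```python
import math

def entropy(pos, neg):
    """Two-class entropy from counts, with 0 * log2(0) taken as 0."""
    total = pos + neg
    return sum(-(c / total) * math.log2(c / total) for c in (pos, neg) if c)

def gain(parent, partitions):
    """Info gain; `parent` and each partition are (positives, negatives) count pairs."""
    total = sum(parent)
    remainder = sum((p + n) / total * entropy(p, n) for p, n in partitions)
    return entropy(*parent) - remainder

gain_humidity = gain((9, 5), [(3, 4), (6, 1)])  # High, Normal
gain_wind = gain((9, 5), [(6, 2), (3, 3)])      # Weak, Strong

print(round(gain_humidity, 3))  # 0.152
print(round(gain_wind, 3))      # 0.048
```

Humidity yields the larger gain of the two, so it is the better test of this pair on the full set S.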
The algorithm step by step:
Problem:
The data table given above. Target concept: PlayTennis.
Step 1: the tree for S
Gain(S, Outlook) = 0.246
Gain(S, Humidity) = 0.151
Gain(S, Wind) = 0.048
Gain(S, Temperature) = 0.029
Outlook is the best classifying attribute at this step, so Outlook becomes the root node. The tree is as follows:
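The four gains of step 1 can be recomputed directly from the training table. In this sketch the table is re-typed from the notes, and the helper names are illustrative. Unrounded arithmetic gives roughly 0.247, 0.152, 0.048 and 0.029, which agrees with the listed values up to rounding of intermediate results.

```python
import math
from collections import Counter

COLUMNS = ("Outlook", "Temperature", "Humidity", "Wind", "PlayTennis")
ROWS = """\
Sunny Hot High Weak No
Sunny Hot High Strong No
Overcast Hot High Weak Yes
Rain Mild High Weak Yes
Rain Cool Normal Weak Yes
Rain Cool Normal Strong No
Overcast Cool Normal Strong Yes
Sunny Mild High Weak No
Sunny Cool Normal Weak Yes
Rain Mild Normal Weak Yes
Sunny Mild Normal Strong Yes
Overcast Mild High Strong Yes
Overcast Hot Normal Weak Yes
Rain Mild High Strong No"""
data = [dict(zip(COLUMNS, line.split())) for line in ROWS.splitlines()]

def entropy(rows):
    counts = Counter(r["PlayTennis"] for r in rows)
    return sum(-(n / len(rows)) * math.log2(n / len(rows)) for n in counts.values())

def gain(rows, attribute):
    remainder = 0.0
    for value in {r[attribute] for r in rows}:
        subset = [r for r in rows if r[attribute] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy(rows) - remainder

gains = {a: gain(data, a) for a in COLUMNS[:-1]}
best = max(gains, key=gains.get)
for attribute, value in gains.items():
    print(f"Gain(S, {attribute}) = {value:.3f}")
print("best:", best)  # best: Outlook
```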
[D1,D2,...,D14] [9+,5-]
Outlook
  Sunny: [D1,D2,D8,D9,D11] [2+,3-]
  Overcast: [D3,D7,D12,D13] [4+,0-]
  Rain: [D4,D5,D6,D10,D14] [3+,2-]
Step 2: the tree for S_Sunny
S_Sunny = {D1,D2,D8,D9,D11}
Gain(S_Sunny, Humidity) = 0.97 - (3/5)0.0 - (2/5)0.0 = 0.97
Gain(S_Sunny, Wind) = 0.97 - (2/5)1.0 - (3/5)0.918 = 0.019
Gain(S_Sunny, Temperature) = 0.97 - (2/5)0.0 - (2/5)1.0 - (1/5)0.0 = 0.57
Humidity is the best classifying attribute at this step, so Humidity becomes the root of the S_Sunny subtree. The tree is as follows:
[D1,D2,...,D14] [9+,5-]
Outlook
  Sunny: [D1,D2,D8,D9,D11] [2+,3-] -> Humidity
    High: [D1,D2,D8] [0+,3-] -> NO
    Normal: [D9,D11] [2+,0-] -> YES
  Overcast: [D3,D7,D12,D13] [4+,0-] -> YES
  Rain: [D4,D5,D6,D10,D14] [3+,2-]
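Step 3: the tree for S_Rain
S_Rain = {D4,D5,D6,D10,D14}. Handled in the same way as above.
Result:
The final tree is the same as the first tree shown in the decision tree learning section.
Stopping condition:
Every leaf node falls into one of two cases:
1. Every attribute already appears on a node along the path from the root to that leaf.
2. The leaf node has entropy = 0, meaning either all of its instances are + or all of its instances are -.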
ID3 algorithm: ID3(Examples, Target_Attribute, Attributes)
Create a root node for the tree.
IF all instances are positive THEN return the single-node tree with label +.
IF all instances are negative THEN return the single-node tree with label -.
IF Attributes is empty THEN return the single-node tree labeled with the most common value of Target_Attribute among the instances.
ELSE:
BEGIN
  A <- the attribute in Attributes that best classifies the instances.
  The attribute for root is A (root <- A).
  For each value Vi of A:
    Add a new branch below root, corresponding to A = Vi.
    Examples_Vi = the subset of Examples in which A = Vi.
    IF Examples_Vi is empty:
      THEN below this new branch add a leaf node labeled with the most common value of Target_Attribute in Examples.
      ELSE below this new branch add the subtree returned by the call ID3(Examples_Vi, Target_Attribute, Attributes - {A}).
END
Return root.
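The pseudocode can be turned into a short runnable sketch. This is one possible reading, not a reference implementation: the nested-dictionary tree encoding, the helper names, and the `domains` argument (the list of possible values per attribute, needed for the empty-subset case) are assumptions. The data is the 14-day PlayTennis table from these notes.

```python
import math
from collections import Counter

COLUMNS = ("Outlook", "Temperature", "Humidity", "Wind", "PlayTennis")
ROWS = """\
Sunny Hot High Weak No
Sunny Hot High Strong No
Overcast Hot High Weak Yes
Rain Mild High Weak Yes
Rain Cool Normal Weak Yes
Rain Cool Normal Strong No
Overcast Cool Normal Strong Yes
Sunny Mild High Weak No
Sunny Cool Normal Weak Yes
Rain Mild Normal Weak Yes
Sunny Mild Normal Strong Yes
Overcast Mild High Strong Yes
Overcast Hot Normal Weak Yes
Rain Mild High Strong No"""
data = [dict(zip(COLUMNS, line.split())) for line in ROWS.splitlines()]

def entropy(rows, target):
    counts = Counter(r[target] for r in rows)
    return sum(-(n / len(rows)) * math.log2(n / len(rows)) for n in counts.values())

def gain(rows, attribute, target):
    remainder = 0.0
    for value in {r[attribute] for r in rows}:
        subset = [r for r in rows if r[attribute] == value]
        remainder += len(subset) / len(rows) * entropy(subset, target)
    return entropy(rows, target) - remainder

def id3(rows, target, attributes, domains):
    labels = [r[target] for r in rows]
    if len(set(labels)) == 1:
        return labels[0]                                  # all instances share one label
    most_common = Counter(labels).most_common(1)[0][0]
    if not attributes:
        return most_common                                # no attribute left to test
    best = max(attributes, key=lambda a: gain(rows, a, target))
    node = {best: {}}
    for value in domains[best]:
        subset = [r for r in rows if r[best] == value]
        if not subset:
            node[best][value] = most_common               # empty branch: majority label
        else:
            remaining = [a for a in attributes if a != best]
            node[best][value] = id3(subset, target, remaining, domains)
    return node

attributes = list(COLUMNS[:-1])
domains = {a: sorted({r[a] for r in data}) for a in attributes}
tree = id3(data, "PlayTennis", attributes, domains)
print(tree)
```

On this table the sketch selects Outlook at the root, Humidity under Sunny and Wind under Rain, which matches the tree built step by step above.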
Exercise 1
Given the following training data for the concept "Fake liquor" (Rượu giả):
No.  Stamp  Color   Smell   Taste       Fake liquor
1    No     Clear   None    Spicy       Yes
2    Yes    Clear   None    Spicy       No
3    No     Cloudy  None    Spicy       Yes
4    Yes    Clear   Strong  Astringent  No
5    No     Clear   None    Sour        Yes
Task: run the Candidate-Elimination learning algorithm step by step for the concept "Fake liquor".
Exercise 2
Given the following training data for the concept "Sinusitis" (Viêm xoang):
No.  Frequent headaches  Breath        Sinusitis
1    No                  Normal        No
2    Yes                 Normal        Yes
3    Yes                 Bad-smelling  No
4    Yes                 Normal        Yes
5    Yes                 Bad-smelling  Yes
6    No                  Normal        No
7    No                  Bad-smelling  Yes
Nasal cavity (Hốc mũi): Normal, Normal, Normal (only these three values of this column survive; their rows are not known).
Task: use the Candidate-Elimination learning algorithm to produce the hypotheses for the concept to be learned. Build a decision tree for the concept to be learned.
Review
Types of problems to review:
Games:
Given a problem (???):
Run the algorithm step by step. Algorithms: Minimax, alpha-beta, IDS, IDA*
Planning:
Given a problem.
Run the planning algorithm step by step. Algorithms: Stack of Goal, Non-linear.
Machine learning:
(Concept learning and decision trees) Given a data table.
Exam: