SVC Talk
Overview
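The Primal is
\[
\min_{R,\,\mu} \; R^2 \quad \text{subject to} \quad \|\phi(x_i) - \mu\|^2 \le R^2 \quad \forall i \tag{1}
\]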
The Lagrangian
\[
L = R^2 + \sum_{i=1}^{m} \alpha_i \left( \|\phi(x_i) - \mu\|^2 - R^2 \right), \qquad \alpha_i \ge 0 \;\; \forall i \tag{2}
\]
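Karush-Kuhn-Tucker conditions
\[
\frac{\partial L}{\partial R} = 0 \implies 2R - 2R \sum_{i=1}^{m} \alpha_i = 0 \implies \sum_{i=1}^{m} \alpha_i = 1 \tag{3}
\]
\[
\frac{\partial L}{\partial \mu} = 0 \implies -2 \sum_{i=1}^{m} \alpha_i \left( \phi(x_i) - \mu \right) = 0 \implies \mu = \sum_{i=1}^{m} \alpha_i \phi(x_i) \tag{4}
\]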
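The Wolfe dual, obtained by substituting (3) and (4) into (2), is
\[
\max_{\alpha} \; W = \sum_{i=1}^{m} \alpha_i K(x_i, x_i) - \sum_{i,j=1}^{m} \alpha_i \alpha_j K(x_i, x_j) \tag{5}
\]
\[
\text{s.t.} \quad \sum_{i=1}^{m} \alpha_i = 1, \qquad \alpha_i \ge 0 \;\; \forall i \tag{6}
\]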
Here $K(x_i, x_j)$ is the kernel function giving the inner product $\phi(x_i) \cdot \phi(x_j)$ in the high-dimensional feature space, and the $\alpha_i$ are the Lagrange multipliers.
Gaussian Kernel
\[
K(x_i, x_j) = e^{-q \|x_i - x_j\|^2} \tag{7}
\]
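As a minimal sketch of equation (7), the kernel and the full kernel matrix can be computed with the standard library alone. The function names `gaussian_kernel` and `kernel_matrix`, the sample points, and the value `q=0.5` are illustrative assumptions, not part of the talk.

```python
import math

def gaussian_kernel(x, y, q=1.0):
    """K(x, y) = exp(-q * ||x - y||^2) for two equal-length sequences."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-q * sq_dist)

def kernel_matrix(points, q=1.0):
    """Symmetric m x m matrix with entries K(x_i, x_j)."""
    return [[gaussian_kernel(xi, xj, q) for xj in points] for xi in points]

# Hypothetical sample data, chosen only for illustration.
points = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
K = kernel_matrix(points, q=0.5)
```

Every diagonal entry is 1, since $K(x, x) = e^0$. A larger width parameter $q$ makes the kernel decay faster with distance.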
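The Primal is
\[
\min_{R,\,\mu,\,\xi} \; R^2 + C \sum_{i=1}^{m} \xi_i \quad \text{subject to} \quad \|\phi(x_i) - \mu\|^2 \le R^2 + \xi_i, \quad \xi_i \ge 0 \quad \forall i \tag{8}
\]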
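The Lagrangian
\[
L = R^2 + C \sum_{i=1}^{m} \xi_i + \sum_{i=1}^{m} \alpha_i \left( \|\phi(x_i) - \mu\|^2 - R^2 - \xi_i \right) - \sum_{i=1}^{m} \beta_i \xi_i, \qquad \alpha_i \ge 0, \;\; \beta_i \ge 0 \;\; \forall i \tag{9}
\]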
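Karush-Kuhn-Tucker conditions
\[
\frac{\partial L}{\partial R} = 0 \implies 2R - 2R \sum_{i=1}^{m} \alpha_i = 0 \implies \sum_{i=1}^{m} \alpha_i = 1 \tag{10}
\]
\[
\frac{\partial L}{\partial \mu} = 0 \implies -2 \sum_{i=1}^{m} \alpha_i \left( \phi(x_i) - \mu \right) = 0 \implies \mu = \sum_{i=1}^{m} \alpha_i \phi(x_i) \tag{11}
\]
\[
\frac{\partial L}{\partial \xi_i} = 0 \implies C - \alpha_i - \beta_i = 0 \implies \alpha_i + \beta_i = C \tag{12}
\]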
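The complementary slackness conditions are
\[
\alpha_i \left( \|\phi(x_i) - \mu\|^2 - R^2 - \xi_i \right) = 0 \tag{13}
\]
\[
\beta_i \xi_i = 0 \tag{14}
\]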
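The Wolfe dual is
\[
\max_{\alpha} \; W = \sum_{i=1}^{m} \alpha_i K(x_i, x_i) - \sum_{i,j=1}^{m} \alpha_i \alpha_j K(x_i, x_j)
\quad \text{s.t.} \quad \sum_{i=1}^{m} \alpha_i = 1, \quad 0 \le \alpha_i \le C \;\; \forall i \tag{15}
\]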
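One way to find multipliers that satisfy constraints (15) is a simple Frank-Wolfe iteration, sketched below. It assumes the usual dual objective $W(\alpha) = \sum_i \alpha_i K(x_i, x_i) - \sum_{i,j} \alpha_i \alpha_j K(x_i, x_j)$ with the Gaussian kernel. The talk does not say which solver it uses, so the algorithm, the name `solve_dual`, and the iteration count are assumptions made for illustration.

```python
import math

def gaussian_kernel(x, y, q=1.0):
    """K(x, y) = exp(-q * ||x - y||^2)."""
    return math.exp(-q * sum((a - b) ** 2 for a, b in zip(x, y)))

def solve_dual(points, q=1.0, C=1.0, iters=2000):
    """Approximately maximise W(alpha) = sum_i a_i K_ii - sum_ij a_i a_j K_ij
    subject to sum_i a_i = 1 and 0 <= a_i <= C (Frank-Wolfe, an assumed solver)."""
    m = len(points)
    if C * m < 1.0:
        raise ValueError("infeasible: need C >= 1/m so that sum(alpha) = 1")
    K = [[gaussian_kernel(xi, xj, q) for xj in points] for xi in points]
    alpha = [1.0 / m] * m  # feasible starting point, since 1/m <= C
    for t in range(iters):
        # Gradient of W: dW/da_i = K_ii - 2 * sum_j a_j K_ij
        grad = [K[i][i] - 2.0 * sum(alpha[j] * K[i][j] for j in range(m))
                for i in range(m)]
        # Linear maximisation over the feasible set: give mass C to the
        # coordinates with the largest gradient until the total mass is 1.
        vertex = [0.0] * m
        remaining = 1.0
        for i in sorted(range(m), key=lambda i: grad[i], reverse=True):
            vertex[i] = min(C, remaining)
            remaining -= vertex[i]
            if remaining <= 0.0:
                break
        gamma = 2.0 / (t + 2.0)  # standard diminishing step size
        alpha = [(1.0 - gamma) * a + gamma * v for a, v in zip(alpha, vertex)]
    return alpha
```

For two symmetric points such as `solve_dual([(-1.0,), (1.0,)])`, the result should approach equal weights of about 0.5 each. Each iterate is a convex combination of feasible points, so the constraints hold throughout; a dedicated quadratic programming solver would converge faster on real data.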
Observations
• points with $\alpha_i = 0$ lie inside the feature space sphere.
• points with $0 < \alpha_i < C$ lie on the surface of the sphere (support vectors, SVs).
• points with $\alpha_i = C$ lie outside the feature space sphere (bounded support vectors, BSVs).
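The three cases above translate directly into a lookup on the multipliers. The sketch below is illustrative only: the name `classify_points` and the tolerance `tol` are assumptions, the latter needed because a numerical solver rarely returns exactly 0 or exactly C.

```python
def classify_points(alpha, C, tol=1e-6):
    """Label each point from its multiplier: interior, SV, or BSV."""
    labels = []
    for a in alpha:
        if a <= tol:
            labels.append("interior")   # alpha_i = 0: inside the sphere
        elif a >= C - tol:
            labels.append("BSV")        # alpha_i = C: outside the sphere
        else:
            labels.append("SV")         # 0 < alpha_i < C: on the sphere
    return labels

# Hypothetical multipliers that sum to 1, with C = 0.5.
print(classify_points([0.0, 0.25, 0.5, 0.25], C=0.5))
# → ['interior', 'SV', 'BSV', 'SV']
```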
The radius of the sphere enclosing the image of the data points is given by
\[
R = \{\, G(x_i) \mid 0 < \alpha_i < C \,\}
\]
where
\[
G^2(x_i) = \|\phi(x_i) - \mu\|^2
= K(x_i, x_i) - 2 \sum_{j=1}^{m} \alpha_j K(x_j, x_i) + \sum_{j,k} \alpha_j \alpha_k K(x_j, x_k) \tag{16}
\]
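Equation (16) needs only kernel evaluations, so $G$ and $R$ can be computed without ever forming $\phi$ explicitly. The following is a minimal sketch assuming the Gaussian kernel; the names `make_G` and `sphere_radius` are hypothetical. In exact arithmetic every support vector gives the same $G(x_i)$, so averaging over them is an assumption made here only to smooth numerical noise.

```python
import math

def gaussian_kernel(x, y, q=1.0):
    """K(x, y) = exp(-q * ||x - y||^2)."""
    return math.exp(-q * sum((a - b) ** 2 for a, b in zip(x, y)))

def make_G(points, alpha, q=1.0):
    """Return a function G(x) = ||phi(x) - mu|| built from equation (16)."""
    m = len(points)
    # The double sum over j, k does not depend on x, so compute it once.
    const = sum(alpha[j] * alpha[k] * gaussian_kernel(points[j], points[k], q)
                for j in range(m) for k in range(m))

    def G(x):
        cross = sum(alpha[j] * gaussian_kernel(points[j], x, q) for j in range(m))
        g2 = gaussian_kernel(x, x, q) - 2.0 * cross + const
        return math.sqrt(max(g2, 0.0))  # clamp tiny negative round-off

    return G

def sphere_radius(points, alpha, C, q=1.0, tol=1e-6):
    """Average G(x_i) over the support vectors (0 < alpha_i < C)."""
    G = make_G(points, alpha, q)
    sv = [G(x) for x, a in zip(points, alpha) if tol < a < C - tol]
    if not sv:
        raise ValueError("no support vectors with 0 < alpha_i < C")
    return sum(sv) / len(sv)
```

For example, with the two points `(-1.0,)` and `(1.0,)`, equal multipliers of 0.5, `q=1.0` and `C=1.0`, the radius works out to $\sqrt{0.5\,(1 - e^{-4})}$.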
\[
\{\, x \mid G(x) = R \,\} \tag{17}
\]
Cluster Assignment
Cluster assignment uses a geometric method based on $G(x)$ and on the following observation: given a pair of points that belong to different clusters, any path that connects them must exit from the sphere in feature space.
Define an adjacency matrix $M$ between pairs of points $x_i$ and $x_j$ whose images lie in or on the sphere in feature space, by checking the image of the line segment that connects them:
\[
M[i, j] =
\begin{cases}
1 & \text{if } G(y) \le R \;\; \forall y \in [x_i, x_j] \\
0 & \text{otherwise}
\end{cases} \tag{18}
\]
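Definition (18) quantifies over every point of the segment, which cannot be checked exactly in code. The sketch below approximates it by testing a finite number of evenly spaced points on each segment, then reads the clusters off as connected components of the resulting graph. The sample count `n_samples`, the tolerance `tol`, and all function names are assumptions for illustration.

```python
import math

def gaussian_kernel(x, y, q):
    return math.exp(-q * sum((a - b) ** 2 for a, b in zip(x, y)))

def make_G(points, alpha, q):
    """G(y) = ||phi(y) - mu||, computed from kernel values as in (16)."""
    m = len(points)
    const = sum(alpha[j] * alpha[k] * gaussian_kernel(points[j], points[k], q)
                for j in range(m) for k in range(m))
    def G(y):
        cross = sum(a * gaussian_kernel(p, y, q) for a, p in zip(alpha, points))
        return math.sqrt(max(gaussian_kernel(y, y, q) - 2.0 * cross + const, 0.0))
    return G

def adjacency_matrix(points, G, R, n_samples=20, tol=1e-9):
    """M[i][j] = 1 if G(y) <= R at every sampled y on the segment [x_i, x_j]."""
    m = len(points)
    M = [[0] * m for _ in range(m)]
    for i in range(m):
        for j in range(i, m):
            inside = True
            for s in range(n_samples + 1):
                t = s / n_samples
                y = tuple((1 - t) * a + t * b for a, b in zip(points[i], points[j]))
                if G(y) > R + tol:
                    inside = False
                    break
            M[i][j] = M[j][i] = 1 if inside else 0
    return M

def clusters(M):
    """Label the connected components of the graph with adjacency matrix M."""
    m = len(M)
    label = [-1] * m
    current = 0
    for start in range(m):
        if label[start] != -1:
            continue
        stack = [start]
        label[start] = current
        while stack:
            u = stack.pop()
            for v in range(m):
                if M[u][v] and label[v] == -1:
                    label[v] = current
                    stack.append(v)
        current += 1
    return label
```

With the two points `(-1.0,)` and `(1.0,)` and equal multipliers, a narrow kernel (`q=1.0`) pushes the midpoint outside the sphere and yields two clusters, while a wide kernel (`q=0.1`) keeps the whole segment inside and yields one. Points whose images lie outside the sphere fail the test even against themselves, so they end up as singleton components and need separate handling.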
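\[
\min \; R^2 + T^2 + \frac{1}{\upsilon m} \sum_{i=1}^{m} \xi_i + \frac{\delta}{\upsilon m} \sum_{i=1}^{m} \xi'_i
\]
\[
\text{s.t.} \quad \|\phi(x_i) - \mu\|^2 \le R^2 + \xi_i + \xi'_i, \qquad 0 \le \xi_i \le T^2 - R^2, \qquad \xi'_i \ge 0 \quad \forall i
\]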
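\[
L = R^2 + T^2 + \frac{1}{\upsilon m} \sum_{i=1}^{m} \xi_i + \frac{\delta}{\upsilon m} \sum_{i=1}^{m} \xi'_i
+ \sum_{i=1}^{m} \alpha_i \left( \|\phi(x_i) - \mu\|^2 - R^2 - \xi_i - \xi'_i \right)
- \sum_{i=1}^{m} \beta_i \xi_i
+ \sum_{i=1}^{m} \lambda_i \left( \xi_i - T^2 + R^2 \right)
- \sum_{i=1}^{m} \eta_i \xi'_i
\]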
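\[
\sum_{i=1}^{m} \alpha_i = 2 \tag{1}
\]
\[
\mu = \frac{1}{2} \sum_{i=1}^{m} \alpha_i \phi(x_i) \tag{2}
\]
\[
\beta_i - \lambda_i = \frac{1}{\upsilon m} - \alpha_i \tag{3}
\]
\[
\frac{\delta}{\upsilon m} - \alpha_i = \eta_i \tag{4}
\]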
\[
\alpha_i \left( \|\phi(x_i) - \mu\|^2 - R^2 - \xi_i - \xi'_i \right) = 0 \tag{5}
\]
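\[
\min_{\alpha} \; \sum_{i,j}^{m} \alpha_i \alpha_j K(x_i, x_j) - \sum_{i=1}^{m} \alpha_i K(x_i, x_i)
\]
\[
\text{s.t.} \quad 0 \le \alpha_i \le \frac{\delta}{\upsilon m} \;\; \text{for } i = 1, \dots, m, \qquad \sum_{i=1}^{m} \alpha_i = 2
\]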
Let us define
\[
R = G(x_i) \;:\; 0 < \alpha_i < \frac{1}{\upsilon m}
\]
\[
T = G(x_i) \;:\; \frac{1}{\upsilon m} < \alpha_i < \frac{\delta}{\upsilon m}
\]
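The two definitions above can be read as a selection rule on the multipliers. The sketch below assumes the values $G(x_i)$ have already been computed and averages over the qualifying points to reduce numerical noise. The averaging, the tolerance `tol`, and the name `two_sphere_radii` are assumptions, not part of the talk.

```python
def two_sphere_radii(G_values, alpha, upsilon, delta, tol=1e-6):
    """Return (R, T) from per-point distances G(x_i) and multipliers alpha_i.

    R uses points with 0 < alpha_i < 1/(upsilon*m);
    T uses points with 1/(upsilon*m) < alpha_i < delta/(upsilon*m).
    A radius is None when no point falls in its range.
    """
    m = len(alpha)
    low = 1.0 / (upsilon * m)
    high = delta / (upsilon * m)
    inner = [g for g, a in zip(G_values, alpha) if tol < a < low - tol]
    outer = [g for g, a in zip(G_values, alpha) if low + tol < a < high - tol]
    R = sum(inner) / len(inner) if inner else None
    T = sum(outer) / len(outer) if outer else None
    return R, T

# Hypothetical values: m = 4, upsilon = 0.5, delta = 2, so the thresholds
# are 1/(upsilon*m) = 0.5 and delta/(upsilon*m) = 1.0; alpha sums to 2.
R, T = two_sphere_radii([0.2, 0.5, 0.9, 1.2], [0.0, 0.3, 0.7, 1.0],
                        upsilon=0.5, delta=2.0)
```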