
\tilde{L}(\mathbf{a}, \hat{\mathbf{a}}) = -\frac{1}{2} \sum_{n=1}^{N} \sum_{m=1}^{N} (a_n - \hat{a}_n)(a_m - \hat{a}_m) k(\mathbf{x}_n, \mathbf{x}_m)
    - \epsilon \sum_{n=1}^{N} (a_n + \hat{a}_n) + \sum_{n=1}^{N} (a_n - \hat{a}_n) t_n     (7.61)

with respect to $\{a_n\}$ and $\{\hat{a}_n\}$, where we have introduced the kernel $k(\mathbf{x}, \mathbf{x}') = \boldsymbol{\phi}(\mathbf{x})^{\mathrm{T}} \boldsymbol{\phi}(\mathbf{x}')$.
Again, this is a constrained maximization, and to find the constraints we note that
$a_n \geqslant 0$ and $\hat{a}_n \geqslant 0$ are both required because these are Lagrange multipliers.
Also $\mu_n \geqslant 0$ and $\hat{\mu}_n \geqslant 0$, together with (7.59) and (7.60), require
$a_n \leqslant C$ and $\hat{a}_n \leqslant C$, and so again we have the box constraints

0 \leqslant a_n \leqslant C     (7.62)
0 \leqslant \hat{a}_n \leqslant C     (7.63)

together with the condition (7.58).
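As an illustration of how this box-constrained maximization might be carried out numerically, the following sketch minimizes the negative of (7.61) subject to the box constraints (7.62), (7.63) and the summation condition (7.58). The Gaussian kernel, the variable names, the hyperparameter values, and the use of SciPy's SLSQP routine are illustrative assumptions rather than anything prescribed by the text; in practice dedicated quadratic-programming or sequential-minimal-optimization solvers are normally used.

```python
import numpy as np
from scipy.optimize import minimize

def gaussian_kernel(X, Y, gamma=1.0):
    # k(x, x') = exp(-gamma ||x - x'||^2); any other valid kernel could be used here.
    sq_dists = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-gamma * sq_dists)

def solve_svr_dual(X, t, C=1.0, eps=0.1):
    """Maximize the dual Lagrangian (7.61) over a_n and a^_n (illustrative sketch)."""
    N = len(t)
    K = gaussian_kernel(X, X)

    def neg_dual(v):
        a, a_hat = v[:N], v[N:]
        d = a - a_hat
        # Negative of (7.61), so that a general-purpose minimizer can be applied.
        return 0.5 * d @ K @ d + eps * np.sum(a + a_hat) - d @ t

    bounds = [(0.0, C)] * (2 * N)          # box constraints (7.62) and (7.63)
    sum_constraint = {                      # condition (7.58): sum_n (a_n - a^_n) = 0
        "type": "eq",
        "fun": lambda v: np.sum(v[:N] - v[N:]),
    }
    res = minimize(neg_dual, np.zeros(2 * N), method="SLSQP",
                   bounds=bounds, constraints=[sum_constraint])
    return res.x[:N], res.x[N:]
```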
Substituting (7.57) into (7.1), we see that predictions for new inputs can be made
using

y(\mathbf{x}) = \sum_{n=1}^{N} (a_n - \hat{a}_n) k(\mathbf{x}, \mathbf{x}_n) + b     (7.64)

which is again expressed in terms of the kernel function.
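As a concrete illustration of (7.64), a prediction routine might look as follows. The dual coefficients, the bias $b$, and the kernel function are assumed to be supplied by the caller (the bias can be fixed using the KKT conditions given below, from any point with $0 < a_n < C$); the function name and its arguments are illustrative and not from the text.

```python
def svr_predict(X_new, X_train, a, a_hat, b, kernel):
    # Equation (7.64): y(x) = sum_n (a_n - a^_n) k(x, x_n) + b.
    # `kernel` is any kernel function, e.g. the Gaussian kernel sketched above.
    K = kernel(X_new, X_train)          # shape (number of new points, N)
    return K @ (a - a_hat) + b
```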
The corresponding Karush-Kuhn-Tucker (KKT) conditions, which state that at
the solution the product of the dual variables and the constraints must vanish, are
given by

a_n(\epsilon + \xi_n + y_n - t_n) = 0     (7.65)
\hat{a}_n(\epsilon + \hat{\xi}_n - y_n + t_n) = 0     (7.66)
(C - a_n)\xi_n = 0     (7.67)
(C - \hat{a}_n)\hat{\xi}_n = 0.     (7.68)

From these we can obtain several useful results. First of all, we note that a coefficient
$a_n$ can only be nonzero if $\epsilon + \xi_n + y_n - t_n = 0$, which implies that the data point
either lies on the upper boundary of the $\epsilon$-tube ($\xi_n = 0$) or lies above the upper
boundary ($\xi_n > 0$). Similarly, a nonzero value for $\hat{a}_n$ implies $\epsilon + \hat{\xi}_n - y_n + t_n = 0$,
and such points must lie either on or below the lower boundary of the $\epsilon$-tube.
Furthermore, the two constraints $\epsilon + \xi_n + y_n - t_n = 0$ and $\epsilon + \hat{\xi}_n - y_n + t_n = 0$
are incompatible, as is easily seen by adding them together and noting that $\xi_n$ and
$\hat{\xi}_n$ are nonnegative while $\epsilon$ is strictly positive, and so for every data point $\mathbf{x}_n$, either
$a_n$ or $\hat{a}_n$ (or both) must be zero.
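Writing out the sum makes this explicit: adding the two conditions gives

(\epsilon + \xi_n + y_n - t_n) + (\epsilon + \hat{\xi}_n - y_n + t_n) = 2\epsilon + \xi_n + \hat{\xi}_n > 0

whenever $\epsilon > 0$ and $\xi_n, \hat{\xi}_n \geqslant 0$, so the two brackets appearing in (7.65) and (7.66) cannot vanish simultaneously, and hence $a_n$ and $\hat{a}_n$ cannot both be nonzero.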
The support vectors are those data points that contribute to predictions given by
(7.64), in other words those for which either $a_n \neq 0$ or $\hat{a}_n \neq 0$. These are points that
lie on the boundary of the $\epsilon$-tube or outside the tube. All points within the tube have
$a_n = \hat{a}_n = 0$ and hence make no contribution to the predictions (7.64).
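A minimal sketch of how the support vectors might be read off from the dual solution, following this criterion, is given below; the numerical tolerance used to decide whether a coefficient is effectively zero is an illustrative detail, not part of the text.

```python
import numpy as np

def support_vector_indices(a, a_hat, tol=1e-8):
    # A data point is a support vector if a_n != 0 or a^_n != 0,
    # i.e. it lies on or outside the epsilon-tube and so enters (7.64).
    return np.where((a > tol) | (a_hat > tol))[0]
```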