
Support Vector Machine

Le Do Hoang Nam – CNTN08

Linear Programming

General form, with $x \in \mathbb{R}^n$:

$$\min_{x} \; c^\top x \quad \text{subject to} \quad Ax \le b$$

Linear objective, linear constraints, …

Linear Programming

An example: The Diet Problem

How do we come up with the cheapest meal that meets all nutrition standards?

Linear Programming

Let $x_1$, $x_2$ and $x_3$ be the amounts, in kilograms, of carrot, cabbage and cucumber in the dish.

Mathematically, with per-kilo prices $p_j$ and nutrition requirements, this is an LP:

$$\min_{x} \; p_1 x_1 + p_2 x_2 + p_3 x_3 \quad \text{subject to} \quad Ax \ge r, \;\; x \ge 0$$

where row $k$ of $A$ gives the amount of nutrient $k$ per kilo of each vegetable and $r_k$ is the required amount of nutrient $k$.

Linear Programming

In canonical form:

$$\min_{x} \; c^\top x \quad \text{subject to} \quad Ax \le b, \;\; x \ge 0$$

How to solve? Simplex method. Newton's method. Gradient descent.
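As a concrete illustration (not from the original slides), here is a minimal Python sketch of a toy diet LP solved with scipy.optimize.linprog; all prices, nutrient contents, and requirements below are hypothetical.

import numpy as np
from scipy.optimize import linprog

# Cost per kilo of carrot, cabbage, cucumber (hypothetical prices).
c = np.array([2.0, 1.5, 3.0])

# Nutrient content per kilo (rows: vitamin A, fiber) -- hypothetical values.
A_nutrition = np.array([
    [8.0, 2.0, 1.0],   # vitamin A per kilo of each vegetable
    [1.0, 3.0, 2.0],   # fiber per kilo of each vegetable
])
r = np.array([8.0, 6.0])  # minimum required amount of each nutrient

# linprog takes "<=" constraints, so flip the signs of A x >= r.
res = linprog(c, A_ub=-A_nutrition, b_ub=-r, bounds=[(0, None)] * 3)
print(res.x, res.fun)  # optimal kilos of each vegetable and the total cost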

LP and Classification

Given a set of $N$ samples $(m_i, l_i)$:

$m_i \in \mathbb{R}^d$ is the feature vector.

$l_i \in \{-1, +1\}$ is the label.

If a sample is correctly classified by a hyperplane $w^\top x + c = 0$, then (after scaling $w$ and $c$ appropriately):

$$l_i (w^\top m_i + c) \ge 1$$

Note that $l_i (w^\top m_i + c)$ is a linear function of $(w, c)$.

LP and Classification

$(w, c)$ is a good classifier if it satisfies:

$$l_i (w^\top m_i + c) \ge 1, \quad i = 1, \dots, N$$

which are linear constraints.

LP form (a pure feasibility problem, with no objective):

$$\min_{w, c} \; 0 \quad \text{subject to} \quad l_i (w^\top m_i + c) \ge 1, \;\; i = 1, \dots, N$$
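A minimal sketch of this feasibility LP, again with scipy.optimize.linprog; the zero objective and the toy 2-D points below are illustrative assumptions, not from the slides.

import numpy as np
from scipy.optimize import linprog

# Toy 2-D data: two linearly separable classes (made-up points).
M = np.array([[2.0, 2.0], [3.0, 1.5], [-1.0, -2.0], [-2.0, -1.0]])
l = np.array([1.0, 1.0, -1.0, -1.0])

# Variables are z = (w1, w2, c). The constraint l_i (w^T m_i + c) >= 1
# becomes -l_i * (m_i, 1) . z <= -1 in linprog's A_ub z <= b_ub form.
A_ub = -l[:, None] * np.hstack([M, np.ones((len(M), 1))])
b_ub = -np.ones(len(M))

# Zero objective: any feasible (w, c) is a valid separating hyperplane.
# Bounds must be opened up, since linprog defaults to x >= 0.
res = linprog(np.zeros(3), A_ub=A_ub, b_ub=b_ub,
              bounds=[(None, None)] * 3)
w, c = res.x[:2], res.x[2]
print(w, c)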

LP and Classification

Without any objective function, we have ALL possible solutions:

[Figure: two panels showing the same two classes (Class 1, Class 2), each separated by a different valid hyperplane]

LP and Classification

If the data is not linearly separable:

Minimize the number of errors.

[Figure: two overlapping classes (Class 1, Class 2) that no single hyperplane separates perfectly]

LP and Classification

Our objective becomes:

$$\min_{w, c} \; \#\{\, i : l_i (w^\top m_i + c) < 1 \,\}$$

But the cardinal (0-1 counting) function is non-linear, so this is not an LP.

LP and Classification

Cardinal function:

$$f(x) = \begin{cases} 1 & x < 1 \\ 0 & x \ge 1 \end{cases}$$

[Plot: the step function f(x), equal to 1 for x < 1 and dropping to 0 at x = 1]

Solution: approximate it with the hinge-loss function.

LP and Classification

Hinge-loss function:

$$f(x) = \max(0, \; 1 - x)$$

[Plot: the hinge loss f(x), decreasing linearly from 1 at x = 0 to 0 at x = 1, and zero afterwards]

Or, equivalently, introducing a slack variable $\varepsilon_i$ per sample:

$$\varepsilon_i \ge 1 - l_i (w^\top m_i + c), \quad \varepsilon_i \ge 0$$
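As a quick aside (not in the slides), the hinge loss is one line of NumPy:

import numpy as np

def hinge(x):
    # Hinge loss: zero when the margin condition x >= 1 holds,
    # growing linearly as the condition is violated.
    return np.maximum(0.0, 1.0 - x)

print(hinge(np.array([-1.0, 0.0, 0.5, 1.0, 2.0])))  # [2.  1.  0.5 0.  0. ]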

LP and Classification

The classification problem now becomes:

$$\min_{w, c, \varepsilon} \; \sum_{i=1}^{N} \varepsilon_i \quad \text{subject to} \quad l_i (w^\top m_i + c) \ge 1 - \varepsilon_i, \;\; \varepsilon_i \ge 0, \;\; i = 1, \dots, N$$

which can be solved as an LP.
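A minimal sketch of this hinge-loss LP with scipy.optimize.linprog; the overlapping toy data is made up, and the variable packing z = (w, c, ε) is one possible encoding, not the slides' own.

import numpy as np
from scipy.optimize import linprog

# Toy overlapping 2-D data (made up).
M = np.array([[2.0, 1.0], [1.0, 2.0], [0.5, 0.0],
              [-1.0, -1.0], [-2.0, 0.0], [0.0, 0.5]])
l = np.array([1, 1, 1, -1, -1, -1], dtype=float)
N, d = M.shape

# Variables z = (w1, ..., wd, c, eps_1, ..., eps_N).
# Objective: minimize the sum of slacks; w and c are free.
c_obj = np.concatenate([np.zeros(d + 1), np.ones(N)])

# l_i (w^T m_i + c) >= 1 - eps_i  <=>  -l_i (m_i, 1).(w, c) - eps_i <= -1
A_ub = np.hstack([-l[:, None] * np.hstack([M, np.ones((N, 1))]), -np.eye(N)])
b_ub = -np.ones(N)

bounds = [(None, None)] * (d + 1) + [(0, None)] * N  # eps_i >= 0
res = linprog(c_obj, A_ub=A_ub, b_ub=b_ub, bounds=bounds)
w, c, eps = res.x[:d], res.x[d], res.x[d + 1:]
print(w, c, eps.sum())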

LP and Classification

Geometry view:

[Figure: decision boundary $w^\top x + c = 0$ with margin hyperplanes $w^\top x + c = 1$ and $w^\top x + c = -1$; samples $m_i$ (Class 1) and $m_j$ (Class 2) violating the margin carry slacks $\varepsilon_i$ and $\varepsilon_j$]

LP and Classification

Another problem: some samples are uncertain.

[Figure: two classes (Class 1, Class 2) with a separating hyperplane passing very close to some samples]

LP and Classification

Solution: maximize the margin $d$.

[Figure: two classes (Class 1, Class 2) separated by a hyperplane with a margin band of width $d$ around it]

LP and Classification

All samples lie outside the margin.

The distance from every sample to the boundary is at least $d/2$. That means:

$$\frac{l_i (w^\top m_i + c)}{\lVert w \rVert} \ge \frac{d}{2}, \quad i = 1, \dots, N$$

LP and Classification

Because the hyperplane is homogeneous (scaling $w$ and $c$ by the same factor does not change it), we can choose $w$ such that:

$$\frac{d}{2} \lVert w \rVert = 1, \quad \text{i.e.} \quad d = \frac{2}{\lVert w \rVert}$$

The objective function: maximizing $d$ is then the same as minimizing $\lVert w \rVert$.

LP and Classification

The problem now becomes (the hard-margin SVM):

$$\min_{w, c} \; \lVert w \rVert \quad \text{subject to} \quad l_i (w^\top m_i + c) \ge 1, \;\; i = 1, \dots, N$$

(equivalently, minimizing $\frac{1}{2}\lVert w \rVert^2$, which makes this a quadratic program rather than an LP)
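For illustration (not from the slides), a minimal sketch of this problem in its quadratic form using the cvxpy modeling library, on made-up separable data:

import cvxpy as cp
import numpy as np

# Made-up linearly separable 2-D data.
M = np.array([[2.0, 2.0], [3.0, 1.0], [-1.0, -2.0], [-2.0, -1.0]])
l = np.array([1.0, 1.0, -1.0, -1.0])

w = cp.Variable(2)
c = cp.Variable()

# Hard-margin SVM: minimize ||w||^2 / 2 subject to the margin constraints.
constraints = [cp.multiply(l, M @ w + c) >= 1]
prob = cp.Problem(cp.Minimize(0.5 * cp.sum_squares(w)), constraints)
prob.solve()
print(w.value, c.value)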

Support Vector Machine

Together with the error minimization, we have the (soft-margin) SVM:

$$\min_{w, c, \varepsilon} \; \frac{1}{2} \lVert w \rVert^2 + \lambda \sum_{i=1}^{N} \varepsilon_i \quad \text{subject to} \quad l_i (w^\top m_i + c) \ge 1 - \varepsilon_i, \;\; \varepsilon_i \ge 0$$

$\lambda$ controls the trade-off between training error and robustness (margin width).
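For reference (not part of the slides), a minimal scikit-learn sketch; SVC's parameter C plays the role of λ here, weighting the slack penalty against the margin term, and the toy data is made up:

import numpy as np
from sklearn.svm import SVC

# Made-up 2-D training data with some overlap between the classes.
M = np.array([[2.0, 1.0], [1.0, 2.0], [0.2, 0.1],
              [-1.0, -1.0], [-2.0, 0.0], [-0.1, -0.2]])
l = np.array([1, 1, -1, -1, -1, 1])

# Linear soft-margin SVM; C weights the sum of slacks.
clf = SVC(kernel="linear", C=1.0)
clf.fit(M, l)
print(clf.coef_, clf.intercept_)  # the learned w and c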

Kernel Method