Classification
• Where 𝑦 is a discrete value
  – Develop a classification algorithm to determine which class a new input should fall into
• Start with the binary-class problem
  – Later we look at the multiclass classification problem, which is just an extension of binary classification
• We could use linear regression
  – Then threshold the classifier output (i.e., anything over some value is "yes", else "no")
  – Linear regression with thresholding seems to work
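As a minimal sketch of the thresholding idea (the data values here are illustrative, not from the slides): fit ordinary least squares on ±1 labels, then take the sign of the regression output as the class.

```python
import numpy as np

# Toy binary data with labels +1 / -1 (illustrative values);
# the first column of X is the artificial coordinate x0 = 1.
X = np.array([[1.0, 0.5], [1.0, 1.5], [1.0, 3.0], [1.0, 4.0]])
y = np.array([-1.0, -1.0, 1.0, 1.0])

# Least-squares fit of w to the +-1 labels
w, *_ = np.linalg.lstsq(X, y, rcond=None)

# Threshold the regression output at 0 to get a class label
pred = np.sign(X @ w)
print(pred)  # -> [-1. -1.  1.  1.], matching y on this toy set
```

On cleanly separated data like this the thresholded fit recovers the labels; the later slides on SVMs discuss why this is not the best separator in general.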
Classification
• We will learn
  – Perceptron
  – Support vector machine (SVM)
  – Logistic regression
• To find a classification boundary
Perceptron
• Introduce an artificial coordinate 𝑥₀ = 1
• In vector form, the perceptron implements ℎ(𝑥) = sign(𝜔ᵀ𝑥)
Perceptron
• Linearly separable data
• Hyperplane
  – Separates a D-dimensional space into two half-spaces
  – Defined by an outward-pointing normal vector
  – 𝜔 is orthogonal to any vector lying on the hyperplane
  – Assuming the hyperplane passes through the origin, 𝜔ᵀ𝑥 = 0 with 𝑥₀ = 1
Perceptron Algorithm
• The perceptron implements ℎ(𝑥) = sign(𝜔ᵀ𝑥)
• Given the training set (𝑥₁, 𝑦₁), …, (𝑥_N, 𝑦_N):
  1) Pick a misclassified point (𝑥ₙ, 𝑦ₙ), i.e., sign(𝜔ᵀ𝑥ₙ) ≠ 𝑦ₙ
  2) Update the weight vector: 𝜔 ← 𝜔 + 𝑦ₙ𝑥ₙ
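A single update step can be sketched as follows (the weight and point values are illustrative): given a misclassified point, the standard perceptron update adds 𝑦ₙ𝑥ₙ to the weights.

```python
import numpy as np

def pla_update(w, x, y):
    """One perceptron update on a misclassified point (x, y): w <- w + y*x."""
    return w + y * x

# Illustrative misclassified positive point: sign(w^T x) = -1 but y = +1
w = np.array([0.0, -1.0, 0.5])
x = np.array([1.0, 2.0, 1.0])   # x[0] = 1 is the artificial coordinate
y = 1.0
assert np.sign(w @ x) != y       # w @ x = -1.5, so x is misclassified

w_new = pla_update(w, x, y)
print(w @ x, "->", w_new @ x)    # -1.5 -> 4.5: the score moved the right way
```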
Perceptron Algorithm
• Why do perceptron updates work?
• Consider a misclassified positive example, 𝑦ₙ = +1
  – The perceptron (wrongly) thinks 𝜔_oldᵀ𝑥ₙ < 0
  – The update is 𝜔_new = 𝜔_old + 𝑦ₙ𝑥ₙ = 𝜔_old + 𝑥ₙ
  – Thus 𝜔_newᵀ𝑥ₙ = 𝜔_oldᵀ𝑥ₙ + 𝑥ₙᵀ𝑥ₙ, which is less negative than 𝜔_oldᵀ𝑥ₙ
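The key identity in the argument above, 𝜔_newᵀ𝑥ₙ = 𝜔_oldᵀ𝑥ₙ + ‖𝑥ₙ‖², is easy to confirm numerically (values illustrative):

```python
import numpy as np

# For y = +1, the update w_new = w_old + x raises the score by ||x||^2 >= 0
w_old = np.array([0.0, -1.0, 0.5])
x = np.array([1.0, 2.0, 1.0])
w_new = w_old + x

# The score increase equals x^T x exactly
assert np.isclose(w_new @ x - w_old @ x, x @ x)
print(w_old @ x, "->", w_new @ x)   # -1.5 -> 4.5
```

Since the score rises by ‖𝑥ₙ‖² on every such update, repeated updates push a misclassified positive point toward the correct side. (The symmetric argument holds for 𝑦ₙ = −1.)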
Iterations of Perceptron
1. Randomly assign 𝜔
2. One iteration of the PLA (perceptron learning algorithm): 𝜔 ← 𝜔 + 𝑦𝑥, where (𝑥, 𝑦) is a misclassified training point
3. At iteration 𝑡 = 1, 2, 3, ⋯, pick a misclassified point from the training set
4. And run a PLA iteration on it
5. That's it!
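The full loop above can be sketched in a few lines (the data set is illustrative, and zeros are used instead of a random initialization so the run is reproducible):

```python
import numpy as np

def pla(X, y, max_iters=1000):
    """Perceptron learning algorithm on linearly separable data.
    Rows of X must include the artificial coordinate x0 = 1."""
    w = np.zeros(X.shape[1])             # slides say random init; zeros used for reproducibility
    for _ in range(max_iters):
        mis = np.where(np.sign(X @ w) != y)[0]   # indices of misclassified points
        if len(mis) == 0:
            return w                     # every point classified correctly: done
        n = mis[0]                       # pick a misclassified point
        w = w + y[n] * X[n]              # one PLA iteration: w <- w + y*x
    return w

# Toy linearly separable data (illustrative)
X = np.array([[1.0, 2.0, 1.0], [1.0, 1.0, 3.0],
              [1.0, -1.0, -2.0], [1.0, -2.0, -1.0]])
y = np.array([1.0, 1.0, -1.0, -1.0])

w = pla(X, y)
print(np.sign(X @ w))   # matches y once the loop converges
```

On separable data the loop is guaranteed to terminate; which separating hyperplane it returns depends on the initialization and the order in which misclassified points are picked.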
Perceptron Loss Function
• Loss = 0 on examples the perceptron classifies correctly, i.e., 𝑦ₙ𝜔ᵀ𝑥ₙ > 0
• Loss > 0 on examples the perceptron misclassifies, i.e., 𝑦ₙ𝜔ᵀ𝑥ₙ < 0
• Note: sign(𝜔ᵀ𝑥ₙ) ≠ 𝑦ₙ is equivalent to 𝑦ₙ𝜔ᵀ𝑥ₙ < 0
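One standard loss with exactly these properties (zero when 𝑦ₙ𝜔ᵀ𝑥ₙ > 0, positive when it is negative) is Σₙ max(0, −𝑦ₙ𝜔ᵀ𝑥ₙ); the slides do not spell out the formula, so this is an assumed but conventional choice. A sketch with illustrative data:

```python
import numpy as np

def perceptron_loss(w, X, y):
    """Sum of max(0, -y_n * w^T x_n): zero on correctly classified points,
    positive on misclassified ones (a standard form of the perceptron loss)."""
    margins = y * (X @ w)
    return np.sum(np.maximum(0.0, -margins))

# Small illustrative data set
X = np.array([[1.0, 2.0, 1.0], [1.0, -1.0, -2.0]])
y = np.array([1.0, -1.0])

w_good = np.array([0.0, 1.0, 0.0])   # classifies both points correctly -> loss 0
w_bad = np.array([0.0, -1.0, 0.0])   # flips both points -> positive loss
print(perceptron_loss(w_good, X, y), perceptron_loss(w_bad, X, y))  # -> 0.0 3.0
```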
The Best Hyperplane Separator?
• The perceptron finds one of the many possible hyperplanes separating the data, if one exists
• Of the many possible choices, which one is the best?
• Utilize distance information as well
• Intuitively, we want the hyperplane with the maximum margin
• A large margin leads to good generalization on the test data
  – We will see this formally when we cover the Support Vector Machine (SVM)
• The perceptron will later be shown to be a basic unit of neural networks and deep learning