+ All Categories
Home > Documents > Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton...

Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton...

Date post: 26-May-2020
Category:
Upload: others
View: 8 times
Download: 0 times
Share this document with a friend
17
Recognition of Handwritten Names II Dafna Bitton Winter 2009, CSE 190a with Professor Serge Belongie Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17
Transcript
Page 1: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Recognition of Handwritten Names II

Dafna Bitton

Winter 2009, CSE 190a with Professor Serge Belongie

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17

Page 2: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Problem Statement

Given a class roster, match an image of a handwritten name incharacter boxes to the string representation of the name.

For example, match:

to the string “serge belongie”.

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 2 / 17

Page 3: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Outline

1 Steps Involved

2 Data

3 Learning Framework

4 Results

5 Future Work

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 3 / 17

Page 4: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Outline

1 Steps Involved

2 Data

3 Learning Framework

4 Results

5 Future Work

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 4 / 17

Page 5: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

How it will work

Students turn in work.TAs scan work after they grade it.Isolate the characters with normalized cross correlation.Perform OCR on cut out letters using a machine learningtechnique.Predict the string representation of the name based on the OCRresults and the roster.

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 5 / 17

Page 6: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Outline

1 Steps Involved

2 Data

3 Learning Framework

4 Results

5 Future Work

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 6 / 17

Page 7: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Training DataIn order for the project to work, we need a large number ofexamples of each character.Used ABCDETC template for character examples.Scanned in filled-out sheets and used the Hough Transform to cutout letters.Sized all characters to 24 by 24 pixels.

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 7 / 17

Page 8: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Jittering

In order to product more training data, we apply rotation andtranslation to the images we already have, to cover more bases.

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 8 / 17

Page 9: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Test Data

Had people fill in their names in character boxes.Used normalized cross correlation to cut out characters.

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 9 / 17

Page 10: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Outline

1 Steps Involved

2 Data

3 Learning Framework

4 Results

5 Future Work

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 10 / 17

Page 11: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

First Attempt: Nearest Neighbor

Have a large amount of examples of each type of letter ’a’ through’z’.Compute the Euclidean distance between an unknown letter andall training examples.Whichever letter is closest is what we classify the unknown letteras.

Figure: Example of when the Nearest Neighbor method would perform poorly.

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 11 / 17

Page 12: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Second Attempt: AdaBoost with Haar-like featuresWe create 26 classifiers - 1 for each letter. The ’a’ classifier gives aconfidence of whether or not an image is an ’a’ or not.Haar-like features:

In the beginning, an obscene number of Haar-like features arecreated, but the number is cut down to the 200 most definingfeatures.Instead of representing the image as a bitmap, we represent it asa 200 x 1 vector of the best Haar-like features.

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 12 / 17

Page 13: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Taking the Roster into Account

Given an unknown handwritten name, we consider all names inthe roster that have the same number of characters.For each of these possibilities, we find the confidence of eachletter being the corresponding letter in the possibility.Whichever possibility yields the highest confidences is the winner.

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 13 / 17

Page 14: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Outline

1 Steps Involved

2 Data

3 Learning Framework

4 Results

5 Future Work

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 14 / 17

Page 15: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Results

The results are not good. The classifiers do not perform as well asthey should.

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 15 / 17

Page 16: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Outline

1 Steps Involved

2 Data

3 Learning Framework

4 Results

5 Future Work

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 16 / 17

Page 17: Recognition of Handwritten Names IIcseweb.ucsd.edu/~dbitton/finalpresentation.pdf · Dafna Bitton (March 11, 2009) CSE190a Final Presentation 1 / 17. Problem Statement Given a class

Improvements to be Made

DebuggingI would like to redo the training phase. If the training data is notgood, then I can’t expect to get good results.In the training phase, I want to be as close to the conditions that Iwill be in during testing.

Dafna Bitton (March 11, 2009) CSE190a Final Presentation 17 / 17


Recommended