Pattern Recognition Final Task

Pattern RecognitionFinal Task

Ibrahim Arief – 185099Timo Eckhard – 185126

University of JoensuuDecember 17th, 2009

Contents• M-Fold-Cross Training• Color Data

– Preprocessing– Bayesian Classifier– Multilayer Perceptron– K-Means Clustering

• Speech Data– Preprocessing– Bayesian Classifier– Multilayer Perceptron– K-Means Clustering

• Summary

M-Fold-Cross Training

• Partition into M subsets• One subset is assigned as test subset, the rest is

training subset• We use the training subset for testing against

test subset• Assign other subset as new test subset, the rest

is training subset for that particular one• Repeat until all partition took their turn being

tested

Spectral Color Data - Preprocessing

Spectral Color Data – Bayesian Classifier (1)

• Raw spectral input – all classified to class 31 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24

1 0 0 1 3 6 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 02 0 0 1 3 6 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 03 0 0 1 4 8 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 04 0 0 1 4 4 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 05 0 0 1 5 9 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 06 0 0 1 5 9 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 07 0 0 1 5 1 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 08 0 0 1 5 1 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 09 0 0 1 6 3 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 010 0 0 1 5 9 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 011 0 0 1 4 4 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 012 0 0 1 4 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 013 0 0 1 4 4 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 014 0 0 1 3 6 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 015 0 0 1 3 6 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 016 0 0 1 4 8 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 017 0 0 1 4 4 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 018 0 0 1 3 3 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 019 0 0 1 3 3 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 020 0 0 1 4 4 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 021 0 0 1 4 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 022 0 0 1 4 4 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 023 0 0 1 4 4 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 024 0 0 1 3 6 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0


• Preprocessing : Tristimulus• Nice clumping, linearly separable


• Very high accuracy : 99.97%

Spectral Color Data – Multi Layer Perceptron

• Raw spectral data as input : ~5%• Tristimulus as input : ~30%• Question : parameters?• Answer : exhaustive search?

softmax quasinew 16 33,96% softmax hmc 14 24,21%

softmax conjgrad 13 31,89% logistic hmc 14 23,87%

softmax scg 14 28,77% softmax conjgrad 11 23,37%

softmax scg 16 28,03% softmax scg 13 23,29%

softmax conjgrad 14 27,03% softmax conjgrad 10 22,64%

softmax conjgrad 16 26,27% softmax scg 11 21,49%

softmax hmc 16 26,05% logistic quasinew 13 21,33%

softmax hmc 15 25,81% logistic scg 15 21,20%

softmax quasinew 15 24,39% logistic quasinew 15 21,02%

softmax conjgrad 15 24,29% softmax quasinew 10 20,88%

Spectral Color Data – K-Means Clustering

Speech Data – Preprocessing (1)

• MFCC – Timeseries?• Plot of coefficients within a class


• Plot of variance for each coefficient


• Plot of bayesian accuracy for n-least-varied


• Delta-coefficients• Source:

http://cslu.cse.ogi.edu/fsj/issues/issue5/sparse-ann/PhoneProbEst.html

• Formula

• Dimensionality reduction1 2 3 4 5 6 7 8 9 10

1 2 3 4 5 6



Speech Data – Bayesian Classifier

• Frequency matters• No risk matrix• Raw accuracy : 18.13%• Delta-coefficient preprocessing : 96.06%

Speech Data – Multi Layer Perceptron

• Hidden Neuron : 22• Normalized Raw Data : 20.25%• Reduced dimension, delta coefficient : 29.52%• Delta coefficient without reduced dimension :

27.84%

Summary – Spectral Color Data

• Bayesian Classifier– Raw Data : 3.92%– Preprocessed : 99.97% (tristimulus)

• Multi Layer Perceptron– Raw Data : ~5%– Preprocessed : 58.1% (tristimulus)

99.7% (tristimulus + CIELAB + sRGB)• K-Means Clustering– Raw data : 92%– Preprocessed : 95%

Summary – Speech Data

• Bayesian Classifier– Raw Data : 18.19%– Preprocessed : 96.09% (delta-derivative, high

variance elimination)• Multi Layer Perceptron– Raw Data : 20.25%– Preprocessed : 29.52% (delta-derivative, high

variance elimination)• K-Means Clustering– Raw data : 24%– Preprocessed : 62% (normalized, delta-derivative)

Date post:	07-Feb-2016
Category:	Documents
Upload:	hagop
View:	44 times
Download:	0 times

Pattern Recognition Final Task

Documents