+ All Categories
Home > Documents > Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research,...

Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research,...

Date post: 13-Jan-2016
Category:
Upload: robyn-cummings
View: 220 times
Download: 0 times
Share this document with a friend
19
| 2005/11/30 | ICDM 05 Tokyo Research Laboratory © Copyright IBM Corporation 2005 IBM Research, Tokyo Research L ab. Tsuyoshi Idé Pairwise Symmetry Decomposition Method for Generalized Covariance Analysis
Transcript
Page 1: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

IBM Research, Tokyo Research Lab.

Tsuyoshi Idé

Pairwise Symmetry Decomposition Method for Generalized Covariance Analysis

Page 2: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 2

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Summary: We generalize the notion of covariance using group theory.

covariance theoretical properties

2-variate cross cumulant

An irreducible representation of a

group

))(( yyxx

dxdyyxp ),(

where

generalization

New approach to pattern recognition

2-variate CC as irreducible

representations

Page 3: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 3

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Motivation.The traditional covariance cannot capture nonlinearities.

x

y

data point(xi , yi )

The traditional covariance cannot capture nonlinearities.

N

iiixy yyxx

NC

1

))((1

1

# of data points

Cxy would be useless in this case.

We wish to explicitly define useful metrics for nonlinear correlations.

cf. kernel methods are black-boxes

Page 4: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 4

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

The lowest order cross-cumulant (CC) is identical to the covariance.A generalized covariance could be higher-order CC.

2nd order CC is identical to the covariance

Use higher-order CC for generalizing the covariance

Multivariate systems can be described with PDF.

Notation of “cumulant average”

The cumulants of p completely characterize p.

),...,,()( 21 nxxxpp x

* We assume zero-mean data hereafter.

principle #1

Page 5: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 5

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Relation between the covariance and symmetries. The axioms of group can be used to characterize the symmetries.

symmetric w.r.t. x and y, etc.

A collection of such symmetry operations may be used for characterizing the symmetries.

What is the guiding principle to define it?

The axioms of group can be the guiding principle.

Closure, Associativity,

Identity, Inverse.

PDF

Page 6: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 6

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

The set of OPs is almost unique --- C4v is the most appropriate group for characterizing the pairwise correlations.

Requirements on the group G G should include OPs describing the symmetri

es within the xy plane. G should include an OP to exchange x with y.

Most general one is

a group named C4v

Point group ?only rotations and mirror reflections

Point group is a natural choice

x

y

y

xrr

only xy

Page 7: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 7

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

What is the C4v group ? It contains 8 symmetry operations within the xy space.

x

yz

45 o

45 o

Page 8: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 8

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Only a single IRR has been used for recognizing correlation patterns. We unveil the other representations !

One can easily prove:

x

y

any correlation pattern

A1

A2

B1

B2

E

Further, a “symmetry decomposition theorem” holds:

linear combination of the five IRRs

Only the B2 component has been used so far.

Find the other IRRs, which haven’t been used so far.

principle #2

Page 9: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 9

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

The two consequences lead to the definition of the generalized covariances, which are symmetrized cross cumulants in the C4v sense.

Note

• There is arbitrariness in prefactors

• x and y should be standardized (unit variance) to be scale invariant

4,C

yx2-variate cross cumulants irreducible representations of C4v

A1, A2, B1, (B2), E

Construct IRRs as linear combinations of CCs

result:

principle #2principle #1

Page 10: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 10

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Experiment with Lissajous’ trajectories. The generalized covariances detect the nonlinearities while the standard covariance C(B2) fails.

•C(B2) should be minus 1 due to the perfect Inverse linear correlation

•C(B2) fails to capture the correlation

•C(A1) succeeds to detect the nonlinear correlation

•C(B2) fails to capture the correlation

•C(E2) and C(E2) succeed to detect the nonlinear correlation

•C(B2) fails to capture the correlation

•C(A2) succeeds to detect the nonlinear correlation

Page 11: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 11

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Summary: We have generalized the notion of covariance using group theory.

covariance theoretical properties

2-variate cross cumulant

B2 irreducible representation of C4v

))(( yyxx

dxdyyxp ),(

where

generalization

Symmetry decomposition: new view to pattern recognition

2-variate CC as A1, A2, B2, and E representations

Page 12: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 12

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Thank you !!

Page 13: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 13

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Background. How can you tell the difference of the two states quantitatively? The traditional covariance is not helpful.

x

ystate B

x

ystate A

data point(xi , yi )

Traditional covariance

N

iiixy yyxx

NC

1

))((1

1

# of data points

Cxy would be useless in this case.

The traditional covariance cannot capture nonlinearities.

We wish to explicitly define useful metrics for nonlinear correlations.

cf. kernel methods are black-boxes

Page 14: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 14

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

•Take only 2-variate CC like

•Take only lower order ones up to

Covariance as the lowest order CC: Summary of this section. We focus on cross cumulants (CC) as a theoretical basis.

Describe n-variate systems using its PDF p

),...,,()( 21 nxxxpp xemploy cumulants as features of p

assumptioncharacterization approximation

4

cumulants are expansion coeff. w.r.t. s

Cumulant generating function

Definition of cumulants

Notation of “cumulant average”

Page 15: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 15

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Sparse Correlation Approximation of the cumulant generating function. What kind of terms are omitted?

)(s cumulant generating function

Cix Cji xx

C

3kji xxx

C

2ix

1K 2K 3K ....

C

3ix

C

2ji xx

C

3ji xx

Ckji xxx

C

2kji xxx

C

22kji xxx

2-body cluster

SCA

terms taken * A pair of variable will be represented as x and y hereafter.

Page 16: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 16

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Mathematical preliminaries. Position operator, Hilbert space, Dirac’s bra-ket notation, and moments in the bra-ket notation

x

y

D: The domain of p(r)

y

xr

DH rr |

Def. of the position eigenstateHilbert space spanned by the position eigenstate

rr ax ˆ

b

ar then etc.

position operator

pp rr )(

D

pdp )(rrr: vector state a of Def. p

where is the marginal DF wrt (x,y)

yxpyxdyxpD

)(rr

moments ofnotation ket -Bra

yxyx r

: vector state a of Def. vyx

Page 17: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 17

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Verifying the definition of the generalized covariances: a few examples.

representation matrices are all 1 A1 representation

You can use the method of projection operators if want to construct IRRs systematically

(See a textbook of group theory)

Page 18: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 18

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Experiment : Calculating the generalized covariances for Lissajous’ trajectories analytically.

Model of correlated variables → assume the uniformly distribution over t mean is zero for both x and y variance is 1 for both x and y

Generalized covariances can be explicitly calculated

x

y

Page 19: Tokyo Research Laboratory © Copyright IBM Corporation 2005 | 2005/11/30 | ICDM 05 IBM Research, Tokyo Research Lab. Tsuyoshi Idé Pairwise Symmetry Decomposition.

| 2005/11/30 | ICDM 05Page 19

Tokyo Research Laboratory

© Copyright IBM Corporation 2005

Detailed summary.

We generalized the traditional notion of covariance based on the two theoretical properties

1. Standard covariance is the lowest order 2-variate cross cumulant

2. Standard covariance is the B2 irreducible representation of the C4v group.

Our result suggests a new approach to pattern recognition where patterns are characterized by the irreducible representation of a finite group

Practically, we found that C(B2) would be greatly enhanced for linear corr

elations. C(E1) and C(E2) reflect some asymmetries in th

e distribution. C(A1) clearly takes a large value when the distr

ibution has a donut-like shape. Finally, C(A2) would be enhanced by distributio

ns with some Hakenkreuz-like correlations.

These features can be used in anomaly detection tasks where nonlinear correlations plays some important role


Recommended