+ All Categories
Home > Documents > Signal Reconstruction from its Spectrogram

Signal Reconstruction from its Spectrogram

Date post: 31-Dec-2015
Category:
Upload: sopoline-vance
View: 36 times
Download: 7 times
Share this document with a friend
Description:
Signal Reconstruction from its Spectrogram. Radu Balan IMAHA 2010, Northern Illinois University, April 24, 2010. Overview. Problem formulation Reconstruction from absolute value of frame coefficients Our approach Embedding into the Hilbert-Schmidt space Discrete Gabor multipliers - PowerPoint PPT Presentation
23
Signal Reconstruction from its Spectrogram Radu Balan IMAHA 2010, Northern Illinois University, April 24, 2010
Transcript
Page 1: Signal Reconstruction from its Spectrogram

Signal Reconstruction from its Spectrogram

Radu Balan

IMAHA 2010, Northern Illinois University, April 24, 2010

Page 2: Signal Reconstruction from its Spectrogram

2/23

Overview

1. Problem formulation

2. Reconstruction from absolute value of frame coefficients

3. Our approach– Embedding into the Hilbert-Schmidt space– Discrete Gabor multipliers– Quadratic reconstruction

4. Numerical example

Page 3: Signal Reconstruction from its Spectrogram

3/23

1. Problem formulation

• Typical signal processing “pipeline”:

Analysis Processing SynthesisIn Out

Features:Relative low complexity O(Nlog(N))On-line version if possible

Page 4: Signal Reconstruction from its Spectrogram

4/23

<·,gi>x

Analysis Synthesis

Ii

iigc ˆc y

Hx

ighc

Ilc

,

)(

i

2

Iiiigcy

Hy

The Analysis/Synthesis Components:

Example: Short-Time Fourier Transform

fkfk gxc ,, , )()( /2, kbtgetg Fiftfk

)(

10,);,(2 ZlH

ZZFfZkfkI F

)()( 2, kbtgetg kbtiffk

Page 5: Signal Reconstruction from its Spectrogram

5/23

*

=

fft fft

=

*

Data frame index (k)

f

ck,0

ck,F-1ck+1,F-1

ck+1,0

g(t)

x(t+kb:t+kb+M-1) x(t+kb+M:t+kb+2M-1)

x(t+kb)g(t) x(t+(k+1)b)g(t)

Page 6: Signal Reconstruction from its Spectrogram

6/23

*

=

ifft ifft

=*

ĝ(t)

ck,0

ck,F-1ck+1,F-1

ck+1,0

+

Page 7: Signal Reconstruction from its Spectrogram

7/23

Problem: Given the Short-Time Fourier Amplitudes (STFA):

we want an efficient reconstruction algorithm:Reduced computational complexityOn-line (“on-the-fly”) processing

1

/2,, )()(,

Mkb

kbt

Fiftfkfk ekbtgtxgxd

|.| Reconstructionck,f dk,f x

Page 8: Signal Reconstruction from its Spectrogram

8/23

• Where is this problem important:– Speech enhancement– Speech separation– Old recording processing

Page 9: Signal Reconstruction from its Spectrogram

9/23

• Setup: – H=En , where E=R or E=C

– F={f1,f2,...,fm} a spanning set of m>n vectors

• Consider the map:

• Problem 1: When is N injective?• Problem 2: Assume N is injective, Given c=N(x)

construct a vector y equivalent to x (that is, invert N up to a constant phase factor)

2. Reconstruction from absolute value of frame coefficients

mkk

mn fxxNREN

1

,)( , ~/:

1||scalar somefor ,~ zzyxyx

Page 10: Signal Reconstruction from its Spectrogram

10/23

Theorem [R.B.,Casazza, Edidin, ACHA(2006)]

For E = R :• if m 2n-1, and a generic frame set F, then N is

injective;• if m2n-2 then for any set F, N cannot be injective;• N is injective iff for any subset JF either J or F\J

spans Rn.• if any n-element subset of F is linearly independent,

then N is injective; for m=2n-1 this is a necessary and sufficient condition.

mkk

mn fxxNRRN

1

,)( , ~/:

1scalar somefor ,~ zzyxyx

Page 11: Signal Reconstruction from its Spectrogram

11/23

Theorem [R.B.,Casazza, Edidin, ACHA(2006)]

For E = C :• if m 4n-2, and a generic frame set F, then N is

injective.• if m2n and a generic frame set F, then the set of

points in Cn where N fails to be injective is thin (its complement has dense interior).

mkk

mn fxxNRCN

1

,)( , ~/:

1||scalar somefor ,~ zzyxyx

Page 12: Signal Reconstruction from its Spectrogram

12/23

3. Our approach

• First observation:

fkfkgx

HSgxgxfkfk

ggyyKxxyyK

KKKKtrgxd

fk

fkfk

,,

*2

,2,

,)( , ,)(

,,

,

,,

Hilbert-Schmidt

Signal space: l2(Z)

x KxK

nonlinearembedding

Kgk,f

E=span{Kgk,f}

Hilbert-Schmidt: HS(l2(Z))

FZZFfZkfkIZlH 10,);,( , )(2

Ifkkbtgetg kbtiffk ),( , )()( 2

,Recall:

Page 13: Signal Reconstruction from its Spectrogram

13/23

• Assume {Kgk,f} form a frame for its span, E. Then the projection PE can be written as:

where {Qk,f} is the canonical dual of {Kgk,f} .

fk

HSfkgE QKP

fk ,,

,,

)(

,)(

,

,,

1,

,

fk

fkfk

gfk

fkg

HSg

KSQ

KKXXSX

Frame operator

Page 14: Signal Reconstruction from its Spectrogram

14/23

• Second observation:since:

it follows:

)()(: , )()(: where /2

,

bthtThhTthetMhhM

gTMgFit

kffk

** : , : where

,

TXTXXMXMXX

KK gkf

g fk

Page 15: Signal Reconstruction from its Spectrogram

15/23

• However:

• Explicitely:

0,0,

S and

QQ

SSSfk

fk

kbtkbt

Fttif

ttfk QeQ

21

21

21 ,0,0/)(2

,,

Page 16: Signal Reconstruction from its Spectrogram

16/23

Short digression: Gabor Multipliers

• Goes back to Weyl, Klauder, Daubechies• More recently: Feichtinger (2000), Benedetto-

Pfander (2006), Dörfler-Toressani (2008)

LatticeggmmG

dggm

,)( :Multiplierabor

,m : Multiplier STFT 2

Theorem [F’00] Assume {g , Lattice} is a frame for L2(R).Then the following are equivalent:1. {<.,g>g,Lattice} is a frame for its span, in HS(L2(R));2. {<.,g>g,Lattice} is a Riesz basis for its span, in HS(L2(R));3. The function H does not vanish,

)( , ,)()(2

LatticeDualGroupeggeeHLattice

Page 17: Signal Reconstruction from its Spectrogram

17/23

• Return to our setting. Let

Theorem Assume {gk,f}(k,f)ZxZF is a frame for l2(Z).

Then

1. is a frame for its span in HS(l2(Z)) iff for each mZF, H(,m) either vanishes identically in , or it is never zero;

2. is a Riesz basis for its span in HS(l2(Z)) iff for each mZF and , H(,m) is never zero.

Zk

fkZf

F

mfki

ggemHF

2

,

2

,),(

Fg ZZfkKfk

),(;,

Page 18: Signal Reconstruction from its Spectrogram

18/23

• Third observation. Under the following settings:– For translation step b=1;– For window support supp(g)={0,1,2,...,L-1}– For F2L

• The span of is the set

of 2L-1 diagonal band matrices.

Fg ZZfkKfk

),(;,

g

LgLgg

ggg

Lggggg

K g

000000

0)1()1()0(0

00

0)1()1()0(0

0)1()0()1()0()0(0

000000

2

2

2

Page 19: Signal Reconstruction from its Spectrogram

19/23

• The reproducing condition (i.e. of the projection onto E) implies that Q must satisfy:

bandttXQgXgfk

ttttfkfkfk 21,

,,,,, , and X allfor , ,2121

By working out this condition we obtain:

1

02

2

,0,0)()(

1

d

epgpg

e

FQ

p

ip

it

tt

Page 20: Signal Reconstruction from its Spectrogram

20/23

• The fourth observation:We are able now to reconstruct up to L-1 diagonals of Kx.

This means we can estimate

11

2,,, Lttttt xxxxx

Assuming we already estimated xs for s<t,we estimate xt by a minimization problem:

2

,

2

1,11,

2ˆˆmin JttxJtJttxtttxx KxxwKxxwKx

for some JL-1 and weights w1,...,wJ.

Remark: This algorithm is similar to Nawab, Quatieri, Lim [’83]IEEE paper.

Page 21: Signal Reconstruction from its Spectrogram

21/23

Reconstruction Scheme

• Putting all blocks together we get:

IFFT

|ck,0|2

|ck,F-1|2

W0

WL-1

0ˆtz

1ˆ Ltz

LeastSquareSolver

ttx̂

Stage 1 Stage 2

t

t

ttzQzW

,0,01)(

Page 22: Signal Reconstruction from its Spectrogram

22/23

3. Numerical Example

Page 23: Signal Reconstruction from its Spectrogram

23/23

Conclusions

All is well but ...

• For nice analysis windows (Hamming, Hanning, gaussian) the set {Kgk,f} DOES NOT form a frame for its span! The lower frame bound is 0. This is the (main) reason for the observed numerical instability!

• Solution: Regularization.


Recommended