+ All Categories
Home > Documents > Frame-Level Pipelined Motion Estimation Array Processor Surin Kittitornkun and Yu Hen Hu IEEE Trans....

Frame-Level Pipelined Motion Estimation Array Processor Surin Kittitornkun and Yu Hen Hu IEEE Trans....

Date post: 19-Dec-2015
Category:
View: 215 times
Download: 0 times
Share this document with a friend
Popular Tags:
27
Frame-Level Pipelined Frame-Level Pipelined Motion Estimation Array Motion Estimation Array Processor Processor Surin Kittitornkun and Yu H Surin Kittitornkun and Yu H en Hu en Hu IEEE Trans. on, for Video Tech., Vol. 11, NO.2 FEB, 20
Transcript

Frame-Level Pipelined Motion Frame-Level Pipelined Motion Estimation Array ProcessorEstimation Array Processor

Frame-Level Pipelined Motion Frame-Level Pipelined Motion Estimation Array ProcessorEstimation Array Processor

Surin Kittitornkun and Yu Hen HuSurin Kittitornkun and Yu Hen Hu

IEEE Trans. on, for Video Tech., Vol. 11, NO.2 FEB, 2001

OUTLINE• Methodology for VLSI Array

Processors Design• An Example on Frame Level Block

Matching Algorithm

Design Levels• Sequential Algorithm• 1.DG Design• 2.SFG Design• 3.VLSI Array Design

Dependence Graph (DG)

DG: 1.Shift Invariant

Shift-Unvariant DG for Sorting Algorithm

For i from 1 to NFor j from 1 to im(i+1,j) <- max[x(i,j), m(i,j)]x(i,j+1) <- min[x(i,j),m(i,j)]

DG: 2.Localization

Broadcast vs. Transmittent Data

kjkkj

kj

j

kkjkj

wuyy

wuy

1

0

DG: 3.Reversible Arcs for Associative Operations

• If the operation used in the recursion is associative, then the directions of the arcs may be reversible.

kjkkj

kj

kjkkj

kj

wuyy

wuyy

1

1

DG: 4.Localization with Intermediate Variables Involved• AR Filtering

Algorithm3122130444

1

yayayayauy

uyayN

kjkjkj

DG: 4.Localization with Intermediate Variables Involved• AR Filtering Algorithm

– Spiral Communication Approach– Local Communication Approach

Signal Flow Graph (SFG)

Input(1) Output(1)

Input(2)Output(2)

D

x(n) x(n-1)

SFG Projection Procedure• For any projection direction, a

processor space is orthogonal to the projection direction.

• Replace the arcs in the DG with zero or nonzero delay edges between their corresponding processors.

• Attach the input and output data to their corresponding processors.

Projection ExampleInsertion sorting

Insertion Sorting

Selection sortingBubble sorting

1

0,

0

1,

0

1Psd

Selection Sorting

0

1,

1

0,

1

0Psd

Insertion Sorting

1

1,

1

1,

1

1Psd

SFG to Systolic Array• Replace Operation Node with PE.• Place data and Input/Output pin

with delay units.

Frame-Level Pipelined Motion Frame-Level Pipelined Motion Estimation Array ProcessorEstimation Array Processor

Frame-Level Pipelined Motion Frame-Level Pipelined Motion Estimation Array ProcessorEstimation Array Processor

Surin Kittitornkun and Yu Hen HuSurin Kittitornkun and Yu Hen Hu

IEEE Trans. on, for Video Tech., Vol. 11, NO.2 FEB, 2001

Six-level nested Do-loop FSBM

Two-level nested Do-loop

FSBM

Two-level nested Do-loop FSBM

kth-clock cycle

(v-1)NhN2

(h-1)N2

(i-1)N

j

kth-clock cycle

2D Localized DG of row 1, v

=1

Search area and current framecoordinates of Nv = 3; Nh = 2; p =N/2 = 1.

2p+1

2p+1

Linear SFG of (2p + 1)2 PEs, p = N/2 = 1 after systolic ma

pping of 2-D DG.

Linear SFG of (2p + 1)2 PEs, p = N/2 = 1 after systolic mapping of 2-D DG.

Systolic array with spiral interconnections

Microarchitecture of PE

Scheduled search area data

Scheduled search area data

Performance


Recommended