+ All Categories
Home > Documents > Proximity-Aware Directory-based Coherence for Multi-core...

Proximity-Aware Directory-based Coherence for Multi-core...

Date post: 19-Jul-2020
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
76
Proximity-Aware Directory-based Coherence for Multi-core Processor Architectures Jeff Brown Rakesh Kumar Dean Tullsen UC San Diego University of Illinois at Urbana-Champaign SPAA 19 June 9, 2007
Transcript
Page 1: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware Directory-basedCoherence for Multi-core Processor

Architectures

Jeff BrownRakesh KumarDean Tullsen

UC San Diego ● University of Illinois at Urbana-ChampaignSPAA 19 ● June 9, 2007

Page 2: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Introduction● The chip multiprocessor (CMP)

era is upon us!● Caching complicate writes● Cache Coherence ensures

caching is done safely

● Multi-core designs offer new tradeoffs

Page 3: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Introduction● The chip multiprocessor (CMP)

era is upon us!● Caching complicate writes● Cache Coherence ensures

caching is done safely

● Multi-core designs offer new tradeoffs

P

M

P

M

Page 4: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Introduction● The chip multiprocessor (CMP)

era is upon us!● Caching complicate writes● Cache Coherence ensures

caching is done safely

● Multi-core designs offer new tradeoffs

P

M

P

M

P

P M

M

Page 5: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: Directory-basedCache Coherence

● Directory-based; explicit per-block accounting– Doesn't rely on broadcasts

● Directory operation: client/server

Page 6: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: Directory-basedCache Coherence

● Directory-based; explicit per-block accounting– Doesn't rely on broadcasts

● Directory operation: client/server– Processors request data, permissions

P

Page 7: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: Directory-basedCache Coherence

● Directory-based; explicit per-block accounting– Doesn't rely on broadcasts

● Directory operation: client/server– Processors request data, permissions– Directory controllers manage memory access

P Dir

Page 8: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: Directory-basedCache Coherence

● Directory-based; explicit per-block accounting– Doesn't rely on broadcasts

● Directory operation: client/server– Processors request data, permissions– Directory controllers manage memory access

P

M

Dir

Page 9: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: Directory-basedCache Coherence

● Directory-based; explicit per-block accounting– Doesn't rely on broadcasts

● Directory operation: client/server– Processors request data, permissions– Directory controllers manage memory access

P

M

Dir

Page 10: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: Directory-basedCache Coherence

● Directory-based; explicit per-block accounting– Doesn't rely on broadcasts

● Directory operation: client/server– Processors request data, permissions– Directory controllers manage memory access

● Updates, conflicts

P

M

PDir

Page 11: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: HistoricalMP Cache Coherence● Distributed directory, memory

P

M

P

M

P

M

P

M

Page 12: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: HistoricalMP Cache Coherence● Distributed directory, memory

P

M

P

M

P

M

P

M

Cache Miss

Page 13: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: HistoricalMP Cache Coherence● Distributed directory, memory

P

M

P

M

P

M

P

M

Cache Miss "Home Node"

Page 14: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: HistoricalMP Cache Coherence● Distributed directory, memory

P

M

P

M

P

M

P

M

Cache Miss "Home Node"

Page 15: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: HistoricalMP Cache Coherence● Distributed directory, memory

P

M

P

M

P

M

P

M

Cache Miss "Home Node"

Data Request

Page 16: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Background: HistoricalMP Cache Coherence● Distributed directory, memory

P

M

P

M

P

M

P

M

Cache Miss "Home Node"

Data Request

Reply

Page 17: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Motivation: Multi-coreCache Coherence

M

M

P

M

P

P P

M

Page 18: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Motivation: Multi-coreCache Coherence

M

M

P

M

P

P P

M

Cache Miss

Page 19: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Motivation: Multi-coreCache Coherence

M

M

P

M

P

P P

M

Cache Miss

Page 20: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Motivation: Multi-coreCache Coherence

"HomeNode"

M

M

P

M

P

P P

M

Cache Miss

Page 21: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Motivation: Multi-coreCache Coherence

"HomeNode"

Data Request

M

M

P

M

P

P P

M

Cache Miss

Page 22: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Motivation: Multi-coreCache Coherence

"HomeNode"

Data Request

M

M

P

M

P

P P

M

Cache Miss

Page 23: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Motivation: Multi-coreCache Coherence

"HomeNode"

Reply

M

M

P

M

P

P P

M

Cache Miss

Page 24: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Motivation: Multi-coreCache Coherence

M

M

P

M

P

P P

M

AdditionalSharer

Page 25: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Motivation: Multi-coreCache Coherence

M

M

P

M

P

P P

M

AdditionalSharer

● Multi-core designs present radically differentrelative latency & bandwidth

Page 26: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Outline

● Introduction & Background

● System Architecture

● Proximity-Aware Coherence

● Results

● Conclusion

Page 27: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Directory-based Cache Coherence● Directory structures

Page 28: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Directory-based Cache Coherence● Directory structures

MainMemory

Page 29: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Directory-based Cache Coherence● Directory structures

MainMemory

Page 30: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Directory-based Cache Coherence● Directory structures

– Directory Memory

MainMemory

DirectoryMemory

Page 31: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Directory-based Cache Coherence● Directory structures

– Directory Memory– Directory Entries

MainMemory

DirectoryMemory

Page 32: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Directory-based Cache Coherence● Directory structures

– Directory Memory– Directory Entries– Directory Controller

MainMemory

DirectoryMemory

Controller

Page 33: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

A Traditional Multiprocessor

Core

L2 $

Dir

Mem

Interconnect

Core

L2 $

Dir

Mem

Page 34: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

A Traditional Multiprocessor

Core

L2 $

Dir

Mem

Interconnect

Core

L2 $

Dir

Mem

(Chassis, board, etc.)

Page 35: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

A Traditional Multiprocessor

Core

L2 $

Dir

Mem

Interconnect

Core

L2 $

Dir

Mem

(Chassis, board, etc.)

Page 36: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Our 16-Core Chip Multiprocessor

Core L2 $

BusDircontrol

Net.switch

Dir $

Mem.channel

Tile0

Tile1

Tile15

...

Page 37: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Our 16-Core Chip Multiprocessor

Core L2 $

BusDircontrol

Net.switch

Dir $

Mem.channel

Tile0

Tile1

Tile15

...

Page 38: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Our 16-Core Chip Multiprocessor

Core L2 $

BusDircontrol

Net.switch

Dir $

Mem.channel

Tile0

Tile1

Tile15

...

Page 39: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Our 16-Core Chip Multiprocessor

Core L2 $

BusDircontrol

Net.switch

Dir $

Mem.channel

Tile0

Tile1

Tile15

...

Page 40: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Our 16-Core Chip Multiprocessor

Core L2 $

BusDircontrol

Net.switch

Dir $

Mem.channel

Tile0

Tile1

Tile15

...

Page 41: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Our 16-Core Chip Multiprocessor

Core L2 $

BusDircontrol

Net.switch

Dir $

Mem.channel

Tile0

Tile1

Tile15

...

Page 42: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Outline

● Introduction & Background

● System Architecture

● Proximity-Aware Coherence

● Results

● Conclusion

Page 43: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware Coherence● Idea: home node asks sharer nearest requester

to forward its cached copy

Page 44: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware Coherence● Idea: home node asks sharer nearest requester

to forward its cached copy– Stay on-chip when possible

Page 45: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware Coherence● Idea: home node asks sharer nearest requester

to forward its cached copy– Stay on-chip when possible– Minimize transit of large data-carrying replies

Page 46: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware Coherence● Idea: home node asks sharer nearest requester

to forward its cached copy– Stay on-chip when possible– Minimize transit of large data-carrying replies

M

M

P

M

P

P P

M

Page 47: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware Coherence● Idea: home node asks sharer nearest requester

to forward its cached copy– Stay on-chip when possible– Minimize transit of large data-carrying replies

"HomeNode"

Data Request

Cache Miss

M

M

P

M

P

P P

M

Page 48: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware Coherence● Idea: home node asks sharer nearest requester

to forward its cached copy– Stay on-chip when possible– Minimize transit of large data-carrying replies

"HomeNode"

M

M

P

M

P

P P

M

AdditionalSharer

Page 49: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware Coherence● Idea: home node asks sharer nearest requester

to forward its cached copy– Stay on-chip when possible– Minimize transit of large data-carrying replies

"HomeNode"

M

M

P

M

P

P P

M

AdditionalSharer

ForwardRequest

Page 50: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware Coherence● Idea: home node asks sharer nearest requester

to forward its cached copy– Stay on-chip when possible– Minimize transit of large data-carrying replies

Reply

M

M

P

M

P

P P

M

Page 51: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware Coherence

● To service read misses for shared data,traditional protocols use main memory

● Other nodes may hold copies

● On the CMP landscape, inter-node latency ismuch less than memory latency

Page 52: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Sharer Selection

● When the home node lacks a cached copy, itselects a sharer to ask

Page 53: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Sharer Selection

● When the home node lacks a cached copy, itselects a sharer to ask

Miss

Home

Page 54: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Sharer Selection

● When the home node lacks a cached copy, itselects a sharer to ask– rand

Miss

Home

Page 55: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Sharer Selection

● When the home node lacks a cached copy, itselects a sharer to ask– rand– near1

Miss

Home

Page 56: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Sharer Selection

● When the home node lacks a cached copy, itselects a sharer to ask– rand– near1– via1

Miss

Home

Page 57: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Sharer Selection

● When the home node lacks a cached copy, itselects a sharer to ask– rand– near1– via1

● Retries didn't prove beneficial

Miss

Home

Page 58: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Outline

● Introduction & Background

● System Architecture

● Proximity-Aware Coherence

● Results

● Conclusion

Page 59: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Methodology

● Detailed, execution-driven processor andnetwork simulation

● "RSIM" simulator, adapted to our CMP model● Parallel workloads from several suites● Hardware, benchmark details in paper

Page 60: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Potential Coverage

appbt fft lu mp3d ocean quicksort unstruct0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

6

5

4

3

2

1

Fra

ctio

nofre

ad

mis

ses

tosh

are

dlin

es

Page 61: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Potential Coverage

appbt fft lu mp3d ocean quicksort unstruct0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

6

5

4

3

2

1

Fra

ctio

nofre

ad

mis

ses

tosh

are

dlin

es

Page 62: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Potential Coverage

appbt fft lu mp3d ocean quicksort unstruct0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

6

5

4

3

2

1

Fra

ctio

nofre

ad

mis

ses

tosh

are

dlin

es

Overallx=43%

Page 63: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Potential Coverage

appbt fft lu mp3d ocean quicksort unstruct0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

6

5

4

3

2

1

Fra

ctio

nofre

ad

mis

ses

tosh

are

dlin

es

Overallx=43%

Page 64: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Potential Coverage

appbt fft lu mp3d ocean quicksort unstruct0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

6

5

4

3

2

1

Fra

ctio

nofre

ad

mis

ses

tosh

are

dlin

es

Overallx=43%

dist 1x=75%

Page 65: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Latency Benefit

appbt fft lu mp3d ocean quicksort

un-struct

mean0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

1.1

rand

near1

via1

Nor

mal

ized

L2m

iss

late

ncy

Page 66: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Latency Benefit

appbt fft lu mp3d ocean quicksort

un-struct

mean0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

1.1

rand

near1

via1

Nor

mal

ized

L2m

iss

late

ncy

Page 67: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Latency Benefit

appbt fft lu mp3d ocean quicksort

un-struct

mean0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

1.1

rand

near1

via1

Nor

mal

ized

L2m

iss

late

ncy

Latency-25%

Page 68: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Latency Benefit

appbt fft lu mp3d ocean quicksort

un-struct

mean0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

1.1

rand

near1

via1

Nor

mal

ized

L2m

iss

late

ncy

Latency-25%

Page 69: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Latency Benefit

appbt fft lu mp3d ocean quicksort

un-struct

mean0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

1.1

rand

near1

via1

Nor

mal

ized

L2m

iss

late

ncy

Latency-25%

Reply traffic-6%

Page 70: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Latency Benefit

appbt fft lu mp3d ocean quicksort

un-struct

mean0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

1.1

rand

near1

via1

Nor

mal

ized

L2m

iss

late

ncy

Latency-25%

Reply traffic-6%

Page 71: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Speedup

appbt fft lu mp3d ocean quicksort

un-struct

mean0.00

0.20

0.40

0.60

0.80

1.00

1.20

1.40

1.60

1.80

rand

near1

via1Spe

edup

Page 72: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Speedup

appbt fft lu mp3d ocean quicksort

un-struct

mean0.00

0.20

0.40

0.60

0.80

1.00

1.20

1.40

1.60

1.80

rand

near1

via1Spe

edup

Page 73: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Speedup

appbt fft lu mp3d ocean quicksort

un-struct

mean0.00

0.20

0.40

0.60

0.80

1.00

1.20

1.40

1.60

1.80

rand

near1

via1Spe

edup

Speedup16%

Page 74: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Proximity-Aware: Speedup

● L2 latency sensitivity of workloads

appbt fft lu mp3d ocean quicksort

un-struct

mean0.00

0.20

0.40

0.60

0.80

1.00

1.20

1.40

1.60

1.80

rand

near1

via1Spe

edup

Speedup16%

Page 75: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Conclusion

● The latency/bandwidth aspects of CMPsmotivates multicore-aware coherence redesign

● One such change: Proximity-Aware Coherence– Ideas: stay on-chip, decrease "bulk" transit– Mean speedup 16%, mean L2 latency down 25%

● More aggressive techniques are under study

Page 76: Proximity-Aware Directory-based Coherence for Multi-core ...jbrown/papers/proximity-coher-slides.pdf · Background: Directory-based Cache Coherence Directory-based; explicit per-block

Conclusion

● The latency/bandwidth aspects of CMPsmotivates multicore-aware coherence redesign

● One such change: Proximity-Aware Coherence– Ideas: stay on-chip, decrease "bulk" transit– Mean speedup 16%, mean L2 latency down 25%

● More aggressive techniques are under study

● Questions?


Recommended