+ All Categories
Home > Documents > Automatic Monitoring for Interactive Performance and Power...

Automatic Monitoring for Interactive Performance and Power...

Date post: 01-Aug-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
52
Krisztián Flautner - [email protected] Automatic Monitoring for Interactive performance and Power Reduction 1 Automatic Monitoring for Interactive Performance and Power Reduction Krisztián Flautner [email protected]
Transcript
Page 1: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 1

Automatic Monitoring for Interactive Performance and Power Reduction

Krisztián [email protected]

Page 2: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 2

Overview

• A mechanism for quantifying the user experience.– Metric: response time.– Automatic, no user program modifications required.– Run-time feedback to the kernel.

• Multiprocessing to improve response times.• Slow down processor to save energy when

response times are fast enough.

Page 3: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 3

Research contributions

• A metric (TLP) and a portable methodology for quantifying the amount of concurrency in a multiprocessor system.

• An automatic technique for detecting execution episodes that directly impact the user-perceived response times of interactive applications.

• Quantifying how much multiprocessing improves the responsiveness of interactive applications.

• An automatic mechanism for setting the optimum performance level of processors that support dynamic voltage scaling.

Page 4: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 4

Response time

• Faster is not always better.– Fundamental limit to what is perceptible to humans.

• Movies: 20-30 frames per second.• Perceptual causality: 50ms-100ms.• Dragging objects on screen: 200ms.• Non-continuous operation: 1-2sec.

The time it takes for the computer to respond to user initiated events.

The goal is to run fast enough to meet the perception threshold, no point to running any faster.

Page 5: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 5

Episode classification

• Interactive episodes– When the user is waiting for the computer to respond.

• Periodic episodes– Producer (e.g. MP3 player).– Consumer (e.g. sound daemon).

Page 6: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 6

A utilization trace

Each horizontal quantum is a millisecond, height corresponds to the utilization in that quantum.

Page 7: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 7

Episode classification

Interactive (Acrobat Reader), Producer (MP3 playback), and Consumer (esd sound daemon) episodes.

Page 8: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 8

Mouse movement

X server updates screen every ~10ms. Update takes ~0.25ms.

Page 9: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 9

Interactive episodes

Page 10: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 10

Interactive episodes can include idle time

Waiting for data from the network during a run of Netscape. Page rendering starts after 250ms.

Page 11: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 11

Finding interactive episodes

• One way: mouse click indicates start, long idle time indicates end.– Not always accurate.– Not all episodes are initiated by mouse click.– Latency in finding the ends of episodes.

• Our approach: track inter-task communication.– Accurate.– Finds all interactive episodes.– No latency.– No program modifications required.

Page 12: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 12

Tracking interactive episodes

• Start of an interactive episode:– X server sends a message to another task.

• During interactive episode:– Keep track of communicating tasks (episode’s task set).– Compute desired metrics.

• Conditions for ending the episode (applied to tasks in the episode’s task set):– No tasks are executing.– Data written by the tasks have been consumed.– No task was preempted the last time it ran.– No tasks are blocked on I/O.

Page 13: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 13

Communication between tasksCPU 1CPU 0

895

757

757

757778

778

889895757

2088

757

R

R

R

W

W

W

W

W

W

W

CPU 1C PU 0

757

7572090

757

W

W

W

757 W

757W

757 W

757 W

Page 14: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 14

Does multiprocessing improve interactive performance?

Metrics: Response time, thread-level parallelism (TLP).• Response time: duration of interactive episode.• Machine is idle when all processors are idle.• TLP: machine utilization when machine is not idle.

Results relevant to SMT, CMP processors.

Workloads: interactive desktop applications.

OS: Linux 2.3.99-pre3, Mandrake 7.1, glibc 2.1.3, XFree86 3.3.6.

Hardware: Dell Precision 410 Workstation: dual Pentium II 450Mhz, 512M RAM, Matrox Millennium II AGP 4M.

Page 15: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 15

Why use TLP?

0%

25%

50%

75%

10 0%

Be nc hmarks

Machine Utilization

Idle time

Automatedbenchmark runs

"Realis tic"benchmark runs

Machine utilization only quantifies concurrency if there is no idle time during execution.

Page 16: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 16

Initial results

• Surveyed >50 desktop applications– BeOS, Linux and Windows NT.

• Lots of threads, but limited concurrency.– Multimedia, web: 1.2~1.4.– TLP is workload dependent. Photoshop: 1.23-2.36 TLP.– Java apps similar to Windows apps.

• Lots of idle time (often >80% of execution time).

• 4 processor machine is overkill (for apps other than make –j and parallel MPEG player).

Does TLP translate into improved response times?

Page 17: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 17

Workloads and TLP results

88%89%1.271.31Average

92%93%1.211.26Text editor21.1p8Xemacs

89%90%1.281.34Web browser4.7Netscape

84%88%1.241.26Image editor1.1.22GIMP

84%84%1.391.42PS and PDF viewer3.5.8Ghostview

93%93%1.331.35Document editor5.5.6bFrameMaker

87%88%1.191.20PDF viewer4.0Acroread

IdlerunIdlerunTLPrunTLPie

UniprocessorDual processorDescriptionVersionBenchmark

Page 18: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 18

Methodology

• All benchmarks run by a human– Non-intrusive automation is difficult.

• Repeated runs of the same workload are not identical.– Inexact repeat of mouse movement.– Different amounts of idle times between episodes.– Background activity.

• Average results of seven runs in each configuration.– Mouse clicks used to synchronize traces.– TLP identical, response time variance <3%.

Page 19: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 19

Response time improvement over uniprocessor

22%1.32Average

21%1.34Netscape

19%1.26GIMP

34%1.42Ghostview

22%1.35FrameMaker

15%1.20Acroread

Response-time (TR) improvementTLPieBenchmark

Very little idle time (<1%) during interactive episodes.Max. possible response-time improvement is 50% on a dual-processor.TR improvement = 1 - TR(DP) / TR(UP) (expected to be close to 1 – 1 / TLP)

Page 20: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 20

Background activity: MP3 playback

1.231.27TLPrun

1.361.31TLPie

29%22%Avg. TR improvement on dual-processor

MP3No MP3

4%14%Avg. TR increase due to MP3 playback

Dual-processorUniprocessor

P1 P2

Page 21: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 21

Time above the perception threshold

0%

20%

40%

60%

80%

100%

50ms 100ms 150ms 200ms 250ms 300ms

Perception threshold

Tim

e a

bove

the

perc

eptio

n th

resh

old

Acrobat Reader

FrameMaker

Ghostview

GIMP

Netscape

Time above the perception threshold is given as a percentage of time spent in all interactive episodes. Data is from the uniprocessor runs.

Page 22: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 22

Characteristics of Interactive Episodes

• Many interactive episodes are already fast enough.• More will be imperceptible in the near future.

– 200ms perception threshold today estimates work done during 50ms 3 years from now.

• Faster is not necessarily better.– Human perception has finite resolution.

Slow down the processor!

Page 23: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 23

Why bother?

386 386

486 486

Pentium(R) Pentium(R) MMX

Pentium Pro (R)

Pentium II (R)

1

10

100

1.5µ1.5µ1.5µ1.5µ 1µ1µ1µ1µ 0.8µ0.8µ0.8µ0.8µ 0.6µ0.6µ0.6µ0.6µ 0.35µ0.35µ0.35µ0.35µ 0.25µ0.25µ0.25µ0.25µ 0.18µ0.18µ0.18µ0.18µ 0.13µ0.13µ0.13µ0.13µ

Max

Pow

er (W

atts

) ?

Sour

ce: I

ntel

Higher performance = increased power consumption.

Page 24: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 24

Power Density!

1

10

100

1000

1.5µ1.5µ1.5µ1.5µ 0.8µ0.8µ0.8µ0.8µ 0.35µ0.35µ0.35µ0.35µ 0.18µ0.18µ0.18µ0.18µ 0.1µ0.1µ0.1µ0.1µ

Wat

ts/c

m2

Hot plate

Nuclear Reactor

RocketNozzle Sun’s

Surface?

Sour

ce: I

ntel

Page 25: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 25

Dynamic Voltage Scaling

• Voltage is proportional to the frequency.• Reduce frequency (and corresponding voltage)

to match performance demands.• Since reduced frequency implies increased

execution time, energy is proportional to v2.

Power = Capacitance • voltage2 • frequency

Energy ~ voltage2

Page 26: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 26

Processors supporting DVS

5.4

0.18

1000Mhz1.75V1.45W

150Mhz0.75V40mW

Intel XScale Demo

4

0.18

800Mhz1.5V

900mW

150Mhz0.75V40mW

Intel XScale

1.84.49Max/min energy

0.180.350.6Process

700Mhz1.6V~2W

251Mhz1.65V

964mW

100Mhz3.3V

220mWMax.

500Mhz1.2V~1W

59Mhz0.79V

106mW

8Mhz1.1V

1.8mWMin.

Transmeta Crusoe 5600Intel SA-1100lpARM

Page 27: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 27

Some recent desktop processors

38W66W

0.18

200Mhz, 266Mhz1.6V

650Mhz @ 1.75V1.2Ghz @ 1.75V

AMD AthlonModel 4

12W19.1W

0.18

100Mhz, 133Mhz3.3V

500Mhz @ 1.35V733Mhz @ 1.65V

Intel Pentium III

17W19.1W

0.18

133Mhz1.8V-2.5V

533Mhz @ 1.8V667Mhz @ 1.8V

MPC 7450

66.3W

0.18

400Mhz

1.4Ghz @ 1.7V

Intel Pentium IV

Max. Power

Process

I/O

Core

Page 28: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 28

Small performance reduction = big energy savings

20% performance reduction = 32% energy reduction40% performance reduction = 55% energy reduction

0

0.4

0.8

1.2

1.6

2

0 200 400 600 800 1000 1200

Frequency (Mhz)

Volta

ge (V

)

0

0.2

0.4

0.6

0.8

1Energy factor

Graph based onIntel XScale data

Page 29: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 29

The key: performance-setting algorithm

• Use episode detection and classification.– Interactive episodes.– Periodic episodes (producer and consumer).

• Performance-setting on a per episode basis.• Stretch episodes to their deadlines.

– Interactive episode: perception threshold.– Stretch producer to consumer.

No modification of existing programs needed.Works with irregular processor utilization and multiprogramming.

Page 30: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 30

Producer and consumer episodes

• Example: MP3 playback through esd sound daemon.• Monitor communications to/from sound daemon.• Distance between producer and consumer episodes determines

necessary performance level.

Sound daemon

MP3 player

HW sound device

Page 31: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 31

Cumulative interactive episode length distributionFr

ameM

aker

Episode length (sec)

Cumulative numberCumulative time

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

100%

1e-05 0.0001 0.001 0.01 0.1 1

50ms10ms

Minimum performance level sufficient Max. performance

Page 32: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 32

Cumulative interactive episode length distributionXe

mac

s

Episode length (sec)

Cumulative numberCumulative time

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

100%

1e-05 0.0001 0.001 0.01 0.1 1

50ms10ms

Minimum performance level sufficient Max. performance

Page 33: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 33

Performance-setting strategy for interactive episodes

• Predict the performance factor that would be correct most of the time (not for most events).– Based on past optimal performance factors.

• Limit worst case impact on response time.

• No need to predict episode length.– Performance factors have smaller range.

Page 34: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 34

Performance-setting for interactive episodes

• Wait 5ms before transition to ignore short episodes• Switch to predicted performance level.

• If episode duration reaches PanicThreshold, switch to maximum performance.

• Estimate full performance episode duration.• Compute optimum performance level for past episode.• Compute new prediction based on optimum settings.

At the beginning of the episode

During the episode

At the end of the episode

PanicThreshold = PerceptionThreshold(1 + PerformanceFactor)Predicted PerformanceFactor is the average of past optimum settings, weighted by the corresponding episode lengths.

Page 35: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 35

Performance-setting algorithm

• Enter period-sampling mode.• Switch to maximum performance.• Establish base performance level.• Exit period-sampling mode.

Periodic activity detected

• If not in period-sampling mode, apply interactive episode performance-setting policy.

Start of interactive episode

• Update interactive episode statistics.• Switch to base performance level, if there is periodic

activity on the machine.

End of interactive episode

Page 36: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 36

Advantages

• Automatic.• Impact on response time is quantifiable.

– Performance can be adapted to the user’s preference.• Works well in the presence of multiprogramming.• Irregular processor utilization is not a problem.• Implementation requires very little state.

– Weighted average: two counters.• Rescale to adapt to temporal variations.

Existing interval-based schemes:• No feedback about service quality.• Only work well if processor utilization is regular.

Page 37: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 37

Performance-setting during the Acrobat Reader benchmark (200ms p.t.)

0

0.2

0.4

0.6

0.8

1

0 2 4 6 8 10 12 14 16 18104 124

Time (sec)

Perfo

rman

ce fa

ctor

Transitions to maximum performance level are due to reaching the PanicThreshold

Page 38: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 38

Performance-setting during the Acrobat Reader + MP3 benchmark (200ms p.t.)

0

0.2

0.4

0.6

0.8

1

0 5 10 15 20

Time (sec)

Perfo

rman

ce fa

ctor

Transitions due to PanicThreshold

Full performance for periodic activity.

Page 39: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 39

Hardware assumptions

1msVoltage transition time

0.02msPLL resynch time (stalls execution)

1000Mhz @ 1.75VMaximum performance

150Mhz @ 0.75VMinimum performance

Assumptions based on Intel Xscale.

We assume that processor switches to sleep mode when it is not executing an episode.

Page 40: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 40

Energy factors (no MP3)

0%

20%

40%

60%

80%

100%

50ms 100ms 150ms 200ms 250ms 300ms

Perception threshold

Ener

gy fa

ctor

Acroread FrameMakerGhostview GIMPNetscape Xemacs

Page 41: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 41

Energy factors with MP3 playback

0%

20%

40%

60%

80%

100%

50ms 100ms 150ms 200ms 250ms 300ms

Perception threshold

Ener

gy fa

ctor

Acroread FrameMakerGhostview GIMPNetscape Xemacs

Page 42: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 42

Changes in cumulative episode lengths as the result of performance scaling (Xemacs 50ms p.t. )

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

100%

1e-05 0.0001 0.001 0.01 0.1 1

50ms10ms

Episode length (sec)

Bef

ore

perf

orm

ance

sca

ling After perform

ance scaling

Cum

ulat

ive

perc

enta

ge o

f tim

e

Page 43: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 43

Desired improvements

• Processor parameters are good enough.– Faster voltage transitions would help a little.– As peak performance gets higher, lower minimum

performance is desirable.

• More sophisticated prediction algorithms.– Distinguish between episode instances, not just

episode types.

Page 44: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 44

Conclusions• Multiprocessing can significantly improve response times.

– Measured 15%-38% improvement (out of possible 50%)!

• Many interactive episodes are already fast enough.– More will be fast enough in the near future.– Use Dynamic Voltage Scaling to save energy.

• Episode classification based on inter-task communication.– Fast, accurate, no user program modifications required.

• Performance-setting based on episode classification.– Works well with multiprogramming, irregular processor utilization.– Ensures high quality interactive performance.– Significant energy savings (10%-80%).

Page 45: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 45

Future work

• Evaluate our algorithms on real hardware.– Processors are slowly becoming available.– Impact on interactive performance.

• An API to specify episodes.– Light-weight: specify hints, not complete information.– Works in concert with existing detection mechanism.

• Apply episode detection to other problems.– Scheduler: can real-time deadlines be detected

automatically?

Page 46: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 46

fin.fin.fin.fin.

Page 47: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 47

The performance gap

1

10

100

1000

10000

100000

0 1.5 3 4.5 6 7.5 9Time (years)

Perf

orm

ance

Available performancestarts accommodatingrequirements (A).

Desired performance

Available P erformance

All performancerequirements are met (B).

Slowes t availableperformance exceedsminimum requirements (C).

Available performanceis higher than required (D).

Page 48: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 48

Applicability to other environments

Technique exploits information from existing design patterns.

On Linux with X windows:• Communication through sockets, pipes, signals.• Well-known tasks: X server, sound daemon, etc.• Select syscall used for asynchonous I/O.• Use of blocking system calls in dedicated threads.

Other systems:• Adapt to that system’s design patterns and IPC mechanisms.

Page 49: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 49

Computing the performance factor for interactive episodes

1

PFmin

perceptionthreshold

minimum-performancethreshold

PF1 = PFminfull-speed episode duration

perception thresholdPF2 = PF3 = 1

Perform

ance

Full-speed

fact

or

episode duration (sec)

Page 50: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 50

Performance scaling

Deadline

Per

form

ance

Per

form

ance

A

BTime

Page 51: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 51

Energy-delay (no MP3)

Increase of perceptible interactive episode lengths

Ener

gy fa

ctor

Page 52: Automatic Monitoring for Interactive Performance and Power ...web.eecs.umich.edu/~manowar/oldweb/publications/... · Krisztián Flautner - manowar@engin.umich.edu Automatic Monitoring

Krisztián Flautner - [email protected] Monitoring for Interactive performance and Power Reduction 52

Energy-delay (MP3)

Increase of perceptible interactive episode lengths

Ener

gy fa

ctor


Recommended