+ All Categories
Home > Documents > UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B [email protected] / [email protected] 11 Power...

UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B [email protected] / [email protected] 11 Power...

Date post: 31-Jul-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
41
UCLA EE215B [email protected] / [email protected] 1 Energy-Recovery CMOS Design Jay Moon, Bill Athas * Univ of Southern California * Apple Computer, Inc. [email protected] / [email protected] March 05, 2001
Transcript
Page 1: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 1

Energy-Recovery CMOS Design

Jay Moon, Bill Athas*

Univ of Southern California*Apple Computer, Inc.

[email protected] / [email protected]

March 05, 2001

Page 2: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 2

Outline• Motivation

• Review of CMOS switching energetics

• Adiabatic charging

• Energy-Recovery CMOS

• Stepwise charging

• Clock-powered logic (CPL)

• Harmonic resonant charging

• Future Research

Page 3: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 3

Motivation

• It’s becoming increasingly difficult to get rid of the heat generated by VLSI chips

• Battery life for portables

high-performance&

low-power computing

Page 4: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 4

Types of power dissipation

• Dynamic power dissipation– Charging and discharging capacitances

– Short-circuit current

• Static power dissipation– Sub-threshold currents

– Drain-junction leakage

Page 5: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 5

Capacitor energy equations• Suppose at time t, a charge q is transferred from one

plate to the other• The potential v is q/C• For a charge transfer increment of dq, the additional

work is :

• For the total charge transfer Q :

dqCqvdqdE ==

2

2

0

21

21

CVE

CVQCQdq

CqdEE

Q

=

=

=== ∫

Page 6: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 6

CMOS switching energetics• Interestingly (and thankfully) CMOS energetics can be

analyzed and understood from the CMOS inverter.• Charge is conserved• Energy is conserved• Neglect leakage current• Neglect short-circuit current

PSV

0 0

V

C

EPS=VQ=CV 2

Page 7: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 7

The charging event

• Power supply delivers a charge packet of size Q=CVEPS = CV • V = CV2

EC = (1/2)CV2

EPS – EC = (1/2)CV2 = EHEAT• This much energy is dissipated in the pFET

PSV

0 0

V

C

EPS=VQ=CV 2

EHEAT=(1/2)CV2

Page 8: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 8

The discharging event

• Power supply gets the charge at potential 0EPS = 0

• The energy on the capacitor goes from (1/2)CV2 to 0EC – 0 = (1/2)CV2 = EHEAT

• This much energy is dissipated in the nFET• All of the charge is returned to the PS at potential 0

PS

C0

V V

0EHEAT

EPS=0Q=0

Page 9: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 9

Complex gates and pass logic

• Circuit topology does not change energetics• It’s about the potential of the charge• Not where the charge goes

PS

C

0

V V

0

Page 10: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 10

Power supply perspectives• Inject charge at the highest allowed voltage

VDD• Recover returned charge at the lowest allowed voltage

0• Simple scheme of shorting capacitors to VDD or ground

through switches• Maximally wasteful from an energy conservation

standpoint

Page 11: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 11

Power equation• (1/2)CV2 is dissipated to charge the capacitor

• (1/2)CV2 is dissipated to discharge the capacitor

• CV2 is dissipated per charge/discharge cycle

• If we cycle the capacitor F times per second :

P = F • CV2

• Power is the rate at which work is done

• Note that if you need to cycle a capacitor N times from a battery, doesn’t matter if you do it fast or slow.

The battery is just as dead either way

Page 12: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 12

Voltage scaling• Energy decreases quadratically with the voltage

E ~ VDD2

• Delay increases as the voltage reduces

τ ~ VDD/(VDD-VTH)2

τ3.3V / τ2.0V = 0.3E3.3V / E2.0V = 2.7

(assuming Vth = 1V)

Page 13: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 13

Voltage scaling effects

• PowerMillTM simulations of a 16-bit uProcessor

Page 14: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 14

Energy vs. Cycle time

Page 15: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 15

Adiabatic charging• Charging from a variable-voltage source

(e.g. linear ramp)

• Assuming that R is the on-resistance of the switch, the dissipation for charging or discharging C is:

E = (RC/T)•CV2 when T >> RC• Energy can be traded for delay by increasing the charge

transport time• Model the FETs as simple resistors (Rup and Rdn)

T

C

R

0

V

Page 16: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 16

Adiabatic-charging principle

C

Rup

RdnT

ξ(RC/T)CV2

ξ(RC/T)CV2

VDD

Rup

Rdn

CC

Conventional digital CMOS Adiabatic charging

Ecycle = CV2 Ecycle = 2ξ(RC/T)CV2

Page 17: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 17

Energy-Recovery CMOS

• Exploit the on-chip capacitances of CMOS VLSI to reduce power dissipation below the conventional limit (FCV2) using adiabatic charging and energy-recovery

• This research includes:– Clock-energy recovery techniques– Clock-powered logic – balanced power versus speed– Stepwise charging (charging recycling) technique for

• Low-power VLSI pin drivers• LCD panels

– Harmonic resonant charging technique for• Clock signal for conventional chip

clock-poweredchip

energy sourceenergy-efficient

clock driver

Page 18: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 18

Stepwise charging

• The load C is switched from 0 to V and vice-versa through N steps• CT should be roughly 10 times larger than C• Only one supply voltage is required• Intermediate step voltages converge after a few cycles• Dissipation for charging or discharging C is: E = (1/2)(CV2)/N• The overhead for controlling the FETs needs to be considered

CT

CT

C

V

(N-1)V/N

V/N

0

V

charging steps

Page 19: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 19

2-Stepwise Driver

in d_in

t

p

n

in

d_in

t p

nCT CL

V/2

Page 20: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 20

2-Stepwise Driver

• Event 1 : 1/2C(V/2)2 stored, 1/2C(V/2)2 dissipated

• Event 2 : 1/2C(V/2)2 added, 1/2C(V/2)2 dissipated

• Event 3 : 1/2C(V/2)2 recovered, 1/2C(V/2)2 dissipated

• Event 4 : 1/2C(V/2)2 dissipated

• Total dissipation : 1/2C(V/2)2 * 4 = 1/2CV2

t

p

n

in

d_in

t p

nCT CL

V/2(1)

(2)

(3)(4)

Page 21: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 21

Clock-powered logic• Exploits adiabatic charging to reduce dissipation

• Uses clocks as global time-varying voltage sources

• The challenge is to use the clock to drive data nodes

clock line0 1

0

Page 22: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 22

Clock-Powered logic design• Need an efficient clock driver

• Innovate in the design of clock-steering logic

• Use conventional precharged, pass-transistor, static logic

• Use the clock-steering logic for high-capacitance nodes

Page 23: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 23

Resonant clock driver

• Build-up energy in inductor

• Transfer it to the load as a pulse

• Recover the pulsed energy in the inductor

• Repeat the process

Vdcoff-chipinductor

on-chipcapacitive load

powerpulse

Page 24: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 24

The all-resonant clock drivera.k.a blip driver

• Self-oscillating driver generates almost non-overlapping clock pulses

• Highly efficient because of all-resonant gate drive

• Trade-off between frequency stability and power efficiency

VdcL L

Cϕ Cϕ

ϕ1 ϕ2

Page 25: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 25

Clocked buffers

• Clock-pass transistor is critical for speed and power performance

• Bootstrapping yields high conductance per gate capacitance

• Clock voltage swing can be decoupled from the logic voltage swing.

– “Hot clocks” : clock swings above supply

ϕ1

ϕ2

Viso

Din

Vbn

clock-passtransistor

pull-down clamptransistor for

noise immunity

isolationtransistor

gate to channel capacitanceused for bootstrapping

ϕ1 ϕ2

Vbn

Page 26: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 26

Clocked buffers

ϕ1

ϕ2

Viso

1 clock-passtransistor

Vbn

1 0 1 1

0

0

ϕ1

ϕ2

Viso

1 clock-passtransistor

Vbn

1 0 10 A

0 A

1 1+A

Page 27: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 27

Clock-powered logic• Eliminate pFETs and complements of clocks (smaller circuits, simpler

clock requirements)– Precharge transistors are hot-clocked nFETs

– Pass gates in latches are hot-clocked nFETs

• Move more capacitive loads to the clock-powered paths– Pass-transistor logic (e.g. in muxes) powered by clocks (not shown)

ER latch ER latch

ϕ1

ϕ2

Viso Viso

Cp

prechargedlogic block

Page 28: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 28

The AC-1 processor experiment• Objectives

– Design and implement low-power processor based on clock-powered logic and blip driver

– Evaluate significance of blip driver for low-power operation– Compare clock-powered processor to conventional, static CMOS

alternative

• Approach– Select 16-bit ISA– Design five-stage pipelined microarchitecture– Use energy-recovery latches to inject and retract energy at large

capacitive loads– Design logic and latches using “mostly-nMOS” circuit styles– Include both conventional and blip drivers (for evaluation purposes)– Desing a implementation of the same ISA using purely conventional

static-CMOS techniques

Page 29: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 29

AC-1 microarchitecture

• RISC ISA (Bunda’93)• 16-bit data• 16-bit instructions• 16 registers• Conventional 5-stage pipeline• Integer operations only (no multiply or divide)

PLA

+

+1

RD0

RD1

RF ALU

WRL

EG

G

control

fromIR

fromIR

3

0

fromIR01

fromIR

fromI_B

to PC_B

C

AB

C

AB

AB

A

B F

H

to A_B

to D_B

from D_B

ϕ2

ϕ1 ϕ2

ϕ1 ϕ2 ϕ1 ϕ2 ϕ1

Page 30: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 30

AC-1 processor

• Clock-powered logic

• Resonant clock driver

• 16-bit data & instructions

• 16 registers

• 0.5um n-well CMOS

• 5-stage pipeline

• ~13K transistors

Page 31: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 31

AC-1c : a conventional processor• Same target process

• Cascade library cells

• 30k transistors

• 5.5um2

• Uses gated clocks to reduce power dissipation

• Important differences– Custom vs library cells

– Optimizations

– Clock gating in AC-1c (40%)

Page 32: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 32

Processor core summary• AC-1

– First generation clock-powered processor– Mostly nMOS logic style– Hot clocks– Custom layout

• AC-1c– First generation conventional processor– Static CMOS– Cascade Epoch standard-cell library

• ACPL– Second-generation clock-powered processor– Static CMOS– Low-swing clocks– Custom low-power fixed-cell library– Cascade Epoch for place and route

• DC-1– Second-generation conventional processor– Static CMOS– Single-phase clocking– Custom low-power fixed-cell library– Cascade Epoch for place and route

Page 33: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 33

Processor comparison

0

0.2

0.4

0.6

0.8

1

1.2

1.4

0 20 40 60 80 100 120 140 160

ACPL, 6.5x energy recovery

DC-1

AC-1/c

AC-1, 6.5x energy recovery

AC-1, no energy recovery

ACPL, no energy recovery

mW

/MH

z

Frequency (MHz)

Page 34: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 34

Resonant clock drivers

• The difficulty with clock-powered logic is in the clock driver

• Resonant circuits offer the highest efficiency

• Low-power techniques that minimize the switched capacitance in real time do not work well with resonant clock drivers

– The clocks will vary in phase, amplitude, and pulse width

• Stabilizing the clock load maximizes the capacitive load

• It’s an open research topic

controller

Csmall

Cbig

?

resonant clock driver clock-powered chip

Page 35: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 35

Harmonic resonant charging

– Sinusoids• Easy and efficient to generate• Low overhead• Hard to work with, very “undigital”

– Staircase• Simple to generate and control• High overhead• Positive-going only

– Blips• Advantages of the sinusoids• Can be complementary• Positive-going only

– Harmonic resonant driver• We thought this would be hard (practically)• Now think it is highly doable

Page 36: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 36

Harmonic resonator design

Page 37: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 37

Harmonic resonator results

• 2nd Harmonic Resonator– 85% Energy efficiency

– 10% slew rate of total cycle time

• 4th Harmonic Resonator– 80% Energy efficiency

– 6% slew rate of total cycle time

Page 38: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 38

Harmonic resonator result

• As R becomes smaller, slew rate decreases while power increases

Page 39: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 39

Harmonic resonator result

• Frequency of output signal doesn’t change for 30% variation of load capacitance while energy efficiency suffers

Page 40: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 40

Future research

• Clock-powered logic and blip driver has been developed as a practical way of exploiting adiabatic charging for CMOS microprocessor

• How about Digital signal processor?– Where power goes in DSP?

• Bus transaction vs. computation

• Energy-recovery SRAM, DRAM, SAM– Capacitance variance is minimal because bitlines are

dual

• Driving clock network using harmonic resonator

Page 41: UCLA Lecture Energy-Recovery CMOS Design · UCLA EE215B jsmoon@usc.edu / athas@apple.com 11 Power equation • (1/2)CV2 is dissipated to charge the capacitor • (1/2)CV2 is dissipated

UCLA EE215B [email protected] / [email protected] 41

References• ACMOS Homepage (still alive)

– http://www.isi.edu/acmos

• For online paper archive

– http://www.isi.edu/acmos/acmosPapers.html

• Books

– Rabaey, Pedram Ed. “Low Power Design Methodology”

– Chandrakasan, Brodersen Ed. “Low Power CMOS Design”

• Most recent paper is published in

– JSSC, Nov. 2000 pp1561-1570


Recommended