Victor Moroz
July 12, 2016
Semicon West 2016
Design-Technology Co-Optimization
for 5nm Node and Beyond
© 2016 Synopsys, Inc. 2
Why Scaling?
When What scales? When does it end?
1965Moore’s Law (Fairchild):
Double transistor density every couple of years
• By 2043, there will be 1 atom per transistor
• But you can go up (3D IC)
• Great for planning and aligning the industry
1999
Claasen’s Law (Philips CEO):
Usefulness = log(Technology), or:
Technology = exp(Usefulness)
Forever?
2010
Koomey’s Law (Stanford Professor):
"at a fixed computing load, the amount of battery you need will fall by a factor of two every year and a half.“
• By the second law of thermodynamics and Landauer's principle, irreversible computing cannot continue to be made more energy efficient forever. As of 2011, computers have a computing efficiency of about 0.00001%. The Landauer bound will be reached in 2048. Thus, after 2048, the law could no longer hold.
• With reversible computing, however, Landauer's principle is not applicable. With reversible computing, though, computational efficiency is still bounded by the Margolus–Levitin theorem. By the theorem, Koomey's law has the potential to be valid for about 125 years.
© 2016 Synopsys, Inc. 3
Sca
ling
Tra
nsisto
r st
reng
th
Capa
cita
nce
© 2016 Synopsys, Inc. 4
Existing Early Design Rule Evaluation
GDS: Maxwell
or Laker
DRDR
1
DR
2
Fin
pitch
24
nm
22
nm
MG
ext.
15
nm
15
nm
Spac
er
7
nm
6
nm
Litho:
Sentaurus
Design rule
bad
bad
good
© 2016 Synopsys, Inc. 5
Existing Early Design Rule Evaluation
GDS: Maxwell
or Laker
DRDR
1
DR
2
Fin
pitch
24
nm
22
nm
MG
ext.
15
nm
15
nm
Spac
er
7
nm
6
nm
Litho:
Sentaurus
Design rule
bad
bad
good
• Missing process proximity effects outside of litho
• Missing process interaction with design
• Gives design window, but no guidance within the window
© 2016 Synopsys, Inc. 6
Proposed DTCO: Pre-Si PowerPerformanceArea Evaluation
GDS: Maxwell
or Laker
Process T Time Etch
Process1 480 C 25 min 12 nm
Process2 475 C 23 min 13 nm
DRDR
1
DR
2
Fin
pitch
24
nm
22
nm
MG
ext.
15
nm
15
nm
Spac
er
7
nm
6
nm
Litho:
Sentaurus
3D structure:
Process
Explorer
Switching
behavior:
TCAD
Design rule
bad
bad
good
Design ruleP
rocess c
onditio
nDesign rule
Pro
cess c
onditio
n
© 2016 Synopsys, Inc. 7
Proposed DTCO: Pre-Si PowerPerformanceArea Evaluation
GDS: Maxwell
or Laker
Process T Time Etch
Process1 480 C 25 min 12 nm
Process2 475 C 23 min 13 nm
DRDR
1
DR
2
Fin
pitch
24
nm
22
nm
MG
ext.
15
nm
15
nm
Spac
er
7
nm
6
nm
Litho:
Sentaurus
3D structure:
Process
Explorer
Switching
behavior:
TCAD
Design rule
bad
bad
good
Design ruleP
rocess c
onditio
nDesign rule
Pro
cess c
onditio
n
© 2016 Synopsys, Inc. 8
Proposed DTCO: Pre-Si PowerPerformanceArea Evaluation
GDS: Maxwell
or Laker
Process T Time Etch
Process1 480 C 25 min 12 nm
Process2 475 C 23 min 13 nm
DRDR
1
DR
2
Fin
pitch
24
nm
22
nm
MG
ext.
15
nm
15
nm
Spac
er
7
nm
6
nm
Litho:
Sentaurus
3D structure:
Process
Explorer
Switching
behavior:
TCAD
Design rule
bad
bad
good
Design ruleP
rocess c
onditio
nDesign rule
Pro
cess c
onditio
n
• Provides quick PPA estimate
• Includes process effects
• Enables process-design feedback
• Reasonable TAT
© 2016 Synopsys, Inc. 9
2-Input NAND Standard Library Cell
• 2-input NAND library cell with a load of:
– Fan-out of 2
– Metal wire that is 70 metal pitches long
• Cload = 2*Cpin + Cwire
• Cwire = 0.34 fF
• Typical Cload is 1 fF to 2 fF
A
BQ
Cload
© 2016 Synopsys, Inc. 10
Layout: 5nm 2-NAND Cell, 9 Tracks Tall
fins
Dummy
gate
PMOS
NMOS
gates
Gate
contact
S/D
contact
M1
Via1
M2 (PWR)
Via2
3 Gate Pitches wide
9 M
eta
l P
itch
es t
all
M2 (GND)
GP = 32nmMP = 24nmFP = 18nm
© 2016 Synopsys, Inc. 11
3D Library Cell in Process Explorer
M2
M1
M0
Transistors
© 2016 Synopsys, Inc. 12
Power-Performance-Area Evaluation in TCAD
• Transient analysis of the switching behavior in Sentaurus-Device
• Time delay is the averaged pull-up and pull-down delays
• Rigorous current flow analysis in the 3D structure
-0.1
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0 2E-11 4E-11 6E-11 8E-11 1E-10
Po
ten
tia
l, V
Time, s
Input
Low_ROutput
3D current crowding
© 2016 Synopsys, Inc. 13
0
0.5
1
1.5
2
0 5 10 15 20 25
En
erg
y p
er
sw
itc
h, f
J
Switching delay, ps
FF 2x
FF 2x, low MG R
FF 1x
NW 2x2
NW 2x1
NW 1x2
5nm Technology Evaluation: PowerPerformanceArea @TCAD
Load: Fan-out of 2 plus 70 pitches long BEOL wire
3D current flow in TCAD
2-NAND logic cell
Reference 2-fin FF cell
© 2016 Synopsys, Inc. 14
0
0.5
1
1.5
2
0 5 10 15 20 25
En
erg
y p
er
sw
itc
h, f
J
Switching delay, ps
FF 2x
FF 2x, low MG R
FF 1x
NW 2x2
NW 2x1
NW 1x2
5nm Technology Evaluation: PowerPerformanceArea @TCAD
Load: Fan-out of 2 plus 70 pitches long BEOL wire
3D current flow in TCAD
2-NAND logic cell
10%
Low MG resistance:10% power reduction
© 2016 Synopsys, Inc. 15
Insight Into Metal Gate Resistance Effect
• Due to metal resistance, the input signal takes time to get to the fins
• Different parts of the gate experience different biases at any given time
• This is a new effect, due to the lack of space inside MG for tungsten fill, so MG resistivity increases from ~20 mW.cm to ~200 mW.cm
• It gets worse for 3 and 4 fins
2D cut across the gate
PMOS Gate
NMOS Gate
NMOS fins
PMOS fins
2-NAND cell
Electrostatic potential map
Input signal arrives here first
delay
eve
n mor
e d
elay
© 2016 Synopsys, Inc. 16
0
0.5
1
1.5
2
0 5 10 15 20 25
En
erg
y p
er
sw
itc
h, f
J
Switching delay, ps
FF 2x
FF 2x, low MG R
FF 1x
NW 2x2
NW 2x1
NW 1x2
5nm Technology Evaluation: PowerPerformanceArea @TCAD
Load: Fan-out of 2 plus 70 pitches long BEOL wire
3D current flow in TCAD
2-NAND logic cell
10%
30%
Fin depopulation from 2 to 1:30% power reduction
© 2016 Synopsys, Inc. 17
0
0.5
1
1.5
2
0 5 10 15 20 25
En
erg
y p
er
sw
itc
h, f
J
Switching delay, ps
FF 2x
FF 2x, low MG R
FF 1x
NW 2x2
NW 2x1
NW 1x2
5nm Technology Evaluation: PowerPerformanceArea @TCAD
Load: Fan-out of 2 plus 70 pitches long BEOL wire
3D current flow in TCAD
2-NAND logic cell
10%
30%
44%
Nano-wires: 44% better
© 2016 Synopsys, Inc. 18
0
0.5
1
1.5
2
0 5 10 15 20 25
En
erg
y p
er
sw
itc
h, f
J
Switching delay, ps
FF 2x
FF 2x, low MG R
FF 1x
NW 2x2
NW 2x1
NW 1x2
5nm Technology Evaluation: PowerPerformanceArea @TCAD
Load: Fan-out of 2 plus 70 pitches long BEOL wire
3D current flow in TCAD
2-NAND logic cell
10%
30%
44%
50%
Nano-wire depopulation: 50% power reduction
© 2016 Synopsys, Inc. 19
0
0.5
1
1.5
2
0 5 10 15 20 25
En
erg
y p
er
sw
itc
h, f
J
Switching delay, ps
FF 2x
FF 2x, low MG R
FF 1x
NW 2x2
NW 2x1
NW 1x2
5nm Technology Evaluation: PowerPerformanceArea @TCAD
Load: Fan-out of 2 plus 70 pitches long BEOL wire
3D current flow in TCAD
2-NAND logic cell
10%
30%
44%
50%
FinFET
NW
2 fins
1 fin
© 2016 Synopsys, Inc. 20
0
0.5
1
1.5
2
0 5 10 15 20 25
En
erg
y p
er
sw
itc
h, f
J
Switching delay, ps
FF 2x
FF 2x, low MG R
FF 1x
NW 2x2
NW 2x1
NW 1x2
5nm Technology Evaluation: PowerPerformanceArea @TCAD
Load: Fan-out of 2 plus 70 pitches long BEOL wire
3D current flow in TCAD
2-NAND logic cell
10%
30%
44%
50%
FinFET
NW
2 fins
1 fin
0.000
0.200
0.400
0.600
0.800
1.000
1.200
FF 2xFF 1x
NW 2x2NW 2x1
NW 1x2
0.7
96
0.4
95
0.5
41
0.3
68
0.3
64
1
0.5
1.09
0.54 0.54
Cpin, fF
Ion, normalized
© 2016 Synopsys, Inc. 21
0
0.5
1
1.5
2
0 5 10 15 20 25
En
erg
y p
er
sw
itc
h, f
J
Switching delay, ps
FF 2x
FF 2x, low MG R
FF 1x
NW 2x2
NW 2x1
NW 1x2
5nm Technology Evaluation: PowerPerformanceArea @TCAD
Load: Fan-out of 2 plus 70 pitches long BEOL wire
3D current flow in TCAD
2-NAND logic cell
10%
30%
44%
50%
FinFET
NW
2 fins
1 fin
0.000
0.200
0.400
0.600
0.800
1.000
1.200
FF 2xFF 1x
NW 2x2NW 2x1
NW 1x2
0.7
96
0.4
95
0.5
41
0.3
68
0.3
64
1
0.5
1.09
0.54 0.54
Cpin, fF
Ion, normalized50%
40%
© 2016 Synopsys, Inc. 22
0
0.5
1
1.5
2
0 5 10 15 20 25
En
erg
y p
er
sw
itc
h, f
J
Switching delay, ps
FF 2x
FF 2x, low MG R
FF 1x
NW 2x2
NW 2x1
NW 1x2
5nm Technology Evaluation: PowerPerformanceArea @TCAD
Load: Fan-out of 2 plus 70 pitches long BEOL wire
3D current flow in TCAD
2-NAND logic cell
10%
30%
44%
50%
FinFET
NW
2 fins
1 fin
0.000
0.200
0.400
0.600
0.800
1.000
1.200
FF 2xFF 1x
NW 2x2NW 2x1
NW 1x2
0.7
96
0.4
95
0.5
41
0.3
68
0.3
64
1
0.5
1.09
0.54 0.54
Cpin, fF
Ion, normalized
~CV/I
~CV
2
50%
40%
© 2016 Synopsys, Inc. 23
0
0.5
1
1.5
2
0 5 10 15 20 25
En
erg
y p
er
sw
itc
h, f
J
Switching delay, ps
FF 2x
FF 2x, low MG R
FF 1x
NW 2x2
NW 2x1
NW 1x2
5nm Technology Evaluation: PowerPerformanceArea @TCAD
Load: Fan-out of 2 plus 70 pitches long BEOL wire
3D current flow in TCAD
2-NAND logic cell
10%
30%
44%
50%
FinFET
NW
2 fins
1 fin
0.000
0.200
0.400
0.600
0.800
1.000
1.200
FF 2xFF 1x
NW 2x2NW 2x1
NW 1x2
0.7
96
0.4
95
0.5
41
0.3
68
0.3
64
1
0.5
1.09
0.54 0.54
Cpin, fF
Ion, normalized
~CV/I
~CV
2
•MOL capacitance engineering rules!
© 2016 Synopsys, Inc. 24
Why Variability is Important
#
Ion
• Technology A
Design spec:
Nominal – 3s
Nominal
Performance A• What matters is
nominal – 3s
• Therefore variability affects chip area
© 2016 Synopsys, Inc. 25
Why Variability is Important
#
Ion
• Technology A
• Technology B
NominalDesign spec:
Nominal – 3s
Nominal
Performance B
Performance A• What matters is
nominal – 3s
• Therefore variability affects chip area
• There is no “good enough” variability –the target is zero!
© 2016 Synopsys, Inc. 26
Fin Depopulation Adds Pressure to Variability Scaling
14nm
10nm
7nm
5nm
sVt ~ 1 / sqrt(# of fins)
© 2016 Synopsys, Inc. 27
Fin Depopulation Adds Pressure to Variability Scaling
14nm
10nm
7nm
5nm
sVt ~ 1 / sqrt(# of fins)
• Other considerations:
• Electromigration
• Power density
• Fin pitch
© 2016 Synopsys, Inc. 28
Variability Evolution: Planar to FinFET
0
10
20
30
40
50
60
70
Sig
ma V
t, m
V
RDF p
RDF n
n-poly
HKMG
L CD&LER
W CD&LER
fin height
sigma pVt
sigma nVt
• Encouraging trend
• Several “reset buttons”
• There is nothing that can be done to eliminate RDF, so it kept getting worse for planar
• The FinFETs are more sensitive to geometry, which can be better controlled by the equipment
V. Moroz, WMED 2013
© 2016 Synopsys, Inc. 29
Variability Evolution: Planar to FinFET
0
10
20
30
40
50
60
70
Sig
ma V
t, m
V
RDF p
RDF n
n-poly
HKMG
L CD&LER
W CD&LER
fin height
sigma pVt
sigma nVt
V. Moroz, WMED 2013
Measured data from S. Natarajan et al., IEDM 2014
The lower Sigma Vt values here are due to low Vt process
© 2016 Synopsys, Inc. 30
Planar to FinFET Transition
• FinFETs improve variability
• Planar MOSFETs suffered from RDF
• FinFETs are insensitive to channel doping RDF
0
10
20
30
40
50
60
110100
Sig
ma V
t, m
V
Technology node, nm
Variability Evolution
Total
Planar FinFET NW
130
22
147
1065
© 2016 Synopsys, Inc. 31
HKMG Grains Introduce Gate Workfunction Variation
• At 10nm and 7nm nodes, HKMG becomes the dominant variability mechanism
• Introduction of amorphous MG at 7nm would solve this issue
0
10
20
30
40
50
60
110100
Sig
ma V
t, m
V
Technology node, nm
Variability EvolutionTotal
HKMG
7
10
© 2016 Synopsys, Inc. 32
Geometry
• Planar MOSFETs are insensitive to geometry
• FinFETs and NW are more sensitive to geometry
• NW are less sensitive to L than FinFET, but more sensitive to W
• This data is based on 1 geometry sigma staying at 5% of CD0
10
20
30
40
50
60
110100
Sig
ma V
t, m
V
Technology node, nm
Variability Evolution Total
HKMG
Geometry
53
2
© 2016 Synopsys, Inc. 33
RDF (Random Dopant Fluctuations)
• Planar MOSFETs suffered from RDF
• FinFETs are insensitive to channel doping RDF
0
10
20
30
40
50
60
110100
Sig
ma V
t, m
V
Technology node, nm
Variability Evolution Total
HKMG
RDF
Geometry
© 2016 Synopsys, Inc. 34
FinFET to Nanowire Transition: Counting Particles
• NW variability depends on how many S/D dopants get into the channel
0
10
20
30
40
50
60
110100
Sig
ma V
t, m
V
Technology node, nm
Variability EvolutionTotal
HKMG
RDF
Geometry
KMC
0 dopants
1 dopant
2
3
S D
© 2016 Synopsys, Inc. 35
Summary
• 5nm technology has multiple trade-offs in transistor architecture and MOL RC that require holistic engineering
• Ideal variability is zero, and fin depopulation adds even more pressure
• Several key factors suggest fin depopulation towards 1 fin and beyond (i.e. fractional fins a.k.a. nano-wires):
– PPA
–MOL RC
–Electrostatics (DIBL)
–Electromigration and power density