RESISTIVE SWITCHING MEMORY FOR NON-VOLATILE … › file › druid:xh257vm1382...Conventional...

iii

RESISTIVE SWITCHING MEMORY FOR NON-VOLATILE STORAGE AND

NEUROMORPHIC COMPUTING

A DISSERTATION

SUBMITTED TO THE DEPARTMENT OF ELECTRICAL ENGINEERING

AND THE COMMITTEE ON GRADUATE STUDIES

OF STANFORD UNIVERSITY

IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF

DOCTOR OF PHILOSOPHY

Shimeng Yu

July, 2013

http://creativecommons.org/licenses/by-nc/3.0/us/

This dissertation is online at: http://purl.stanford.edu/xh257vm1382

© 2013 by Shimeng Yu. All Rights Reserved.

Re-distributed by Stanford University under license with the author.

This work is licensed under a Creative Commons Attribution-Noncommercial 3.0 United States License.

ii



http://purl.stanford.edu/xh257vm1382

I certify that I have read this dissertation and that, in my opinion, it is fully adequatein scope and quality as a dissertation for the degree of Doctor of Philosophy.

Philip Wong, Primary Adviser


Yoshio Nishi


S Simon Wong

Approved for the Stanford University Committee on Graduate Studies.

Patricia J. Gumport, Vice Provost Graduate Education

This signature page was generated electronically upon submission of this dissertation in electronic format. An original signed hard copy of the signature page is on file inUniversity Archives.

iii

iv

Abstract

As CMOS scaling will soon approach its physical limit, it is necessary to consider new

computing paradigms that continue to improve computing system performance. We are

entering the “Big Data” era which puts data sets remotely at the data center for

computing, and the size of data sets is rapidly exceeding exascale. The functionality and

performance of today’s computing system are increasingly dependent on the

characteristics of the memory sub-system. Conventional memories technologies such as

SRAM, DRAM, and FLASH are facing formidable device scaling challenges. Emerging

memory technologies may bring enormous opportunities for architecture evolution or

revolution of the computing system in the future.

Among various emerging memory technologies, oxide based resistive switching

memory (RRAM) stands out due to its simple structure, low switching voltage (<3 V),

fast switching speed (<10 ns), excellent scalability (<10 nm), and great compatibility

with the CMOS technology. The observation that oxides can switch between an

insulating state and a conductive state can be dated back to 1960s. Research activities

have revived since 2004 with more complete data sets of the memory characteristics

reported such as device reliability and array integration. Through years’ efforts from both

academia and industry, Gb-scale prototype RRAM chip has been demonstrated so far.

During the development of the RRAM technology, key challenges include unclear

physics of resistive switching in oxides, relatively poor uniformity and large variability of

v

the switching parameters. And at this moment today, the central question should be what

application spaces are for the RRAM technology to play a disruptive role. The works in

this dissertation aim to address those issues above.

The physical mechanism of resistive switching is generally attributed to the

conductive filament (made up of oxygen vacancies) formation and rupture in the oxide

due to field assisted oxygen ion migration. As a model system for device physics study,

HfOx based RRAM devices were fabricated and characterized. To identify the conduction

mechanism, various electrical characterization techniques such as I-V measurements at

various temperatures, low-frequency noise measurements, and AC conductance

measurements were employed. It was suggested that the trap-assisted-tunneling is the

dominant conduction mechanism. In order to explore the oxygen ion migration dynamics,

pulse switching measurements were performed. An exponential voltage-time relationship

was found between the switching time and the applied voltage.

To obtain a first-order understanding of the variability of resistive switching, a Kinetic

Monte Carlo (KMC) numerical simulator was developed. The

generation/recombination/migration probabilities of oxygen vacancies and oxygen ions

were calculated, and the conductive filament configuration was updated stochastically

according to those probabilities. The KMC simulation can reproduce many experimental

observations in the DC I-V sweep, pulse switching, endurance cycling, and retention

baking, etc. The tail bits in the resistance distribution are attributed to the oxygen vacancy

vi

left over in the gap region due to a competition between the oxygen vacancy generation

and recombination. To enable circuit and system development using RRAM, a compact

device model was developed, which can be employed in many commonly available

circuit simulators using the SPICE engine.

One promising application for the RRAM technology is to serve as the synaptic device

for the hardware implementation of neuromorphic computing. The gradual resistance

modulation capability in RRAM was utilized for emulating analog synapses, and the

stochastic switching behavior in RRAM was utilized for emulating binary synapses. In

order to evaluate the effectiveness of analog synapses and binary synapses, a simulation

of winner-take-all network was performed based on the parameters extracted from the

experiments. The simulation suggests that the orientation classification can be effectively

realized using both analog synapses and binary synapses. The system functionality is

found to be robust against the RRAM device variability due to the adaptive algorithm and

parallelism of the neural network.

vii

Acknowledgments

First and foremost, I would like to express my deepest gratitude to my advisor, Prof.

H.-S. Philip Wong, who is a great mentor and guided me through my PhD journey. He

always encourages me to explore new ideas, and I truly appreciate the degrees of freedom

that he offered on conducting the research. During my four years at Stanford, Prof. Wong

has influenced me in many aspects with his optimistic attitude and passionate personality.

Not only I learn from him how to do the research, but also I learn from him to how to

manage a group. Prof. Wong has provided me with many great opportunities such as

research collaborations and summer internships. And he is very supportive of my pursuit

of the academic career path and helps me a lot in my faculty position applications, from

editing my research statement to writing a strong recommendation letters, from offering

interview tips to submitting joint proposal to funding agencies. Without his unselfish

support, I cannot achieve my success so far.

I would like to acknowledge my other dissertation reading committee members: Prof.

Yoshio Nishi and Prof. Simon Wong, for their valuable suggestions and advises on my

work. Prof. Nishi is very kind and amiable and always willing to help when I meet

problems. Prof. Simon Wong is always sharp at the key issues in the research and

provided me a lot of feedbacks on my work.

viii

I would like to acknowledge other faculty in Stanford as well. I wish to thank Prof.

Ada Poon and Prof. Subhasish Mitra for serving as my oral defense committee. And

especially I would like to thank Prof. Mitra for his support in my future academic career.

I would also like to thank Prof. Roger Howe and Prof. Krishna Saraswat for their help in

my study at Stanford.

I am grateful to my collaborators, Prof. Jinfeng Kang and his group at Peking

University, on many joint projects: Prof. Kang is my undergraduate research advisor,

who brought me to the field of my study a few years ago. The technical discussions with

Dr. Bin Gao are always inspiring and rewarding. Yexin Deng is a diligent undergraduate

researcher who works with me on the 3D cross-point array analysis.

I am grateful to my mangers and mentors during my summer internships: Dr. Jurczak

Malgorzata, Dr. Jorge Kittl, and Dr. Yangyin Chen at IMEC, and Dr. Mark Ritter, Dr.

Kailash Gopalakrishnan, Dr. Jing Li at IBM. They either wrote recommendation letters

for me or helped me settle down at new places and guided me on intern projects.

I am grateful to my references who wrote recommendation letters for my faculty

position applications: Prof. Wei Lu at University of Michigan, Prof. Daniele Ielmini at

Politecnico di Milano, and Dr. Fred Chen at ITRI in Taiwan;

I also wish to thank my colleagues at Stanford for their indispensable assistance on my

projects: Ximeng Guan on the device modeling, Yi Wu, Hong-Yu Chen and Yang Chai

on the device fabrication, Jiale Liang and Zizhen Jiang on the cross-point array analysis,

ix

Rakesh Jeyasingh on the device testing, Duygu Kuzum on the synaptic device design, etc.

Without them, I would not have been able to accomplish my PhD work so smoothly. And

I am thankful for the useful discussions with the lab mates in other groups: Suyog Gupta,

Zhiping Zhang, Liang Zhao, etc. I would also thank our group secretary: Fely Barrera,

who helps me with all kinds of financial and administrative issues.

I truly appreciate my friends who make my life at Stanford so colorful and wonderful,

and I will treasure those beautiful moments forever: Ze Yuan, Yu Zhao, Jieying Luo,

Weiruo Zhang, Yangsen Kang, Mengying Zhang, Hangyu Li, Chao Ni, Yutian Lu,

Liangliang Zhang, Lan Wei, Xiangyu Chen, Chi-Shuen Lee, He Yi, Hai Wei, Xiao Zhang,

Chen Chen, and Nuo Xu. Special thanks are to my long-time old friends: Cong Xu,

Xiaomu Wu and Zhaoyi Kang, they have stood by my side throughout all the peaks and

valleys in my past years.

Last but not least, I would like to express my deepest love to my family members: my

father and mother, my grandfather and grandmother, my uncles and aunts, my cousins.

They have shown me unconditional and unbounded love, and they always encourage me

to pursue my dream. Their support is the strongest driving force for me to go forward.

x

Table of Contents

List of tables ...................................................................................................................... xii

List of figures ................................................................................................................... xiii

Chapter 1 Introduction .........................................................................................................1

1.1 Motivation for Emerging Memory Technologies ....................................................1

1.2 Resistive Switching Phenomenon in Oxides ...........................................................4

1.3 Development and Applications of RRAM technology ..........................................10

1.4 Thesis overview .....................................................................................................19

Chapter 2 Characterization of Resistive Switching in Oxides ..........................................22

2.1 Physical Mechanism of Resistive Switching in Binary Oxides .............................22

2.2 HfOx Based RRAM Device Fabrication and Characterization ..............................28

2.3 Electron Conduction Process in HfOx Based RRAM ............................................38

2.3.1 I-V Measurements at Various Temperatures ................................................38

2.3.2 Low Frequency Noise (LFN) Measurement .................................................43

2.3.3 AC Conductance Measurement ....................................................................52

2.4 Ion Dynamics in HfOx Based RRAM ....................................................................60

2.4.1 Voltage-Time Relationship in Switching Dynamics ....................................60

2.4.2 Energy-Efficient Programming Scheme for Multi-bit Operation .................63

2.5 Summary ................................................................................................................68

Chapter 3 RRAM Device Modeling and Simulation .......................................................69

3.1 Kinetic Monte Carlo (KMC) Modeling .................................................................69

3.1.1 TAT solver ....................................................................................................71

3.1.2 Resistor Network ..........................................................................................75

3.1.3 KMC Module for Ionic Process ....................................................................78

3.1.4 Simulation of Current Overshoot ..................................................................81

3.1.5 Simulation of Variability ..............................................................................86

xi

3.1.6 Simulation of Reliability ...............................................................................91

3.2 Compact Modeling.................................................................................................94

3.3 Discussions and Summary .....................................................................................98

Chapter 4 Oxide Based Synaptic Device for Neuromorphic Computing .......................103

4.1 Background ..........................................................................................................103

4.2 Synaptic Plasticity and Learning .........................................................................105

4.3 Analog Synapse and Binary Synapse for WTA Network ....................................113

4.3.1 Analog Synapse for Depression Learning ..................................................114

4.3.2 Binary Synapse for Stochastic Learning .....................................................117

4.3.3 Winner-Take-All Network Simulation .......................................................123

4.4 Performance Metrics for Synaptic Device ...........................................................134

4.5 Summary ..............................................................................................................138

Chapter 5 Conclusion and Outlook .................................................................................139

5.1 Summary of contribution .....................................................................................139

5.2 Future work ..........................................................................................................141

Bibliography ....................................................................................................................143

Author’s Biography .........................................................................................................171

Publications ......................................................................................................................172

xii

List of tables

Number Page

Table 1.1 Device Characteristics of conventional and emerging memory technologies .....4

Table 1.2 Summary of the materials that have been used for binary oxide RRAM.

Metals of the corresponding binary oxides used for the resistive switching

layer are colored in yellow. Metals used for the electrodes are colored in

blue. ....................................................................................................................6

Table 1.3 The switching modes for various metal oxide RRAM devices with different

electrode combination. .......................................................................................9

Table 1.4 Representatives of selection devices for RRAM in the literature ......................14

Table 2.1 The characteristics of the HfOx/AlOx RRAM device. .......................................38

Table 4.1 Summary of the desirable performance metrics for synaptic devices. ............137

xiii

List of figures

Number Page

Figure 1.1 The memory hierarchy in today’s system, which has a pyramid structure.

SRAM, DRAM, FLAH are the mainstream technologies serving for

caches, main memory, and solid-state-drive, respectively. ...............................2

Figure 1.2 (a) Schematic of metal-insulator-metal structure for oxide RRAM, and

schematic of metal oxide memory’s I-V curves, showing two modes of

operation: (b) unipolar and (c) bipolar. ..............................................................8

Figure 1.3 The cross-point memory architecture a) without cell selection elements and

(b) with cell selection elements........................................................................12

Figure 1.4 Schematic of the proposed 3D cross-point architecture using the vertical

RRAM cell. The vertical RRAM cells are formed at the intersections of

each pillar electrode and each plane electrode: the resistive switching

oxide layer surrounds the pillar electrode and is also in contact with the

plane electrode. To enable the random access of each memory cell, three-

dimensional decoding is needed through WL (decoding in z-direction), BL

(decoding in y-direction) and SL of the gate of the vertical MOSFET

(decoding in x-direction)..................................................................................17

Figure 2.1 Schematic of the possible electron conduction paths through a

metal/oxide/metal stack. (1) Schottky emission; (2) F-N tunneling; (3)

direct tunneling; (4) tunneling from cathode to traps; (5) emission from

trap to conduction band, which is essentially the Poole-Frenkel emission;

(6) F-N like tunneling from trap to conduction band; (7) trap to trap

hopping or tunneling; and (8) tunneling from traps to anode. .........................26

Figure 2.2 Schematic of the switching mechanism of oxide RRAM. ...............................28

xiv

Figure 2.4 The HRS and LRS resistance distribution in (a) and the switching voltage

distribution in (b) of the HfOx/AlOx bi-layer samples and the HfOx single

layer samples obtained by 200 DC sweep cycles. ...........................................32

Figure 2.5 The transient voltage/current waveform during (a) RESET transition and

(b) SET transition. ............................................................................................34

Figure 2.6 The HRS and LRS resistance degradation during the SET/RESET cycling. ...36

Figure 2.7 The I-V characteristics after 5×105 cycles........................................................36

Figure 2.8 The stability of the device for different resistance states measured at an

elevated temperature (100 °C). ........................................................................37

Figure 2.9 Poole-Frenkel emission model fitting results: (a) In(I/V) vs. sqrt(V) for

electron injection from TiN under a negative bias. (b) In(I/V) vs. 1/kT to

extract the activation energy (Ea) (c) Extrapolating Ea to zero bias, the trap

energy (~0.08 eV) is extracted from the intersection on the y-axis, and the

dielectric constant (~79) is extracted from the slope. These two parameters

are unreasonable compared to their known values. .........................................41

Figure 2.10 Temperature dependency (In(I/V) vs. 1/kT) for multi-level resistance

states at both positive and negative biases. ......................................................42

Figure 2.11 Schematic of the LFN measurement set-up. ...................................................44

Figure 2.12 The RRAM device noise, measurement system’s noise, and equivalent

resistor’s thermal noise for LRS (a) and HRS (b). ...........................................46

Figure 2.13 The relative noise current fluctuation in the time domain for different

resistance states. ...............................................................................................47

Figure 2.14 The normalized PSD (Si/I2) in the frequency domain for different states. .....48

Figure 2.15 The PSD (Si) as a function of the squared DC bias for LRS (the inset) and

HRS respectively. ............................................................................................48

Figure 2.16 The normalized PSD (Si/I2) as a function of the DC bias voltage. .................49

xv

Figure 2.17 Lorentzian function 4τ/(1 + ω2τ2) for different τ. Single Lorentzian is

1/f 2, and the envelope of multiple Lorentzian is 1/f. The cutoff frequency

corresponding to a shortest electrode-to-trap tunneling gap distance (~2

nm) is shown (the dash line). ...........................................................................52

Figure 2.18 The Nyquist plot (-Im(Z) vs Re(Z)) for HRS and LRS. .................................54

Figure 2.19 The total conductance spectrum in LRS and HRS under various DC

biases. ...............................................................................................................55

Figure 2.20 The total conductance spectrum in different resistance levels in HRS. .........55

Figure 2.21 The AC conductance spectrum (a) for various DC biases at a particular

HRS level and (b) for different HRS levels at a particular DC bias. ...............57

Figure 2.22 Schematic of the DC and AC conduction processes in RRAM device. .........60

Figure 2.23 Measured voltage-time relationship: pulse amplitudes needed to

successfully trigger the SET/RESET switching at the some fixed pulse

widths. The success of the trigger is defined to meet pre-defined target

LRS/HRS resistances (20 kΩ/1 MΩ). ..............................................................63

Figure 2.24 The DC I-V characteristics of HfOx/AlOx RRAM by a different RESET

stop voltages (-2.1 V, -2.7 V, and -3.3 V). ......................................................64

Figure 2.25 (a) Three levels of HRS obtained by fixed pulse width (50 ns) but

different pulse amplitudes (-2.3 V, -2.6 V, -2.9 V). (b) Similar three levels

of HRS obtained by fixed pulse amplitude (-2.3 V) but different pulse

widths (50 ns, 500 ns, 5 µs). ............................................................................66

Figure 2.26 Transient voltage/current waveform for the two RESET programming

schemes. The first scheme: -2.3 V/50 ns; The second scheme: -2 V/500 ns.

The initial resistance is ~10 kΩ, and the final resistance is ~50 kΩ. The

energy consumption for the first scheme (-2.3 V/50 ns) is about 7.4 pJ and

for the second scheme (-2 V/500 ns) is about 60.9 pJ. ....................................67

Figure 3.1 The KMC simulation flow................................................................................70

xvi

Figure 3.2 The TAT process in the oxide RRAM. ............................................................71

Figure 3.3 I-V of 1D Vo chain for different gap distances simulated by TAT solver.

The inset shows current exponentially decreases with increasing gap

distance, which is the main cause of the HRS variation. .................................73

Figure 3.4 The electron occupancy probability in the traps in the 1D Vo chain (a) for

a complete Vo chain connecting both electrodes (representing LRS) and

(b) for a ruptured Vo chain with a gap (representing HRS). ...........................74

Figure 3.5 Schematic of the resistor network: each node in the network represent an

atom site in the oxide matrix. The node can be a red Vo site or a blue

lattice oxygen (OL) site. ...................................................................................75

Figure 3.6 An example of the simulated local electric potential in HRS with a gap

between top electrode and residual CF. ...........................................................76

Figure 3.7 An example of the simulated local temperature in LRS when CF connects

both electrodes. ................................................................................................77

Figure 3.8 Simulated I-V curve of a 1D Vo chain in RRAM cell with 1 nm tunneling

gap considering multiple conduction mechanisms. .........................................78

Figure 3.10 Simulated I-V with overshoot: 1) FORMING with RC time constant

τ=100 ns; 2) FORMING with τ=10 ns; 3) SET with τ=100 ns. The

compliance is 10 µA. .......................................................................................82

Figure 3.11 Simulated voltage drop on the RRAM as a function of time during the

overshoot period...............................................................................................82

Figure 3.12 Simulated current during the overshoot period. The compliance is 10 µA. ...83

Figure 3.13 The evolution of the CF shape for the FORMING/RESET/SET cycles

using KMC simulator for (a) fresh sample; (b) Onset moment of

FORMING with τ=100 ns; (c) LRS after FORMING with τ=100 ns; (d)

Onset moment of FORMING with τ=10 ns; (e) LRS after FORMING with

xvii

τ=10 ns; (f) RESET with τ=100 ns; (g) Onset moment of SET with τ=100

ns; (h) LRS after SET with τ=100 ns. Pink dots are Vo. Blue dots are O2-. ....86

Figure 3.14 Simulated I-V curves during the repeated 100-cycle DC sweep. ...................87

Figure 3.15 Simulated (a) resistance distribution and (b) switching distribution during

the repeated 100-cycle DC sweep. ...................................................................88

Figure 3.16 An example of the existence of Vo (highlighted by red circle) in the gap

region which causes a tail bit in HRS. .............................................................90

Figure 3.17 The transient current waveform measured during the RESET with the last

current jump indicating a Vo left-over in the gap region. ................................91

Figure 3.18 Simulated endurance cycling at 125 °C. .........................................................92

Figure 3.19 The Vo configuration for the final failure state (stuck at LRS). ....................92

Figure 3.20 Statistically simulated LRS retention of 100 RRAM cells @150 °C. ............93

Figure 3.21 Simulated LRS failure time distribution at three baking temperatures. .........94

Figure 3.22 Schematic of conductive filament with oxygen vacancies. The tunneling

gap distance g determines the device resistance. .............................................95

Figure 3.23 The I-V fitting of multi-level resistance states by varying the gap

tunneling distance in the compact model. ........................................................97

Figure 3.24 The gradual RESET transition curves by pulse train with different

amplitudes. .......................................................................................................98

Figure 4.1 An analogy between the biological synapse and the artificial oxide

synaptic device. The vision is that the parallel neural network can be

emulated by the cross-point RRAM array for large-scale integration. ..........105

Figure 4.2 The asymmetric form of STDP states that the synaptic weight depends on

the relative timing of pre- and post-synaptic spikes [135]. ΔG = (Gafter-

Gbefore)/Gbefore. Gbefore is the conductance before the pre and post spike pair,

and Gafter is the conductance after the pre and post spike pair. ......................107

xviii

Figure 4.3 The up-down cycling of the HfOx/AlOx synaptic device for (a) the first 100

cycles and for (b) the 100 cycles after 105 endurance testing cycles. The

pulse widths were fixed to be 50 ns. The pulse sequence is 5 pulses

increased from -2.2 V to -2.6 V with a step -0.1 V and then 5 pulses with

increased from 1.4 V to 1.8 V with a step 0.1 V............................................109

Figure 4.4 The HfOx/AlOx synaptic device resistance modulation by the pulse

amplitude starting from an initial state with an intermediate resistance

value (200 kΩ ~ 300 kΩ, the shaded region in the plot). Each error bar

consists of 50 independent tests. ....................................................................111

Figure 4.5 Spike-timing-dependent plasticity (STDP) realization schemes developed

with time-division multiplexing and pulse amplitude modulation. The

pulse amplitudes for the pre-spike are -1.4 V, 1 V, 0.9 V, 0.8 V, 0.7 V, 0.6

V, consecutively, and for the post-spike are -1V, 1.4 V, 1.3 V, 1.2 V, 1.1

V, 1V, consecutively. .....................................................................................112

Figure 4.6 The STDP-like curve calculated with the data in Figure 4.4 employing the

signaling schemes in Figure 4.5 .....................................................................112

Figure 4.7 The DC I-V characteristic of the TiOx/HfOx/TiOx/HfOx synaptic device. .....114

Figure 4.8 The depression learning process by 400 consecutive identical RESET

pulses for 5 different devices. The initial resistance states are chosen to be

~500 Ω and ~20 kΩ, respectively. .................................................................115

Figure 4.9 The depression learning process at several milestones during the 1000

cycle endurance cycling test. 1 cycle includes 400 consecutive identical

RESET pulses. ...............................................................................................116

Figure 4. 10 Dependence of the energy per spike on the initial resistance states for the

training process. If starting at ~ 20 kΩ, the maximum energy per spike

drops below 1 pJ. ...........................................................................................116

xix

Figure 4.11 Simulated depression learning process starting from ~500 Ω by varying

the pulse amplitudes, mimicing the the experimental data in Figure 3.24. ...117

Figure 4.12 Measured SET/RESET continuous cycling with different SET pulse

amplitudes (a) +1.9 V/10 ns, (b) +1.6 V/10 ns, (c) +1.3 V/10 ns,

respectively. The SET process becomes stochastic under weak

programming conditions. ...............................................................................120

Figure 4.13 Measured statistical distribution of pulse amplitude required for

triggering the SET switching from the off-state. In (a), the probability of

SET switching is measured from one representative device for 100 cycles.

In (b), 50 different devices on the wafer were measured with one type of

symbol in the figure representing the data from one device. .........................122

Figure 4.14 The WTA architecture implemented by integrate-and-fire neurons and

oxide synaptic devices. ..................................................................................125

Figure 4.15 A spiking scheme for implementing the unsupervised competitive

learning algorithm designed for (a) binary synapse and for (b) analog

synapse. ..........................................................................................................126

Figure 4.16 Simulated normalized conductance map between the input layer neurons

and the output layer neurons utilizing binary synapse with stochastic

learning (a)-(c) and analog synapse with depression learning (d)-(f). The

normalization is done with respect to a reference that is the highest

conductance in the synapse array before the training. ...................................128

Figure 4.17 Simulated system performance metrics as a function of programming

conditions. Network orientation selectivity and orientation storage

capacity for binary synapse in (a) and for analog synapse in (c); Energy

consumption of the synaptic devices during the whole training (200

training images) for binary synapse in (b) and for analog synapse in (d).

The average values through 100 independent simulation runs are shown. ....132

xx

Figure 4.18 Simulated orientation selectivity as a function of the analog synapse

variability δR/R. At each error bar, 100 independent simulations are

performed. ......................................................................................................133

1

Chapter 1 Introduction

1.1 Motivation for Emerging Memory Technologies

As CMOS scaling will soon approach its physical limit, it is necessary to consider new

computing paradigms that continue improving the computing system performance. We

are entering the “Big Data” era which puts data sets remotely at the data center for

computing, and the size of data sets is rapidly exceeding exascale [1]. The functionality

and performance of today’s computing system are increasingly dependent on the

characteristics of the memory sub-system. The current memory sub-system has a well-

known pyramid-like hierarchy (Figure 1.1): SRAM, DRAM, and FLASH are the

mainstream technologies serving as caches, main memory, and solid-state-drive,

respectively. Going up of the pyramid, the latency decreases, and going down of the

pyramid, the capacity increases. From an architecture perspective, the multi-core

processors on the chip require more data to be fed in time to support the high computing

throughput. It means that memories need either lower latency or larger capacity to

increase the bandwidth. Meanwhile, low-power applications call for a non-volatile

memory on chip to minimize the standby leakage power. As a summary, the conventional

memory technologies are hard to satisfy the memory requirement with device scaling.

Therefore, emerging memory technologies have been actively pursued both in industry

and academia in hopes of finding solutions for future information storage needs.

2

Figure 1.1 The memory hierarchy in today’s system, which has a pyramid structure.

SRAM, DRAM, FLAH are the mainstream technologies serving for caches, main

memory, and solid-state-drive, respectively.

The ideal characteristics for a memory device include fast programming speed (<ns),

low operation voltage (<1 V), low power/energy consumption (~fJ/bit for write), long

retention time (>10 years), long write/read endurance (>1016 cycles), and excellent

scalability (<10 nm). Nevertheless, it is challenging to satisfy the entire ideal

characteristic in a single “universal” memory [2], several emerging memory technologies

have been pursued towards to achieving part of those ideal characteristics. Those

candidates are spin-transfer-torque magnetoresistive random access memory (STT-

3

MRAM) [3], phase change random access memory (PCRAM) [4], and resistive random

access memory (RRAM) [5]. These emerging memory technologies share some common

features: they are non-volatile two-terminal devices, and they differentiate their states by

the switching between a high resistance state (HRS) and a low resistance states (LRS).

The transition between the two states can be triggered by electrical inputs. However, the

detailed switching physics is quite different for different memories: STT-MRAM relies

on difference in resistance between the parallel configuration (corresponding to LRS) and

anti-parallel configuration (corresponding to HRS) of two ferromagnetic layers separated

by a thin tunneling insulator layer; PCRAM relies on chalcogenide material to switch

between the crystalline phase (corresponding to LRS) and the amorphous phase

(corresponding to HRS); and RRAM relies on the formation (corresponding to LRS) and

the rupture (corresponding to HRS) of conductive filament between two electrodes. Due

to the different underlying physics, the device characteristics are also different among

emerging memory technologies. Table 1.1 compares the device characteristics between

various emerging memory technologies with various conventional memory technologies.

As it is seen from Table 1.1, as compared with SRAM, STT-MRAM has the advantage of

a smaller cell area, while STT-MRAM has maintained low programming voltage, fast

write/read speed and long endurance, thus STT-MRAM is attractive for embedded

memory on chip, e.g., the SRAM replacement in caches [6]. As compared with FLASH,

RRAM is attractive due to its lower programming voltage and faster write/read speed,

4

thus the initial target of RRAM technology development is to replace FLASH for mass-

storage application. Besides replacing the existing technologies, those emerging memory

technologies hold the potential to revolutionize today’s memory hierarchy by adding

more levels in the hierarchy, e.g. creating a storage class memory level between the main

memory and the storage memory [7]. It should be mentioned that different emerging

memories may have different application spaces due to their various characteristics, and

hybrid system with emerging memory and conventional memory is also attractive [8].

Table 1.1 Device Characteristics of conventional and emerging memory technologies

Conventional Memories Emerging Memories

SRAM DRAM FLASH STT-

MRAM PCRAM RRAM

NOR NAND

Cell area 140F2 6F2 10F2 <4F2 ~20F2 4F2 <4F2 if 3D

Multi-bit 1 1 1 3 1 3 2

Voltage <1V <1V >20V >20V <2V <3V <3V

Read time <1ns ~10ns ~10ns ~0.1ms <10ns ~10ns <10ns

W/E time <1ns ~10ns 1µs/10ms 1/0.1ms <10ns ~50ns <10ns

Retention N/A ~64ms >10y >10y >10y >10y >10y

Endurance >1E16 >1E16 >1E5 >1E4 >1E15 >1E9 >1E6~1E12

Write

Energy

(J/bit)

<fJ ~10fJ ~100pJ <0.1fJ ~0.1pJ ~10pJ ~0.1 pJ

F: feature size of the lithography used for patterning the cell

1.2 Resistive Switching Phenomenon in Oxides

The resistive switching phenomenon in oxides was firstly reported in the 1960s [9].

Recent work on the oxide based resistive switching memory can be traced back to the

5

discovery of hysteresis I-V characteristics in perovskite oxides such as Pr0.7Ca0.3MnO3

[10][11], SrZrO3 [12], SrTiO3 [13], etc. in the late 1990s and the early 2000s. Since

Samsung demonstrated NiO memory array integrated with the 0.18 µm silicon CMOS

technology in 2004 [14], research activities have been focused on binary oxides such as

NiOx [15], TiOx [16], CuOx [17], ZrOx [18], ZnOx [19], HfOx [20], TaOx [21], AlOx [22],

etc., because of the simplicity of the material and good compatibility with silicon CMOS

fabrication process. Note that the oxides here are often non-stoichiometric, thus we use

subscript “x” for the oxygen composition in this thesis.

Generally speaking, there are two types of resistive switching memory. One type is

based on the conductive filaments (CF) of oxygen vacancies (Vo); the other type is based

on the CF of metal atoms, which is also called conductive-bridge RAM (CBRAM).

CBRAM relies on the fast-diffusive Ag or Cu ions migration into the oxide (or

chalcogenide) to form a conductive bridge. In this thesis, we focus our discussion within

the Vo based binary oxide memory1. For the perovskite oxide memory, it can be referred

to the review by R. Waser et al. [23]. For the CBRAM, it can be referred to the review by

I. Valov et al. [24].

So far, tens of binary oxides have been found to exhibit resistive switching behavior.

Most of them are transition metal oxides, and some are lanthanide series metal oxides.

1 If not specifically noted, the term “RRAM” refers to the Vo based binary oxide memory

in this thesis.

6

The materials for the resistive switching oxide layer and the electrodes are summarized in

Table 1.2. Besides metal electrode, conductive nitrides, e.g. TiN, TaN are also commonly

used as electrode materials.

Table 1.2 Summary of the materials that have been used for binary oxide RRAM. Metals

of the corresponding binary oxides used for the resistive switching layer are colored in

yellow. Metals used for the electrodes are colored in blue.

Before starting the following discussion, we first introduce some basic concepts and

terminologies about oxide RRAM. The switching event from HRS to LRS is called the

7

“SET” process. Conversely, the switching event from LRS to HRS is called the “RESET”

process. Usually for the fresh samples in its initial resistance state, a voltage larger than

the set voltage is needed to trigger on the resistive switching behaviors for the subsequent

cycles. This is called the “FORMING” process. The switching modes of the oxide

RRAM can be broadly classified into two switching modes: unipolar and bipolar. Figure

1.2 shows a sketch of the I-V characteristics for the two switching modes. Unipolar

switching means the switching direction depends on the amplitude of the applied voltage

but not on the polarity of the applied voltage. Thus SET/RESET can occur at the same

polarity. If the unipolar switching can symmetrically occur at both positive and negative

voltages, it is also referred as a non-polar switching mode. Bipolar switching means the

switching direction depends on the polarity of the applied voltage. Thus SET can only

occur at one polarity and RESET can only occur at the reverse polarity. For either

switching modes, to avoid a permanent dielectric breakdown in the FORMING/SET

process, it is recommended to enforce a compliance current, which is usually provided by

the semiconductor parameter analyzer, or more practically, by a memory cell selection

transistor/diode or a series resistor that is either on-chip or close to the memory cell. To

read the data from the cell, a small read voltage is applied that does not affect the state of

the memory cell to detect whether the cell is in HRS or LRS.

8

Figure 1.2 (a) Schematic of metal-insulator-metal structure for oxide RRAM, and

schematic of metal oxide memory’s I-V curves, showing two modes of operation: (b)

unipolar and (c) bipolar.

It is observed that the electrode materials have a significant effect on the switching

modes of the oxide RRAM. Table 1.3 summarizes the switching modes for various oxide

RRAM devices with different electrode combinations. Even with the same oxide material

but with different electrode materials, the switching modes can be different. Therefore, it

is inferred that the switching mode is not an intrinsic property of the oxide itself but a

property of the both the oxide material and the electrode/oxide interfaces. In most cases,

the unipolar mode is obtained with a noble metal such as Pt or Ru as both top and bottom

electrodes. With one of the electrodes replaced by oxidizable materials such as Ti or TiN,

the bipolar mode is obtained. For bipolar switching, it was suggested that the reversed

field is required for a successful RESET because an interfacial oxygen barrier (e.g., TiON)

9

exists [25]. If both electrodes are oxidizable, there should be some asymmetry in the

oxygen gettering capability for bipolar switching. A typical structure is

TiN/metal/oxide/TiN 2 , e.g. TiN/Ti/HfOx/TiN [20], TiN/Hf/HfOx/TiN [26], where the

metal capping layer function as an oxygen gettering layer. Generally, the unipolar

switching needs a higher reset current than bipolar switching, probably due to the

requirement of thermally enhanced filament dissolution. In the following discussion, we

will focus on the bipolar switching.

Table 1.3 The switching modes for various metal oxide RRAM devices with different

electrode combination.

Unipolar Bipolar

Pt/NiOx/Pt [15] Pt/NiOx/SrRuO3 [27]

Pt/TiOx/Pt [28] Pt/TiOx/TiN [29]

Pt/ZnOx/Pt [30] TiN/ZnOx/Pt [31]

Pt/ZrOx/Pt [32] Ti/ZrOx/Pt [32]

Pt/HfOx/Pt [33] TiN/HfOx/Pt [34]

Pt/AlOx/Ru [35] Ti/AlOx/Pt [36]

2 If not specifically noted, the stack sequence is from the top to the bottom in this thesis.

10

1.3 Development and Applications of RRAM technology

The development of oxide RRAM has progressed rapidly in the last several years. In

particular, binary oxides that use materials that are familiar to the semiconductor industry

has seen intense research and development in both industry and academia. The vision for

oxide RRAM is a non-volatile memory that is fast, high-density, and compatible with

integration with conventional silicon CMOS technology. The early RRAM had large

device areas (>>µm2), large programming currents (mA), long programming times (µs),

low endurance (<103 cycles), and requires a cell selection transistor which was limited in

device density. Today, many of these deficiencies have been overcome. Device sizes

down to 10 nm × 10 nm have been demonstrated [26], programming currents are now in

the order of µA or tens of µA [26], write/read speed is in the order of ns or tens of ns [26],

write/read endurance cycles are up to 1012 [37], retention time is up to 3000 hours at

150 °C [21], and the FORMING process can be eliminated [22], although these

characteristics are yet to be simultaneously demonstrated in the same material system.

Chip-scale RRAM array from Mb to Gb size with peripheral circuitry were demonstrated

[38][39][40]. Demonstrations of multi-bit operation have also been made [41][42].

At memory array level, the device density is mostly determined by the memory cell

selection device and not the memory cell itself. A common architecture is similar as the

random accessible NOR FLASH architecture: each RRAM cell is connected with a cell

selection transistor, which is referred as 1 transistor-1 resistor (1T1R). By controlling the

11

selection transistor’s gate voltage through the word line, varying current can be provided

to control the switching characteristics of RRAM cell. The 1T1R cell area can range from

6F2 (F: feature size of the lithography used for patterning the cell) using aggressive

DRAM-like design rules with borderless contacts and zero gate to source/drain spacing to

8F2 (with F/2 gate to source/drain spacing) and 18F2 (contacts with borders).

For high-density integration, a cross-point architecture with 4F2 area is preferred. The

simplest way is to connect the word and bit lines at each node by a RRAM cell. A sneak

path problem with this simple cross-point architecture may arise (Figure 1.3): if the cell

to be read out happens to be in HRS with surrounding cells in LRS, the reading current

can easily flow through the surrounding cells in the LRS and thus a LRS data will be

mistakenly read out. Further discussions about the sneak path problem and the

interconnect effect on the cross-point architecture design can be referred to these works

[43][44][45]. The conclusions from these works indicate that in order to magnify the

write/read margin of the memory array, larger ratio of the RRAM resistance over the

interconnect resistance is desired, thus increasing the LRS resistance is helpful. In order

to further suppress the sneak paths, a cell selection element with strong I-V non-linearity

is desired to be added at each node. The cell selection element can be a diode for unipolar

switching RRAM as the 1 diode-1 resistor (1D1R) architecture, or a bi-directional

selector (with a strong non-linear I-V characteristics in both polarities) for bipolar

switching RRAM as the 1 selector-1 resistor (1S1R) architecture. The diode or the

12

selector effectively cuts off the leakage current paths at reversed bias or low bias, thus the

interference between neighboring cells is prevented.

Figure 1.3 The cross-point memory architecture a) without cell selection elements and (b)

with cell selection elements.

Here we briefly discuss the cell selection element materials. For unipolar switching, a

p–n diode is the most common device for the cell selection element. Although high

performance p–n diode is easily fabricated with current epitaxial silicon technology for

the planar device structures, it is not feasible to implement epitaxial silicon-based p–n

diode with the RRAM array at the back-end-of-line (BEOL) because it is difficult to

grow epitaxial silicon on a metal layer and high processing temperatures are required. On

13

the other hand, amorphous silicon allows for lower processing temperatures. But it does

not meet the requirement for current density for memory cell programming. Therefore,

new materials need to be explored for the cell selection element, which should both allow

for low processing temperatures and also provide high current drivability. Compared with

silicon p-n diode, oxide p-n diode is more attractive because it offers better flexibility in

processing technology because it can be fabricated during BEOL processing even at room

temperature. If the oxide material is oxygen deficient with sufficient amount of oxygen

vacancies, it is n-type; while if the oxide material is metal deficient with sufficient

amount of metal vacancies, it is p-type. Thus a combination of p-type oxide and n-type

oxide essentially forms a p-n diode. Recently, several kinds of oxide p-n diodes [46],

such as p-NiO/n-TiO2, p-NiO/n-ZnO, p-NiO/n-InZnO, p-CuO/n-InZnO, have been

demonstrated and stacked with Pt/NiO/Pt RRAM in series, among which p-CuO/n-InZnO

is regarded as the best candidate in terms of current drivability. Besides the p-n oxide

diode, through oxide/electrode or oxide/oxide interface band engineering, rectifying I-V

in uni-direction for unipolar switching or non-linear I-V in bi-direction for bipolar

switching can also be achieved. For examples, Schottky diode with Pt/TiO2/Ti/Pt stack

has been integrated with Pt/TiO2/Pt unipolar RRAM [47], Ni/TiO2/Ni bi-directional

selector has been integrated with Ni/HfOx/Pt bipolar RRAM [48], Pt/TaOx/TiO2/TaOx/Pt

bi-directional selector has been integrated with Cu/HfOx/Pt bipolar RRAM [49]. In

addition, some metal-insulator-transition materials (MIT) such as VO2, NbO2 can be used

14

for selector as well. Unlike RRAM devices, the resistive switching behavior in MIT

materials is not bistable, and it is referred as threshold switching. The threshold switching

is non-polar dependent, thus it is suitable for bipolar switching RRAM. The sharp

increase of current by magnitude of orders when reaching the threshold voltage provides

an ideal behavior as selector device. Pt/VO2/Pt selector has been integrated with NiO

unipolar RRAM [50] and ZrOx/HfOx bipolar RRAM [51]. However, the VO2 has a

transition temperature around 67 oC beyond which the threshold switching behavior

disappears [52], which is a major drawback for the practical application. Alternatively,

NbO2 has a transition temperature around 800 oC, thus it is more attractive due to its

thermal stability. W/NbO2/Pt selector has been integrated with WOx [53] and NbOx [54]

bipolar RRAM respectively. Last but not least, Cu ion motion in the Cu-containing

Mixed Ionic Electronic Conduction (MIEC) materials also shows a good selector

behavior for bipolar switching RRAM [55].

Table 1.4 compares several selection element candidates mentioned above in aspects

of voltage range, current drivability, rectifying ratio. Although substantial progress has

been made in the past few years, the development of selection element is still a key

challenge for implementing cross-point memory architecture today. Although the

reported diodes or selectors show a promising current drivability and rectifying ratio,

many of them use Pt in the stacks, which is not CMOS fabrication process-friendly.

Table 1.4 Representatives of selection devices for RRAM in the literature

15

Type Stack Voltage Range Current Drivability Ratio

p-n diode p-CuO/n-InZnO

[46]

-2V~+2V 3×104 A/cm2 3×104

Schottky

diode

Pt/TiO2/Ti/Pt [47] -2V~+2V 3×105 A/cm2 2.4×106

Bi-directional

selector

Ni/TiO2/Ni [48] -4V~+4V 105 A/cm2 103

Bi-directional

selector

Pt/TaOx/TiO2/TaOx

/Pt [49]

-2.5V~+2.5V 3.2×107 A/cm2 104

MIT Pt/VO2/Pt [51] -0.5V~+0.5V 6×106 A/cm2 50

MIT W/NbO2/Pt [54] -1.5V~+1.5V 2×106 A/cm2 50

Bi-directional

selector

MIEC [55] -0.5V~+0.5V 5×104 A/cm2 * 3×103

The voltage range is the max voltage where the current density is measured; For diode,

the ratio is defined as the forward/reverse current at the max voltage; For bi-directional

and MIT selector, the ratio is defined as the current at max voltage over half of the max

voltage; *MIEC has a higher current drivability ~50×106 A/cm2 at pulse mode (1.6V).

16

The major remaining challenge is device variability. Switching parameter variation is a

major barrier for using RRAM in large memory arrays as well as multi-bit operation.

There is still substantial cycle-to-cycle variation as well as device-to-device variation of

the device characteristics such as the switching voltages, resistance distributions, etc. To

make progress in this area, it is necessary to have a more complete understanding of the

conduction and resistive switching mechanism. This thesis aims to address this issue. At

the end of the day, the solutions to the variability problem may come from a combination

of materials engineering, device structure optimization, as well as innovations in

addressing/readout circuitry and programming algorithm.

The primary target of RRAM is to replace the FLASH technology, as FLASH is facing

scaling limitations beyond 10 nm technology node [56]. For mass-storage application, a

key challenge for RRAM is improving the integration density in terms of cost per bit (bit-

cost), so it can compete with the multi-bit storage NAND FLASH. State-of-the-art

NAND FLASH has been scaled down to sub-20 nm regime, and the 3D stackable NAND

FLASH is emerging [57][58][59]. To achieve similar device density as the 3D NAND

FLASH, a technology path toward the 3D stackable RRAM is required. There are two 3D

integration approaches for RRAM available: one is the planar RRAM stacked layer by

layer [60], however, it does not save lithography steps or masks because in each layer a

lithography step is needed to pattern the features and therefore the bit-cost remains high;

the other one is the vertical RRAM sandwiched between the pillar electrodes and multi-

17

layer plane electrodes [61][62][63], which requires only one critical lithography step or

mask thus it is a more promising approach for reducing the bit-cost. We initiated the 3D

RRAM research and proposed a cost-effective technology path towards 3D integration

(Figure 1.4). As a proof of concept work, two-layer vertical HfOx RRAM structure were

fabricated and characterized [64]. Programming schemes for random access in the 3D

array were proposed and the 3D array operation was experimentally demonstrated [65].

Furthermore, the scaling trend of the 3D vertical RRAM array was analyzed by

experiments and 3D circuit simulation [65]. Due to the space limit, these works [64] [65]

are not included in this thesis.

Figure 1.4 Schematic of the proposed 3D cross-point architecture using the vertical

RRAM cell. The vertical RRAM cells are formed at the intersections of each pillar

electrode and each plane electrode: the resistive switching oxide layer surrounds the pillar

electrode and is also in contact with the plane electrode. To enable the random access of

18

each memory cell, three-dimensional decoding is needed through WL (decoding in z-

direction), BL (decoding in y-direction) and SL of the gate of the vertical MOSFET

(decoding in x-direction).

While RRAM has the potential to be a stand-alone, high-capacity non-volatile memory

technology, it may be even more suitable for embedded applications. This is because it

offers the low programming voltage that FLASH does not offer and the non-volatility

that DRAM does not offer and yet has a speed that is comparable to DRAM. For

examples, a 8T2R nonvolatile SRAM cell with 2 RRAM at two storage nodes of SRAM

has been demonstrated for dynamical power management [66]. The use of RRAM as

reconfiguration memory enables a 3D reconfigurable field programmable gate array

architecture (FPGA) design [67]. Another emerging application is using RRAM as

artificial synaptic device for the hardware implementation of neuromorphic computing.

Owing to RRAM’s multi-level capability and low power/energy consumption, it can

behave like an analog memory emulating the function of plastic synapses in a neural

network. And part of this thesis research aims to explore the design space of RRAM

synaptic device. Although the early vision for RRAM is to strive for a 4F2 cross-point

architecture (and even with multi-bit operation and 3D integration), it is not entirely clear

that these goals continue to make sense generally, given the many diverse potential

applications of RRAM. For example, for embedded applications, improving the

19

endurance of RRAM appears to be more important, and then the design trade-offs

between the endurance and retention may be utilized. The ability to put an RRAM at the

contact vias of the MOSFET without extensive process steps [20] is also an attractive

device feature, especially for embedded applications where only a low memory capacity,

multiple-time programmable, non-volatile memory is required. Therefore, there is an

enormous opportunity to completely rethink the design of the system to gain new

functionalities and even orders of magnitude improvements in speed and/or power

consumption.

1.4 Thesis overview

This thesis addresses the key challenges of the RRAM technology development,

including the physics of resistive switching in oxides, the origin of the relatively poor

uniformity and large variability of the switching parameters. And also one promising

application of RRAM technology: emulating synapse for neuromorphic computing, is

explored.

Chapter 1 gives an overview of the background of the work in this thesis, including

the motivation of the emerging memory technologies, the basics and terminologies of

resistive switching phenomenon in oxides, the current status and challenges of RRAM

technology development.

Chapter 2 discusses the physical mechanism of resistive switching phenomenon in

binary oxides. The resistive switching is attributed to the conductive filament (made up of

20

oxygen vacancies) formation and rupture in the oxides. As a model system for device

physics study, HfOx based RRAM devices were fabricated and characterized. The trap-

assisted-tunneling was identified to be the dominant conduction mechanism. An

exponential voltage-time relationship was found between the switching time and the

applied voltage, suggesting the oxygen ion migration dynamics. Inspired by the voltage-

time relationship, an energy efficient programming scheme was also proposed for multi-

level operation.

Chapter 3 reports a Kinetic Monte Carlo (KMC) numerical simulator and a compact

model of RRAM. In order to gain more insights into the device physics such as the origin

of the switching variability, a physics-based KMC model was developed. The KMC

simulation can reproduce many experimental observations in the DC I-V sweep, pulse

switching, endurance cycling, and retention baking, etc. The tail bits in the resistance

distribution are attributed to the oxygen vacancy left over in the gap region due to a

competition between the oxygen vacancy generation and recombination. In addition, for

circuit/system-level simulations, a compact device model was developed, and the

compact model can be implemented in a SPICE simulator.

Chapter 4 explores the potential of the RRAM technology as synaptic device for the

hardware implementation of neuromorphic computing. Programming schemes were

proposed for realizing spike-timing dependent plasticity in the RRAM devices. The

gradual resistance modulation capability in RRAM was utilized for emulating analog

21

synapses, and the stochastic switching behavior in RRAM was utilized for emulating

binary synapses. In order to evaluate the effectiveness of analog synapses and binary

synapses, a simulation of winner-take-all network was performed based on the

parameters extracted from the experiments. The simulation suggests that the orientation

classification can be effectively realized using both analog synapses and binary synapses.

The system functionality is found to be robust against the RRAM device variability due

to the adaptive algorithm and parallelism of the neural network.

Chapter 5 summarizes the results and contribution of this thesis. Future work is also

proposed in this chapter.

22

Chapter 2

Characterization of Resistive Switching in

Oxides

2.1 Physical Mechanism of Resistive Switching in Binary

Oxides

The physical mechanism for resistive switching in oxide RRAM has been a

complicated topic. We need to first differentiate two types of switching mechanisms:

electronic-effect and ionic-effect. The electronic-effect mechanism conjectures that

injected charges are trapped by interface defects. These trapped charges modify the

Schottky barrier height between electrodes and oxides and thereby changing the

conductance through the stack [68][69]. Ionic-effect mechanism conjectures that the

migration of ions with related electrochemical reactions to form a conducting channel

between electrodes [23][70], which also referred as redox (reduction/oxidation) effect. In

this thesis, we focus the discussion on the ionic-effect mechanism, which is the prevailing

theory for the switching mechanism for the binary oxide RRAM.

The conduction of LRS current in the oxide RRAM is usually filamentary.

Conductive atomic force microscopy (C-AFM) measurements scanning the top surface of

the oxide [16][71] revealed that conductive filaments (CF) with nanoscale diameters are

formed after the FORMING/SET process, and conducting current passes through these

CF in LRS. The first direct observation of CF in the cross-section fashion was done by

D.-H. Kwon et al. [72], in which the nanoscale (~10 nm in diameter) CF in TiOx RRAM

was seen by high-resolution transmission electron microscopy (HR-TEM). Typically, CF

23

are sparsely and non-homogeneously distributed across the device. This means that the

conducting area in LRS is an extremely small portion of the entire electrode area, which

is typically quite large in most experiments. A usual experiment that aims at

distinguishing filamentary or bulk conduction is to measure the trend of the HRS/LRS

resistance ratio versus the cell area. If the ratio goes up when the cell area is scaled down,

filamentary conduction prevails. If the ratio remains almost constant when the cell is

scaled down, bulk conduction prevails. It should be noted that when the device size

scaled to the size that is comparable to the CF diameter (e.g. < 20 nm), a single CF may

dominate the conduction.

The nature of CF is usually conjectured to be oxygen vacancies (Vo) in many oxide

RRAM devices. It is well known that Vo can act as an effective donor in n-type metal

oxides. Taking HfOx RRAM as an example, ab-initio calculation of the monoclinic and

amorphous HfOx electronic structure [73] indicated that Vo can produce a defect state

within the band gap. Ordered Vo chain can form a transmission channel between two

metal contacts. Experimentally, HR-TEM measurements [74] on HfOx RRAM show that

the CF extends ~20 nm diameter with morphological changes and local atomic disorder.

The electron energy loss spectroscopy (EELS) on oxygen K-edges spectrum revealed the

presence of Vo associated with localized states in the band gap inside the CF region.

However, the composition of CF is not limited to Vo. Sometimes, the CF can be metallic

as well. Another study [75] using HR-TEM and EELS reported observation of a few nm

24

wide metallic CF dominated by metallic Hf in HfOx RRAM. We believe that the nature

of CF depends on the density of Vo. If the density of Vo is low, then the CF may show a

semiconducting nature as electrons are still in localized states in the band gap. If the

density of Vo is sufficiently high, then the CF may show a metallic nature as electrons

are now in extended states forming a sub-band in the band gap. A simple way to

distinguish whether the CF are metallic or semiconducting is to measure the LRS

resistance temperature dependency. If the LRS resistance goes up with the increase of

temperature, the CF are metallic and may consist of metal precipitates. On the contrary, if

the LRS resistance drops with the increase of temperature, CF are semiconducting and

may consist of Vo.

There are many efforts to fit the I-V characteristics of current conduction in the oxide

RRAM in the literature. Most reports show a linear or an Ohmic relationship in the LRS.

However, the conduction characteristics in HRS are quite diverse: Poole-Frenkel

emission (log(I/V)~V1/2) [30][33], Schottky emission (log(I)~V1/2) [21][76], the space

charge limited current (SCLC) characteristic (the Ohmic region I~V, and the Child’s

square law region I~V2) [77][78], were observed in various oxide RRAM devices.

We think that a simple I-V fitting with the aforementioned established model may not

be sufficient to ascertain the conduction in the oxide RRAM. In general, Figure 2.1

shows all the possibilities for an electron transport from cathode to anode [79]: (1)

Schottky emission: thermally activated electrons injected over the barrier into the

25

conduction band; (2) Fowler-Nordheim (F-N) tunneling: electrons tunnel from the

cathode into the conduction band, usually occurs at high field; (3) direct tunneling:

electron tunnel from cathode to anode directly, usually occurs when the oxide is thin

enough (<3 nm). If the oxide has substantial number of traps (e.g., Vo), trap-assisted-

tunneling (TAT) contributes to additional conduction, including the following steps: (4)

tunneling from cathode to traps; (5) emission from trap to conduction band, which is

essentially the Poole-Frenkel emission; (6) F-N like tunneling from trap to conduction

band; (7) trap to trap hopping or tunneling, maybe in the form of Mott hopping when the

electrons are in the localized states or maybe in the form of metallic conduction when the

electrons are in the extended states depending on the overlap of the electron wave

function; and (8) tunneling from traps to anode. Whether any one particular process

above dominates is determined by its transition rate; electrons would seek the fastest

transition (or least resistive) paths among all the possibilities. Therefore, various oxide

RRAM devices may have different dominant conduction mechanisms depending on the

dielectric properties (band gap or trap energy level, etc.), the fabrication process

conditions (annealing temperature, annealing ambient, etc.), and the properties of the

interface between the oxides and the electrodes (interfacial barrier height). The I-V

relationship at the low bias regime is mainly determined by the electron conduction

process for a given configuration of the CF, while at the high bias regime, the motion of

26

atoms (such as oxygen ions/vacancies) would change the configuration of the CF and

trigger a change of the resistance.

Figure 2.1 Schematic of the possible electron conduction paths through a

metal/oxide/metal stack. (1) Schottky emission; (2) F-N tunneling; (3) direct tunneling;

(4) tunneling from cathode to traps; (5) emission from trap to conduction band, which is

essentially the Poole-Frenkel emission; (6) F-N like tunneling from trap to conduction

band; (7) trap to trap hopping or tunneling; and (8) tunneling from traps to anode.

The detailed physical mechanism for resistive switching phenomenon in oxides is still

an active research area. Here we aim to give a general physical picture for the simple

binary oxide RRAM (Figure 2.2). The FORMING process for the fresh samples is similar

as a dielectric soft breakdown [80]. Initially, the Vo density is low. Under the high

27

electric field (>10 MV/cm), the oxygen atoms are knocked out of the lattice, and becomes

oxygen ions (O2-) drift toward the anode, where they are discharged as neutral non-lattice

oxygen if the anode materials are noble metals or react with the oxidizable anode

materials to form an interfacial oxide layer. Thus the electrode/oxide interface behaves

like an “oxygen reservoir” [81]. Meanwhile, Vo in the bulk oxide are generated. The

localized deficiency of oxygen leads to the formation of CF. Usually the as-deposited

RRAM oxide thin films are amorphous or poly-crystalline, and the CF are preferentially

generated along the grain boundaries as revealed by C-AFM [82]. Then the RRAM

device switches to the LRS. During the RESET process, O2- migrate back from the

interface to the bulk oxide either to recombine with the Vo or to oxidize the metallic CF

precipitates and thus partially rupture the CF. For the unipolar switching, Joule heating

by the current thermally activates the O2- diffusion. And O2- diffuse from the interface or

the region around the CFs [83] due to the concentration gradient. Usually the unipolar

switching requires a relatively higher reset current to raise the local temperature around

CF. For the bipolar switching, the interfacial layer may present a significant diffusion

barrier and pure thermal diffusion is not sufficient, thus O2- migration needs to be aided

by the reverse electric field [25]. Nevertheless, in both cases the CF is partially ruptured,

and a Vo-poor region forms and causes a tunneling gap for electrons. Then the RRAM

device switches to the HRS. The residual CF with Vo-rich region is referred as the

“virtual electrode”. In the next SET process, the soft-breakdown occurs in the gap region

28

and the CF reconnect both electrodes. And such SET/RESET loop can repeat for many

cycles. This picture can at best be viewed as a phenomenological description of the

experimental observations for many binary oxide RRAM devices. And the details of the

electron transport and ion dynamics will be discussed in the following sections.

Figure 2.2 Schematic of the switching mechanism of oxide RRAM.

2.2 HfOx Based RRAM Device Fabrication and

Characterization

As a model system, we chose HfOx based RRAM for the study in this thesis simply

because HfOx is one of the most mature RRAM materials explored so far. For example,

H.-Y. Lee et al. [20] reported a TiN/Ti/HfOx/TiN stack with superior performances such

29

as switching speed (~5 ns), switching voltage (<1.5 V), reset current (~25 µA), HRS/LRS

ratio (>100), endurance (>106 cycles), retention (>10 hr @200 °C). Based on this stack,

S.-S. Sheu et al. [38] reported a 4 Mb RRAM chip with 7.2 ns read/write random-access

time and 160 ns 2bit/cell operation time. Recently, B. Govoreanu et al. [26] reported an

ultra-scaled 10 nm×10 nm TiN/Hf/HfOx/TiN stack with superior performances such as

switching speed (~10 ns), switching voltage (<1.5 V), reset current (~50 µA), HRS/LRS

ratio (>10), endurance (>5×107 cycles), retention (>30 hr @250 °C).

We fabricated RRAM devices with TiN/HfOx/AlOx/Pt stack and TiN/HfOx/Pt stack (as

control sample) at Stanford Nanofabrication Facility. The fabrication process flow is as

follows: 50 nm Pt was first deposited by e-beam evaporation on silicon substrate as the

bottom electrode layer. Then 5 nm AlOx was deposited by ALD using TDMA-Al

(tetrakis dimethylamido aluminum Al[N(CH3)2]4) and H2O as precursors at 300 °C, and

then 5 nm HfOx was deposited by ALD using TEMA-Hf (tetrakis ethylmethylamino

hafnium Hf[N(C2H5)(CH3)]4) and H2O as precursor at 220 °C. For control sample, 10 nm

HfOx single layer was deposited using the same process conditions. Then the patterns

with 0.5 µm×0.5 µm ~ 10 µm×10 µm active cell area were defined by photolithography

with photoresist. Then 50 nm TiN was deposited by reactive sputtering and was lifted-off

as top electrode layer. Then the electrical measurements were performed on these

samples, Agilent 4156C semiconductor parameter analyzer was used for DC

measurements and Agilent 81101A pulse generator was used for pulse measurements.

30

The bottom electrode was grounded and the signals were applied to the top electrode in

all the measurements.

The fabricated RRAM devices show the typical bipolar switching behavior. The initial

resistances of these samples are very high (>10 GΩ), and a forming voltage (~8 V) is

needed to trigger the subsequent switching events. Figure 2.3 shows the typical DC I-V

characteristics of the HfOx/AlOx bi-layer samples and HfOx single layer samples. The

devices can SET by a positive bias with 100 µA compliance current was enforced, and

they can RESET by a negative bias. Figure 2.4 shows the HRS and LRS resistance

distribution in (a) and the switching voltage distribution in (b) of the HfOx/AlOx bi-layer

samples and the HfOx single layer samples obtained by 200 DC sweep cycles. The bi-

layer samples show less variation of the HRS resistance and switching voltages. It has

been suggested that the diffusion of Al atoms into HfOx could stabilize the CFs by the

experiments and the ab-initio principle calculation prediction [84], thus it is speculated

that the inter-diffusion of Hf and Al atoms in HfOx/AlOx bi-layer samples may help

improve the switching uniformity. Since the bi-layer samples shows less variation, in the

following part of this chapter, the study was performed on the HfOx/AlOx bi-layer

samples.

31

Figure 2.3 Typical DC I-V characteristics of the HfOx/AlOx bi-layer samples and HfOx

single layer samples. The devices show a bipolar switching behavior.

32

Figure 2.4 The HRS and LRS resistance distribution in (a) and the switching voltage

distribution in (b) of the HfOx/AlOx bi-layer samples and the HfOx single layer samples

obtained by 200 DC sweep cycles.

(a)

(b)

33

In order to assess the switching speed of our RRAM device, we performed the pulse

measurement by connecting the RRAM device in series with the 50 Ω input impedance

of the Tektronix DPO 4054 oscilloscope. The response current waveform was calculated

using the voltage waveform monitored in the oscilloscope divided by 50 Ω. Figure 2.5

shows the transient voltage/current waveform during (a) RESET transition and (b) SET

transition. In (a), the RESET starts with a LRS ~10 kΩ, and then a -2.5 V/50 ns RESET

voltage pulse is applied. The current drops after a wait time around 10 ns, indicating the

RESET event occurs at that moment. At the end, the cell switches to HRS ~1 MΩ. In (b),

the SET starts with a HRS ~1 MΩ, and then a +1.5 V/50 ns SET voltage pulse is applied.

The current increases after a wait time around 30 ns, indicating the SET event occurs at

that moment. At the end, the cell switched to LRS ~8 kΩ. It should be noted that the

SET/RESET speed is actually a function of the applied pulse amplitude as we will

discuss in Section 2.4. If increasing the pulse amplitude, the switching speed can be faster

than the limitation of our measurement set-up (~10 ns). In addition, we can obtain the

energy consumption per SET/RESET programming by integrating the voltage, current,

and time from these waveforms. According to Figure 2.5, the SET energy is ~5.9 pJ and

the RESET energy is ~6 pJ. The energy dissipated in the RESET process is necessary for

the switching because we start with a high LRS current, thus it is unavoidable. On the

contrary, the energy dissipated in the SET process is unnecessary because ideally we can

withdraw the pulse right after the SET occurs. However, considering the variation in the

34

switching time, a SET pulse that is longer than the actual SET time is needed which

causes the waste of energy in the SET process.

Figure 2.5 The transient voltage/current waveform during (a) RESET transition and (b)

SET transition.

(a)

(b)

35

The reliability of our RRAM device is examined. First, the pulse cycling test was

performed. Figure 2.6 shows the HRS and LRS resistance degradation during the

SET/RESET cycling. During the cycling, the HRS resistance keeps decreasing, and about

106 cycles have been obtained before the resistance window collapses. Figure 2.7 shows

the I-V characteristics after 5×105 cycles, and it is clear that the HRS resistance has

decreased significantly as compared with the initial state. The I-V curve becomes non-

linear in LRS and the LRS resistance gets increased as compared with the initial state.

This degradation trend indicated during the cycling more and more Vo are generated in

the oxide making the HRS resistance decrease, meanwhile more and more O2- migrated

towards the TiN electrode where an interfacial TiON tunneling layer may form causing

the non-linear I-V in the LRS. The final failure is stuck at LRS, indicating that there are

eventually too many Vo in the oxide that not sufficient O2- can come back from TiN.

36

Figure 2.6 The HRS and LRS resistance degradation during the SET/RESET cycling.

Figure 2.7 The I-V characteristics after 5×105 cycles

37

Then the retention property of our RRAM device was studied. Figure 2.8 shows the

stability of the device for different resistance states measured at an elevated temperature

(100 °C). The resistance states are monitored by a 0.1 V read voltage every 10 s, and no

significant change of the states is observed for a period of 7200 s (2 hours).

Table 2.1 summarizes the characteristics of our HfOx/AlOx RRAM device. The

characteristics are not the best in the literature, but more than enough for the device

physics study in the next two sections.

Figure 2.8 The stability of the device for different resistance states measured at an

elevated temperature (100 °C).

38

Table 2.1 The characteristics of the HfOx/AlOx RRAM device.

Switching Voltage <3 V

Reset Current ~100 µA

Switching Speed ~10 ns

HRS/LRS ratio 10 kΩ/1 MΩ

Endurance ~106 cycles

Retention >2 hours @100 °C

2.3 Electron Conduction Process in HfOx Based RRAM

2.3.1 I-V Measurements at Various Temperatures

In previous studies of high-k gate materials [85], the leakage current through HfO2 was

usually attributed to Poole-Frenkel emission. Therefore, we first investigated this

possibility as follows. Figure 2.9 shows the Poole-Frenkel fitting results. Figure 2.9 (a)

shows the relation of ln(I/V) versus √V in HRS under the negative bias for different

temperatures. The DC sweep stops at 0.5 V (<the switching voltage) to avoid any

changes to the resistance state unintentionally. At the low bias regime (<0.2 V), the

experimental data deviates significantly from the Poole-Frenkel model prediction. Figure

39

2.9 (b) shows ln(I/V) versus 1/kT for different biases, and if we extract the activation

energy (Ea) from the slope of Figure 2.9 (b), we can plot Ea as a function of √V in Figure

2.9 (c). Extrapolating the curve of Figure 2.9 (c) to the zero bias point, the trap energy

(Et) is extracted to be around 0.08 eV, and from the slope of Figure 2.9 (c), the dielectric

constant (εr) for HfO2 is extract to be about 79. The value of Et~0.08 eV below the

conduction band is too shallow and is not corroborated with the ab-initio calculation

results (~1.2 eV - 2.1 eV) [86], experimental data (~1.5 eV) obtained by Poole-Frenkel

fitting [85] or (~1.2 eV) obtained by spectroscopic ellipsometry [87]. The value of the

extracted dielectric constant εr ~79 is much higher than the known value ~20 for HfO2.

Therefore, both of these two key parameters determined by fitting to a Poole-Frenkel

model are unreasonable. Figure 2.10 shows the temperature dependency of different

resistance states. The current is found to be insensitive to the temperature change, and

similar trends were observed in HfOx RRAM by others [34][88] as well. Therefore, we

suggest that the conduction may be dominated by the tunneling.

40

(b)

(a)

41

Figure 2.9 Poole-Frenkel emission model fitting results: (a) In(I/V) vs. sqrt(V) for

electron injection from TiN under a negative bias. (b) In(I/V) vs. 1/kT to extract the

activation energy (Ea) (c) Extrapolating Ea to zero bias, the trap energy (~0.08 eV) is

extracted from the intersection on the y-axis, and the dielectric constant (~79) is extracted

from the slope. These two parameters are unreasonable compared to their known values.

(c)

42

Figure 2.10 Temperature dependency (In(I/V) vs. 1/kT) for multi-level resistance states at

both positive and negative biases.

Our measured current is neither a strong function of the temperature (ruling out the

possibility of Schottky emission) nor of the electric field (ruling out the possibility of F-N

tunneling). And direct tunneling for a 10 nm thick oxide should be negligible. This leaves

the TAT path (the steps (4)(7)(8) in Figure 2.1) as a most probable conduction

mechanism. Poole-Frenkel emission is essentially one step of the TAT processes (step (5)

in Fig. 1 (b)), which rises from the barrier lowering effect due to the Coulomb attraction

between the positively charged trap center and negatively charged conduction electron

into the conduction band. It should be noted that the Poole-Frenkel model implicitly

assumes that the electron injection from cathode is easy and that there are plenty of

43

electrons available in the filled traps for the subsequent emission. However, this

assumption may not be valid when the electron injection is limited by a significant

interface potential barrier or when the trap density is very high which requires more

electrons to fill the traps than provided [89]. Whether any one particular process

dominates is determined by its transition rate (ν), and the step with the smallest transition

rate limits the whole conduction process. The first step for TAT is step (4), and among

the subsequent de-trapping steps (5)-(7), step (7) has the largest rate. This is because the

rate for thermal activation from the deep trap energy in HfO2 (Et~1.2-2 eV) with ν=

ν0exp(−Et/kT) is much smaller than the rate for the tunneling from trap to trap with ν =

ν0exp(−2R/ξ). Here ν0~1013 Hz, ξ is the electron wavefunction localization length (~0.3

nm), R is the distance between traps (~0.3-0.6 nm) since the trap density is expected to be

very high in the CF. In order to further characterize the TAT process, low frequency

noise (LFN) measurement and AC conductance measurement were performed in the

following.

2.3.2 Low Frequency Noise (LFN) Measurement

LFN measurement is a technique that can electrically characterize the trap-assisted

conduction process in dielectrics [90]. A 1/f α –like power spectral density (PSD) has

been observed for TiO2 RRAM [91], NiO RRAM [92], etc. Here we perform LFN

measurement on our HfOx/AlOx RRAM. The LFN measurement set-up is as follows

(Figure 2.11): Agilent 4156C semiconductor parameter analyzer was used to provide the

44

DC bias. The noise current was fed into in a Stanford Research System SR 570 low noise

current to voltage amplifier, and the output signals were analyzed by a Stanford Research

System SR 760 spectrum analyzer. The electrical measurement was done in a shielded

environment. The SR 570 amplifier has a bandwidth of 1 MHz, and adjustable gain, to

measure LRS current, a gain of 105 Ω was used, and to measure HRS current, a gain of

108 Ω was used. The SR 760 spectrum analyzer was set to analyze the spectrum with 500

times averaging.

Figure 2.11 Schematic of the LFN measurement set-up.

To determine the background noise level of the measurement system, we measured the

noise level of the measurement set-up by applying zero bias on the device. The system

45

noise is introduced by the 60 Hz of the power supply. The measurement system’s power

spectral density (PSD) level is measured around 10-22~10-24 A2/Hz. The system noise

level actually depends on the gain amplitude used in the amplifier: for measuring LRS, a

lower gain 105 was used, leading to a PSD level ~10-22 A2/Hz; for measuring HRS, a

higher gain 108 was used, leading to a PSD level ~10-24 A2/Hz. In either case, the

measured memory device noise is about 3 orders larger than the measurement system’s

noise. Therefore, the effect of the measurement set-up is negligible. The theoretical value

of the equivalent resistors’ thermal noise (4kBT/R) is even below the measurement

system noise level. Figure 2.12 shows the RRAM device noise, measurement system’s

noise, and equivalent resistor’s thermal noise for LRS (a) and HRS (b).

(a)

46

Figure 2.12 The RRAM device noise, measurement system’s noise, and equivalent

resistor’s thermal noise for LRS (a) and HRS (b).

Figure 2.13 shows the relative noise current fluctuation in the time domain for

different resistance states. The higher HRS levels can be obtained by increasing the reset

stop voltages in a DC sweep. Generally, the higher the resistance state is, the larger the

relative fluctuation is. Figure 2.14 shows the normalized PSD (Si/I2) in the frequency

domain for different resistance states. Here the normalization is done by dividing the PSD

value (Si) by the square of the average current (I2). It is seen that the higher the resistance

state is, the larger the normalized PSD is, which is in agreement with the data in the time

domain. Also it is seen that for LRS the slope index α is close to 1, while for HRS there is

(b)

47

a cutoff frequency above which α changes from 1 to 2. Similar LFN behavior that shows

a slope index change was also observed in other RRAM devices [91]. Figure 2.15 shows

the PSD (Si) as a function of the squared DC bias for LRS (the inset) and HRS

respectively. It is seen that the PSD (Si) in LRS follows a linear relation with the squared

DC bias while that in HRS follows a super-linear relation with the squared DC bias,

which is consistent with the I-V characteristics. As a result, the normalized PSD (Si/I2)

for both HRS and LRS are almost independent of the DC bias as shown in Figure 2.16,

suggesting that there is no optimal bias point for maximum signal to noise ratio (SNR)

for the read operation.

Figure 2.13 The relative noise current fluctuation in the time domain for different

resistance states.

48

Figure 2.14 The normalized PSD (Si/I2) in the frequency domain for different states.

Figure 2.15 The PSD (Si) as a function of the squared DC bias for LRS (the inset) and

HRS respectively.

49

Figure 2.16 The normalized PSD (Si/I2) as a function of the DC bias voltage.

In the following, we discuss the conduction mechanism that is revealed by the above

LFN characterization. In general, the electron trap/detrap processes give rise to noise

current on the top of the steady state current of the RRAM devices. In the three steps in

TAT, the tunneling between the electrode and traps is the bottleneck of the DC

conduction process (especially in HRS) because: 1) the electron injection/ejection is

limited by a significant interfacial potential barrier as the case of metal/oxide interface,

leading to a slow tunneling rate between the electrode and traps; 2) the trap density of the

remaining un-ruptured section of the CF is high, leading to a fast trap-to-trap tunneling

rate within the residual CF. The relaxation time τ in the trap/detrap process, is then

determined by the transition time of the bottleneck: tunneling between the electrode and

50

traps. The Wentzel-Kramers-Brillouin (WKB) approximation is used to calculate this τ as

Eq. 2.1.

τ = τ0/fFermi−Dirac(Eb − Et) ∙ exp(γ ∙ d) (2.1)

where d is the distance from electrode to trap, τ0 is the pre-exponential time factor

(~10-14 s [93]), γ = 2√2m∗Et/ℏ is the WKB approximation factor for the transmission

probability at low bias, trap energy Et=1.6 eV [85] in HfO2, TiN/HfO2 interface potential

barrier Eb=1.9 eV [94], and effective mass m*=0.1m0 for HfO2 [85]. For a specific τ, the

PSD of the noise current Si(ω) can be expressed in a Lorentzian function as Eq. 2.2 [90].

Si(ω) = (ΔI)2 ∙4τ

1+ω2τ2 (2.2)

where ω=2πf is the angular frequency, (ΔI)2 is the root mean square (RMS) value of

the noise current. And if the τ has a probability distribution p(τ), the PSD form is

modified to be Eq. 2.3.

Si(ω) = ∫ (ΔI)2 ∙4τ

1+ω2τ2∙ p(τ)dτ

τ(max)

τ(min) (2.3)

where τ(min) is the relaxation time determined by the shortest tunneling distance, and

τ(max) can be considered to be infinite if we consider the direct tunneling from one

electrode to another across entire 10 nm thick oxide is negligible.

51

Figure 2.17 shows the Lorentzian function for different tunneling distances between

the electrode and traps. In LRS, CF connect both electrodes, thus electrons can tunnel

from the electrode to all the traps nearby with various relaxation times. If we assume a

spatially uniform distribution of traps in LRS, p(R) = nt ∙ 4πR2, with a trap density nt,

by changing the integration variable: p(τ) = p(R)dR/dτ , we can complete the

integration at the limit when τ(min) is approaching zero for the case of LRS, and obtain

the result of Eq. 2.4.

Si(ω) = (ΔI)2 ∙ 8π2nt/γ ∙ Rω2 /ω (2.4)

where Rω = 1/γ ∙ ln(1/ωτe−t0 ) is the characteristic tunneling distance at frequency ω,

which has only a weak dependence on ω. With respect to the frequency, Si(ω)~1/ωα, here

the slope index 𝛼 = 1 + 2/ln(1/ωτ), sinceωτ ≪ 1at the LFN regime (f < 3 kHz), α

approaches 1. Intuitively speaking, in LRS the electrons have multiple choices when

they tunnel from the electrode to the traps nearby, and the contribution from all these

transitions will smooth the 1/f 2 Lorentzian function thus the envelope leads to 1/f α-like

(α~1) LFN behavior. In HRS, the CF is partially ruptured and the shortest distance

between the first trap and the electrode causes a minimum τ, thus ω=1/τ(min) corresponds

to the cutoff frequency in the Lorentizian. Therefore, the cutoff frequency becomes an

indicator of the ruptured CF length. For a typical HRS range (500 kΩ-50 MΩ), the

ruptured CFs length is estimated to be 1.5 nm-2 nm.

52

Figure 2.17 Lorentzian function 4τ/(1 + ω2τ2) for different τ. Single Lorentzian is 1/f 2,

and the envelope of multiple Lorentzian is 1/f. The cutoff frequency corresponding to a

shortest electrode-to-trap tunneling gap distance (~2 nm) is shown (the dash line).

2.3.3 AC Conductance Measurement

In addition to LFN measurement, we also performed the impedance spectroscopy on

our RRAM device. Impedance spectroscopy is a technique that can separate the electrical

components for electronic conduction in dielectrics, including resistance, inductance, and

capacitance and reveals the dielectric dispersion in the frequency domain [95].

Impedance spectroscopy has been employed to characterize NiO RRAM [96], and TiO2

RRAM [97], etc. Here we perform impedance spectroscopy on the HfOx/AlOx RRAM.

53

We further extract the AC conductance as a function of applied small signal frequency to

investigate the conduction mechanism. Impedance (Z) spectroscopy was carried out by

applying a 10 mV AC small signal (100 Hz to 1 MHz) to the RRAM device through

Agilent 4284 LCR-meter. Figure 2.18 shows the Nyquist plot (-Im(Z) vs Re(Z)) for HRS

and LRS. A semicircle in the Nyquist plot suggests the RRAM cell in HRS can be

modeled as a resistor (R) and a capacitor (C) in parallel:

Z =R

1+ωR2C2− j

ωR2C

1+ωR2C2 (2.5)

Note that in the Nyquist plot, the diameter of the semicircle is the resistance R, and at

the characteristic frequency ω=1/RC, the locus of the data points in the Nyquist plot

reaches the maximum value (-Im(Z)=R/2) of the negative imaginary impedance axis. C is

measured to be 1.15 pF for our oxide stack (10 nm thick and 10×10 µm2 in size), and C is

found to be independent on the resistance states. As an estimate, the characteristic

frequency for HRS=30 MΩ is about 4.6 kHz. In the inset of Figure 2.18, the points that

cluster near the Re(Z) axis suggests the RRAM cell in LRS is shunted by CF of a low

resistance ~20 kΩ. The characteristic frequency in this case becomes 6.9 MHz, which is

beyond the measurement frequency range limit (<1 MHz). Therefore, the locus of the

data points in the Nyquist plot cannot reach the maximum value (-Im(Z)=R/2).

54

Figure 2.18 The Nyquist plot (-Im(Z) vs Re(Z)) for HRS and LRS.

We further extract the conductance as a function of the small signal frequency from the

impedance spectroscopy. Figure 2.19 shows the total conductance spectrum in LRS and

HRS obtained under various DC biases. It is seen that LRS conductance is independent of

frequency, while HRS conductance has a corner frequency beyond which the

conductance rises. Figure 2.20 shows the total conductance spectrum in different

resistance levels in HRS. It is seen that the higher the HRS resistance is, the lower the

corner frequency is.

55

Figure 2.19 The total conductance spectrum in LRS and HRS under various DC biases.

Figure 2.20 The total conductance spectrum in different resistance levels in HRS.

56

The AC conductance in the HRS is obtained by subtracting the DC component from

the total conductance. Figure 2.21 shows the AC conductance spectrum (a) for various

DC biases at a particular HRS level and (b) for different HRS levels at a particular DC

bias. It is seen that a universal f β-like (with β~2) AC conductance is observed

independent of the DC bias and also independent of the resistance of the different HRS

states.

57

Figure 2.21 The AC conductance spectrum (a) for various DC biases at a particular HRS

level and (b) for different HRS levels at a particular DC bias.

58

Next, we discuss the conduction mechanism that is revealed by the AC conductance

measurement. The key observation is a rise of conductance beyond a certain corner

frequency for the HRS. In general, the increase of conductance with AC small signal

frequency can be attributed to the dielectric relaxation process, which typically has

several origins: electronic polarization, ionic polarization, dipolar polarization [98].

However, all of the aforementioned processes occurs at very high frequency (>1 GHz).

Therefore, the only possibility that can cause the rise of AC conductance in the medium

frequency range (kHz-MHz) is the electron hopping (trap-to-trap tunneling) contribution

[98]. This relaxation is often described in terms of the following Debye form [99]:

σ(ω) = 𝛼 ∙ω2τ

1+ω2τ2(2.6)

where σ(ω) is the AC conductance as a function of the angular frequency ω, 𝛼 =

e2d2/12kT is the polarizability due to electron hopping between a pair of sites [99], here

e is the electronic charge, and d is the hopping distance. τ is the transition time for

electron hopping between a pair of sites. As a rough estimate, for hopping (trap-to-trap

tunneling), τ = t0exp(2d/ξ) = 2 × 10−12s . Here t0~10-13 Hz, ξ is the electron

wavefunction localization length (~0.3 nm), d is the distance between traps (assuming

d~0.5 nm for traps in CFs). Therefore, in the AC conductance measurement regime (kHz-

MHz), ωτ≪1, thus σ(ω) ~ ωβ with a slope index β~2. It is noted that the trap-to-trap

tunneling that contributes to the AC conductance should be those hoppings that are

59

confined within the nearest neighbor traps with a single τ, otherwise the trap-to-trap

tunneling with different τ would smooth Eq. (2.6) to be σ(ω) ~ ωβ with a slope index β~1.

We have to point out that the measured f β -like (β~2) AC conductance in our device is

not a result of the lead series resistance as suggested in [100], because we did not observe

the saturation of AC conductance or the cutoff frequency due to the lead series resistance

that were observed in [100].

To summarize, Figure 2.22 shows a schematic of the DC and AC conduction processes

in RRAM device. In HRS, the CF is partially ruptured and a gap region lack of traps is

formed near one electrode, leaving behind the residual CF that is rich in traps. The

electrode-to-trap tunneling across the gap region causes the DC noise current. Unlike DC

conduction that is associated with conducting paths extending to both electrodes, AC

conduction is based on the pair approximation of the transition between two sites

responding to the external AC field. As long as the traps within the residual CF can

interact with each other, no matter how long the ruptured CF is (which determines for the

DC resistance level), the AC conductance (and the value of β) should be independent on

the DC bias or the HRS resistance (see Figure 2.21). However, for AC conductance to be

observable, it must be larger than its DC component above a certain corner frequency.

Since AC conductance is independent of the resistance of the multi-level states, a lower

corner frequency correlates with a longer transition time for electron tunneling between

60

the electrode and traps for DC conduction, which corresponds to a larger tunneling gap in

higher HRS.

Figure 2.22 Schematic of the DC and AC conduction processes in RRAM device.

2.4 Ion Dynamics in HfOx Based RRAM

2.4.1 Voltage-Time Relationship in Switching Dynamics

What has been discussed in Section 2.3 is related to the electronic process in either

HRS or LRS, the resistive switching between the HRS and LRS should be related to the

ionic process. There is a well-known voltage-time dilemma of the requirement for fast

switching (~ns) and long time retention (~years) in the RRAM devices [70], which means

that the switching mechanism has a very strong nonlinearity in the electric field

dependence. Here we measure the voltage-time relationship for our HfOx/AlOx RRAM

61

device during both SET and RESET process, and discuss the physical origin of such

nonlinearity which is caused by the ion dynamics.

Figure 2.23 shows the pulse amplitudes needed to successfully trigger the SET/RESET

switching at fixed pulse widths. The trigger is successful if the device meets the pre-

defined target LRS/HRS resistances. Although the switching variations are noticeable,

the general trend is that the switching time for both SET and RESET exponentially

decreases with the increase of the pulse voltage. Similar exponential voltage-time

relationship has been observed in several other RRAM devices, e.g. ZnOx [101], TaOx

[102]. The origin of this universal relationship in the switching dynamics can be

attributed to the ion migration process through the potential wells of the oxide matrix

[103]. The drift velocity of the ion migration is given by [103]:

v = a0 ∙ f ∙ exp (−Em

kT) ∙ sinh (γ ∙

a0

L∙qV

kT)(2.7)

where f is the atomic attempt-to-escape frequency (~1013 Hz [103]), Em is the ion

migration energy (~1.5-1.9 eV for HfOx RRAM [100][104]), a0 is the atom spacing in

oxides (~0.25 nm), L is oxide thickness, q is electron charge, k is Boltzmann constant.

The local field acceleration parameter γ is introduced due to the strong polarizability in

high-k dielectrics [105]. Since the ion migration velocity has a hyperbolic-sinusoidal

dependence on the electric field, the corresponding growth/rupture evolution rate of CF

also has a hyperbolic-sinusoidal dependence on the electric field. When the electric field

is large (typically during switching F>1 MV/cm), the hyperbolic-sinusoidal dependence

62

reduces to the exponential dependence, leading to the universal voltage-time relationship

observed above. It should be noted that although this exponential relationship is observed

in the typical programming timescales (~ns to ~ms), it does not hold under certain

extreme conditions. When the electric field is much smaller compared to the

characteristic field F0 = 2kT/qa0 (~10 kV/cm), the hyperbolic-sinusoidal dependence

reduces to the linear dependence. That means without bias the retention time should be

infinite if only the drift component of the ion migration is considered. However, in reality

the retention time is finite due to the diffusion component of the ion migration. When the

electric field is sufficiently large, it may be difficult to further increase the switching

speed because the limiting factor of the switching time is no longer the ion migration

process but the electrochemical reaction process, e.g. there are already plenty of O2-

available for recombination with Vo during the RESET.

63

Figure 2.23 Measured voltage-time relationship: pulse amplitudes needed to successfully

trigger the SET/RESET switching at the some fixed pulse widths. The success of the

trigger is defined to meet pre-defined target LRS/HRS resistances (20 kΩ/1 MΩ).

2.4.2 Energy-Efficient Programming Scheme for Multi-bit Operation

In general, the RRAM devices can show multi-bit operation capability. It was found

that modulation of LRS resistances by varying set compliance current was not repeatable

in the single RRAM cell without the selection transistor. The reason may be the transient

current overshoot problem during the sharp transition in the SET process. Therefore, here

we focus on the modulation of HRS resistances by controlling the RESET programming

conditions. Figure 2.24 shows the DC I-V characteristics of HfOx/AlOx RRAM by

64

different RESET stop voltages (-2.1 V, -2.7 V, and -3.3 V). Three levels of HRS were

achieved.

Figure 2.24 The DC I-V characteristics of HfOx/AlOx RRAM by a different RESET stop

voltages (-2.1 V, -2.7 V, and -3.3 V).

Then we further explore this multi-level capability in the context of pulse cycling for

practical applications (Figure 2.25). Figure 2.25 (a) shows three levels of HRS can be

obtained by fixed pulse width (50 ns) but different pulse amplitudes (-2.3 V, -2.6 V, -2.9

V). Figure 2.25 (b) shows that similar three levels of HRS can also be obtained by fixed

pulse amplitude (-2.3 V) but different pulse widths (50 ns, 500 ns, 5 µs). The equivalence

of the above two programming schemes suggests that the final states are not only

65

determined by the applied voltage but also the duration time of the RESET programming.

Therefore, two programming schemes are available to achieve multiple HRS: 1) linearly

increase the programming pulse amplitude; 2) exponentially increase the programming

pulse width. Both schemes can effectively achieve target resistance, and their

equivalence is essentially caused by the exponential voltage-time relationship as

discussed above.

(a)

66

Figure 2.25 (a) Three levels of HRS obtained by fixed pulse width (50 ns) but different

pulse amplitudes (-2.3 V, -2.6 V, -2.9 V). (b) Similar three levels of HRS obtained by

fixed pulse amplitude (-2.3 V) but different pulse widths (50 ns, 500 ns, 5 µs).

The next question is which programming scheme is a better scheme in terms of energy

efficiency. To address this question, transient voltage/current waveforms were measured

for these two programing schemes. Figure. 2.26 shows that to achieve a target resistance

~50 kΩ from an initial resistance ~10 kΩ, two pulse schemes were used: one is applying -

2.3 V/50 ns; the other is applying -2 V/500 ns. Both transient current starts from similar

value and end with similar value, indicating the effect of the two schemes is the almost

same on the cell resistance. By integrating the voltage, current and time, we obtain the

(b)

67

energy consumption for the first scheme (-2.3 V/50 ns) is about 7.4 pJ and for the second

scheme (-2 V/500 ns) is about 60.9 pJ. Therefore, using larger but shorter pulse is

preferred to achieve both faster and more energy-efficient programming.

Figure 2.26 Transient voltage/current waveform for the two RESET programming

schemes. The first scheme: -2.3 V/50 ns; The second scheme: -2 V/500 ns. The initial

resistance is ~10 kΩ, and the final resistance is ~50 kΩ. The energy consumption for the

first scheme (-2.3 V/50 ns) is about 7.4 pJ and for the second scheme (-2 V/500 ns) is

about 60.9 pJ.

68

2.5 Summary

In this chapter, we investigated the physical mechanism of resistive switching

phenomenon in binary oxides. The resistive switching is attributed to the conductive

filament (made up of oxygen vacancies) formation and rupture in the oxides. As a model

system for device physics study, HfOx based RRAM devices were fabricated and device

performances such as switching voltage and resistance distribution, switching speed,

endurance and retention were characterized. By techniques such as I-V measurement at

various temperature, LFN measurement, AC conductance measurement, the TAT was

identified to be the dominant conduction mechanism. An exponential voltage-time

relationship was found between the switching time and the applied voltage, suggesting

the oxygen ion migration dynamics. Inspired by the voltage-time relationship, an energy-

efficient programming scheme was also proposed for multi-level operation.

69

Chapter 3

RRAM Device Modeling and Simulation 3

3.1 Kinetic Monte Carlo (KMC) Modeling

In the previous chapter, we have obtained understandings of the RRAM device physics

through a series of experiments, which sets a solid foundation for our modeling work in

this chapter. As illustrated in Section 2.1, for the typical bipolar switching RRAM, during

the FORMING/SET process, O2- are pulled out from lattice and Vo are generated and the

CF forms connecting both electrodes. Then the current flow through the CF. During the

RESET process, the CF is partially ruptured by the recombination of Vo with the O2- that

migrate from the oxygen reservoir at the electrode/oxide interface, thus a tunneling gap is

formed between the electrode and the residual CF. In order to simulate the above physical

picture, our strategy is to decouple the electronic process and the ionic process. For the

electronic process, we consider that TAT is the dominant conduction mechanism at low

bias regime in both HRS and LRS, in which case the electrons are in the localized states

where Vo are far away from each other. Two additional conduction mechanisms are

taken into account as well: the F-N tunneling at high bias regime in HRS, and the

metallic conduction in LRS, in which case the electrons are partially in the extended

states where Vo are very close to each other. The metallic conduction is implemented by

3 The work in this chapter was done in part by collaboration with Dr. Ximeng Guan, who

was a post-doctoral researcher at Stanford.

70

a resistor network model composed of non-linear resistors with resistance that

exponentially depends on the distance between adjacent Vo. For the ionic process, we

discrete the oxide matrix and track the evolution path of every Vo and O2- stochastically.

Figure 3.1 shows the developed KMC simulation flow: Starting from an initial Vo

configuration, the total current consisting of TAT current, F-N current, and metallic

current is first calculated. Then the local field and local temperature are calculated by the

resistor network and the Fourier heat transfer equation, respectively. Finally, the Vo

configuration is updated using a Monte Carlo method by calculating the event probability

of the Vo generation/recombination and O2- migration based on the activation energy of

these processes. The local temperature and local field play an important role for

determining the CF shape because these event probabilities are both field and temperature

dependent. The simulations are performed on a 2D matrix, and the parameters are

calibrated with the experimental data from our HfOx based RRAM.

Figure 3.1 The KMC simulation flow

71

3.1.1 TAT solver

Figure 3.2 shows the TAT process in the oxide RRAM.

Figure 3.2 The TAT process in the oxide RRAM.

The electron current continuity equation for TAT is given by Eq. 3.1 [106]:

dfn dt⁄ = (1 − fn)∑ rnmfmm − fn ∑ (1 − fm)rmn + Rcn(1 − fn) − Rnafnm (3.1)

where fn is the nth trap’s electron occupancy probability, rmn is the hopping rate from

the mth trap to the nth trap, rnm is the hopping rate from the nth trap to the mth trap, Rcn is

the tunneling rate from the cathode to the nth trap, Rna is the tunneling rate from the nth

trap to the anode, and ∑m stands for the summation over all the traps except the nth trap.

Here the hopping rate between traps takes the Mott hopping form [107] of Eq. 3.2:

rhopping = ν0exp(−2d/ξ + q∆V/kT) (3.2)

72

d is the distance between the two traps, ξ is the electron wavefunction localization

length ~0.3 nm, ν0~1013 Hz, q is the electron charge, ΔV is the voltage difference

between the two traps, kT is the thermal energy. Here the tunneling rate from the cathode

to the trap (Rc) and from the trap to the anode (Ra) is calculated by the WKB

approximation form in Eq. 3.3 and Eq. 3.4:

Rc = ν0 ∫ Ffermi(E)exp(−2/ℏ∫ √2m∗(Eb − E − qV(x)L

0)dx)

∞

Et−qV(L)dE (3.3)

Ra = ν0 ∫ (1 − Ffermi(E))exp(−2/ℏ∫ √2m∗(Eb − E + q(V − V(x)L

0)dx)

Et−qV(L)

−∞dE

(3.4)

E is the energy with respect to the Fermi-level in the electrode, Eb is the

electrode/oxide interface barrier height, Et is the trap energy below the conduction band

of the oxide, L is the distance between the trap and the electrode, m* is the effective mass

for HfO2 ~ 0.1 m0 , ν0~1014 Hz, ℏ is the reduced Planck’s constant, V(x) is the voltage

potential at the distance x from the electrode, V is the external applied voltage.

At the steady state, dfn dt⁄ = 0, and we need to self-consistently solve the m non-

linear equations like Eq. 3.1 (m is the total number of the traps in the oxide). The

Newton-Raphson iteration scheme was used for solving such series of non-linear

equations. After solving these equations, we obtain the electron occupancy probability at

each trap. Then the current flowing through the cell can be calculated by evaluating the

electron flux through a cross section, e.g. from the cathode to all the traps in the oxide:

I = −q∑ (Rcn(1 − fn)n − Rnafn) (3.4)

73

First let us apply the TAT solver for a 1D Vo chain. Consider the case in which there is

only one chain of Vo between the two electrodes of an RRAM cell. Assuming that the Vo

chain is connected with the anode, while it is disconnected from the cathode with a

tunneling gap. The electron from the cathode must tunnel through the gap before it hops

through the Vo chain to the anode on. Therefore the tunneling from the cathode to the

first Vo in the Vo chain across the gap becomes the bottleneck of the conduction. Since

the tunneling probability depends on the gap size exponentially, the current through the

cell has an exponential dependence on the gap distance, which is verified by the

simulation results of the TAT-solver (Figure 3.3). Therefore, any variation in the gap

distance is expected to have a significant impact on the HRS resistance.

Figure 3.3 I-V of 1D Vo chain for different gap distances simulated by TAT solver. The

inset shows current exponentially decreases with increasing gap distance, which is the

main cause of the HRS variation.

74

Furthermore, we examined the electron occupancy probability of the traps for this 1D

Vo chain case. Figure 3.4 shows the electron occupancy probability in the traps (a) for a

complete Vo chain connecting both electrodes (representing LRS) and (b) for a ruptured

Vo chain with a gap (representing HRS). It is observed that an increasingly negative bias

on the cathode drives the electrons to the anode. In other words, the traps near the

cathode tend to be electron-depleted in order to accept sufficient electron flux from the

cathode to maintain the current continuity. The depleted Vo becomes more positively

charged, hence it increases the capture cross-section with O2-, thus the recombination

becomes more favorable. Therefore the Vo near the cathode have a higher probability to

be recombined during the RESET, thus the CF tends to rupture near the cathode [107].

Figure 3.4 The electron occupancy probability in the traps in the 1D Vo chain (a) for a

complete Vo chain connecting both electrodes (representing LRS) and (b) for a ruptured

Vo chain with a gap (representing HRS).

(b)

(a)

75

3.1.2 Resistor Network

A resistor network model was developed to calculate the metallic component of the

current and also to estimate the local field and local temperature profile. Figure 3.5 shows

the schematic of the resistor network: each node in the network represents an oxygen

atom site in the oxide matrix. The node can be a red Vo site or a blue lattice oxygen (OL)

site. Then the resistors between the two nodes are defined in this exponential way:

R =R0

n∙ exp(n ∙

a0

d0) (3.5)

where R0 (105 Ω) and d0 (0.15 nm) are fitting parameters for characteristic resistance

and distance, a0 is the atom spacing (0.25 nm), and n stands for the nearest Vo at the nth

atom spacing away. After constructing the network, the Kirchhoff Voltage Law (KVL) is

solved on the network, which gives the metallic current and the local field profile.

Figure 3.5 Schematic of the resistor network: each node in the network represent an atom

site in the oxide matrix. The node can be a red Vo site or a blue lattice oxygen (OL) site.

76

Based on the resistor network’s local voltage/current profile, the local power

dissipation map P(x,y) can be constructed by calculating the local Joule heating on the

resistors. Then the local temperature profile (T) can be solved by the Fourier heat

equation:

dT

dt= κ(

∂2T

∂x2+

∂2T

∂y2) +

P(x,y)

Cth (3.6)

where κ is the thermal diffusivity, and Cth is the thermal capacitance. At steady state,

dT/dt=0. The 2D simulation was done in a 20 nm (wide) × 10 nm (thick) cell. Figure 3.6

shows an example of the simulated local electric potential in HRS with a gap between top

electrode and residual CF. The field is enhanced at the gap region, thus the CF tends to

reconnect there during the next SET cycle. Figure 3.7 shows an example of the simulated

local temperature in LRS when CF connects both electrodes. The elevated temperature

due to Joule heating enhances the O2- migration during the next RESET cycle. The

temperature rise is only noticeable in LRS and the heat generation in HRS is negligible.

Figure 3.6 An example of the simulated local electric potential in HRS with a gap

between top electrode and residual CF.

77

Figure 3.7 An example of the simulated local temperature in LRS when CF connects both

electrodes.

The local potential profile is forwarded as an input to the TAT solver. Figure 3.8

shows the simulated I-V curve for a device with gap ~1 nm considering multiple

conduction mechanisms contribute to the total current. The F-N current only plays a role

at a high negative bias.

78

Figure 3.8 Simulated I-V curve of a 1D Vo chain in RRAM cell with 1 nm tunneling gap

considering multiple conduction mechanisms.

3.1.3 KMC Module for Ionic Process

What we described above is how we model the electronic process in the RRAM. In

order to model the ionic process, we discretize the oxide into a grid and track the

evolution path of every Vo and every O2- in this grid. The event probability (P) of the Vo

generation/recombination and O2- migration is calculated based on the activation energy

(Ea) of these processes and the local field (F) and local temperature (T) in this way:

P(F, T, t) =t

t0exp(−

Ea−γa0F

kT) (3.7)

79

where t is the simulation time step (the time step should be small enough to ensure that

the P <<1), t0 is the atomic vibration time constant (~10-13 s [103]). Ea is the activation

energy, and it may have different values for Vo generation/recombination and for O2-

migration with the typical range 0.5 eV~2 eV, a0 is the atom spacing in oxides (~0.25

nm), k is Boltzmann constant. The local field acceleration parameter γ is introduced due

to the strong polarizability in high-k dielectrics [105]. To determine these probabilities,

the local field and local temperature profile obtained from the resistor network model is

used as an input to the KMC module. Then a Monte Carlo method is used to determine

the success of these events, and the Vo configuration is updated accordingly. Since we

trace every Vo and every O2-, we need to set up and maintain a list of the Vo’s

coordinates and a list of the O2- coordinates in the oxide grid. Figure 3.9 shows the

algorithm of KMC module.

80

Figure 3.9 The algorithm of KMC module.

1. Determine new Vo generation: Scan the oxide grid, if there is no Vo on the site,

calculate the Vo generation probability Pg, and then throw in a random number R

ranging (0,1), if R falls into (0, Pg), then a new Vo is generated; add this Vo into

the list of Vo. Meanwhile a new O2- is also generated; then determine the O2-

migration according the algorithm in the footnote below, and add this O2- into the

list of O2-.

2. Determine Vo and O2- recombination: Scan the list of O2-, and check the list of

Vo to see if they share the same coordinate; if yes, calculate the Vo

recombination probability Pr, and then throw in a random number R ranging

(0,1); if R falls into (0, Pr), then this Vo and this O2- annihilates, delete them in

the list of Vo and the list of O2-; if R falls into (Pr, 1), determine the O2- migration

according the algorithm in the footnote below, and update the list of O2-.

3. Determine O2- migration: Scan the list of O2-, and check the list of Vo to see if

they share the same coordinate; if no, determine the O2- migration according

algorithm in the footnote below, and update the list of O2-.

Footnote: calculate the O2- migration probabilities toward 4 directions (Pleft, Pright,

Pup, Pdown), throw in a random number R ranging (0,1), if R falls into (0, Pleft), O2-

migrates to the left adjacent site; if R falls into (Pleft, Pleft+ Pright), O2- migrates to the

right adjacent site; if R falls into (Pleft+ Pright, Pleft+ Pright+ Pup), O2- migrates to the up

adjacent site; if R falls into (Pleft+ Pright+ Pup, Pleft+ Pright+ Pup+ Pdown), O2- migrates to

the down adjacent site; if R falls into (Pleft+ Pright+ Pup+ Pdown, 1), O2- remains at the

current site.

81

By integrating the TAT solver, resistor network, and KMC module, our KMC

simulator can simulate many experimental observations. In the following, we utilize our

simulator to study the current overshoot, variability, and reliability problems in RRAM

device.

3.1.4 Simulation of Current Overshoot

Current overshoot in RRAM refers to the experimental observation that the RESET

current is larger than the FORMING/SET compliance current [108]. Figure 3.10 shows

the simulated I-V with overshoot effect for three cases: 1) FORMING with a RC time

constant τ=100 ns; 2) FORMING with τ=10 ns; 3) SET with τ=100 ns. (Note 1 pF

parasitic capacitance and 100 kΩ LRS can lead to 100 ns τ). Figure 3.11 shows the

simulated voltage drop on the RRAM as a function of time during the overshoot period.

Due to the parasitic capacitance, the voltage dropped on RRAM cannot follow the abrupt

resistance change. Instead, it exponentially decays after the compliance current is

reached. Figure 3.12 shows the simulated current during the overshoot period. It is seen

that the transient current of case 1) overshoot to a level higher than the pre-defined

compliance current.

82

Figure 3.10 Simulated I-V with overshoot: 1) FORMING with RC time constant τ=100

ns; 2) FORMING with τ=10 ns; 3) SET with τ=100 ns. The compliance is 10 µA.

Figure 3.11 Simulated voltage drop on the RRAM as a function of time during the

overshoot period.

83

Figure 3.12 Simulated current during the overshoot period. The compliance is 10 µA.

To understand what happens inside the RRAM cell during the overshoot period, the Vo

configuration of the three cases are tracked. Figure 3.13 shows the evolution of the CF

shape for the FORMING/RESET/SET cycles using KMC simulator. For a fresh sample, a

few intrinsic Vo (in pink color) exist at the grain boundary (subfigure a). Then we apply

FORMING conditions on the RRAM cell. At the onset moment for case 1), the O2- (in

blue color) start migrating towards the top electrode which is positively biased (subfigure

b). At the end of the overshoot period, the CF grows wider laterally (subfigure c),

suggesting that more Vo are undesirably generated due to the high voltage and high

temperature during the overshoot period. At the onset moment for case 2), similarly the

O2- (in blue color) start migrating towards the top electrode (subfigure d). At the end of

84

the overshoot period, the lateral growth of CF is limited (subfigure e), suggesting that the

smaller τ effectively shortens the high voltage and high temperature period, thus the

overshoot effect is suppressed. For case 1), we apply the RESET conditions on the

RRAM cell. O2- at the interface migrate back and partially rupture CF (subfigure f). Then

we continue applying SET conditions on the RRAM cell, which is the case 3). At the

onset moment for case 3), only a portion of the CF tip is connected to the electrode

(subfigure g). At the end of the overshoot period, the lateral growth of CF is also limited

(subfigure h), suggesting that the overshoot in the SET process is also suppressed as

compared with the FORMING process due to a lower voltage across the RRAM cell

during the overshoot transient period, although the overshoot period length is the same as

case 1). To summarize, eliminating the FORMING process and decreasing the parasitic

capacitance by integrating the RRAM cell with a selection transistor or a current limiter

with a saturating I-V characteristics is helpful for overcoming the overshoot problem.

85

(a) (b)

(c) (d)

(e) (f)

86

Figure 3.13 The evolution of the CF shape for the FORMING/RESET/SET cycles using

KMC simulator for (a) fresh sample; (b) Onset moment of FORMING with τ=100 ns; (c)

LRS after FORMING with τ=100 ns; (d) Onset moment of FORMING with τ=10 ns; (e)

LRS after FORMING with τ=10 ns; (f) RESET with τ=100 ns; (g) Onset moment of SET

with τ=100 ns; (h) LRS after SET with τ=100 ns. Pink dots are Vo. Blue dots are O2-.

3.1.5 Simulation of Variability

Our KMC simulator can continuously cycle the RRAM devices to obtain the statistics

of the switching. Figure 3.14 shows the simulated I-V curves during the repeated 100-

cycle DC sweep. The switching variations are well reproduced. Figure 3.15 shows (a) the

simulated resistance distribution and (b) switching distribution during the repeated 100-

cycle DC sweep. The simulation data here aims to reproduce the experimental data in

Figure 2.4.

(g) (h)

87

Figure 3.14 Simulated I-V curves during the repeated 100-cycle DC sweep.

88

Figure 3.15 Simulated (a) resistance distribution and (b) switching distribution during the

repeated 100-cycle DC sweep.

(b)

(a)

89

In the experimental data in Figure 2.4 (a) and the simulation results in Figure 3.15 (a),

it is observed that the LRS variation is less than the HRS variation. This is probably

because in HRS the current is exponentially dependent on the ruptured CF length, thus

any small variation of the tunneling gap distance may be magnified to be a large variation

of the HRS resistance. Therefore, we see remarkable tail bits (bits with lower resistance)

in the HRS resistance distribution as highlighted in Figure 3.15 (a). We correlate these

tail bits with their corresponding Vo configuration in the KMC simulator, and reveal that

the tail bits correspond to the existence of Vo in the gap region. Figure 3.16 shows an

example of the existence of Vo in the gap region which causes a tail bit in HRS.

Experimentally, we found the evidence of new Vo generation in the gap region during the

RESET. Figure 3.17 shows the transient current waveform measured during the RESET.

It is seen that a significant current fluctuation is present in the transient waveform, which

is caused by a competition between the simultaneous Vo generation and recombination

processes. Although the CF is partially ruptured, the high electric field present in the gap

region may also induce Vo generation. This is especially prominent at the beginning

stage of the current drop when the gap size is small and the field and temperature are both

high. The generated Vo in the gap tends to reconnect the ruptured filament with the

electrode, giving rise to a current jump. If the Vo generation occurs in the middle of a

programming pulse, it has the chance to be recombined again with the mobile oxygen

ions. But if the Vo generation occurs just before the end of a programming pulse, it has

no chance to be recombined again since the pulse is withdrawn afterwards (see the

90

experimentally observed last current jump in Figure 3.17). Therefore Vo are left in the

gap region and act as a bridge for the electron tunnel from the electrode to the residual of

the CF. To reduce the tail bits in HRS, the write-verify technique (write the cell first and

then read, if the cell resistance does not meet the resistance criterion, write it again) is

usually used [109].

Figure 3.16 An example of the existence of Vo (highlighted by red circle) in the gap

region which causes a tail bit in HRS.

91

Figure 3.17 The transient current waveform measured during the RESET with the last

current jump indicating a Vo left-over in the gap region.

3.1.6 Simulation of Reliability

Our KMC simulator is also capable of simulating the reliability. Figure 3.18 shows the

simulated endurance cycling at 125 °C. The reason that we chose an elevated temperature

is to accelerate the endurance degradation thus shorten the simulation time. During the

SET/RESET cycling, the HRS resistance decreases with cycling, which reflects the

experimental trend in Figure 2.6. Figure 3.19 shows the Vo configuration for the final

failure state (stuck at LRS). It shows there are insufficient O2- for recombining Vo at the

interface. During each SET cycle, O2- rushes towards the top electrode (TiN in our case),

92

and O2- that leak to the top electrode such as TiN may bond with the TiN to form TiON

layer thus cannot migrate back [110].

Figure 3.18 Simulated endurance cycling at 125 °C.

Figure 3.19 The Vo configuration for the final failure state (stuck at LRS).

93

Our simulator can also project the retention behavior versus time. Figure 3.20 shows

the statistically simulated LRS retention of 100 RRAM cells @150 °C. The resistance

gradually increases due to the gradual dissolution of CF. Figure 3.21 shows the LRS

failure time distribution at three baking temperatures. The LRS resistance increases

gradually during a high temperature baking test. The slope of extracted mean failure time

vs. 1/kT agrees well with O2- migration barrier parameter (~1.3 eV) used in the

simulation, which suggests that the LRS retention failure is caused by the O2- migration

from the interfacial oxygen reservoir. When the LRS fails, the tip of CF is ruptured by

O2- migration back from the interface.

Figure 3.20 Statistically simulated LRS retention of 100 RRAM cells @150 °C.

94

Figure 3.21 Simulated LRS failure time distribution at three baking temperatures.

3.2 Compact Modeling

Although the KMC simulator can reproduce many experimental observations, the

simulation is quite time-consuming since it is a numerical simulator. In order to enable

circuit/system level simulation, a compact model that can run fast while maintaining the

essential physics is useful. Therefore, we simplify the physical picture to be 1D filament

as shown in Figure 3.22.

95

Figure 3.22 Schematic of conductive filament with oxygen vacancies. The tunneling gap

distance g determines the device resistance.

The tunneling gap distance (g) determines the device resistance because quantum

tunneling through this gap region is the dominant conduction mechanism and the

resistance exponentially increases with the gap g. And g can be viewed as the average

tunneling gap distance, representing a state variable for the device. Therefore, the device

I-V relationship can be empirically expressed as:

I = I0 ∙ exp(−g g0⁄ ) ∙ sinh(V/V0) (3.8)

Here the “sinh” function is chosen to fit the linear dependence at low bias and

exponential dependence at high bias typically observed in experiments. The experimental

data from TiN/TiOx/HfOx/TiOx/HfOx/Pt RRAM device [111] was used for parameter

calibration. Fitting parameters I0 (~1 mA), g0 (~0.25 nm) and V0 (~0.25 V) are

determined by fitting with I-V curves at multiple resistance states (Figure 3.23). The

frontline of filament or the gap g evolves obeying the O2- migration velocity, which

96

originates from the O2- migration from one potential well to other in the following way

[103]:

dg

dt= −v0 ∙ exp(−

Em

kT) ∙ sinh(γ ∙

a0

L∙qV

kT) (3.9)

where Em is the O2- migration energy (~0.6 eV), a0 is the atom spacing (~0.25 nm), L is

the oxide thickness (~12 nm), q is the electron charge, k is the Boltzmann constant, and

v0 is a fitting parameter (~10 nm/ns). The local field acceleration parameter γ is

introduced due to the strong polarizability in high-k dielectrics as mentioned before, and

it is empirically fitted to be g dependent: γ = γ0 − β ∙ g3 where γ0 (~16) and β (~0.8) are

fitting parameters. In this way, γ increases as the frontline of filament gets closer to the

electrode and vice versa, thus it causes a positive feedback of the filament growth and a

negative feedback of the filament dissolution, leading to an abrupt SET process and a

gradual RESET process. T is the local temperature around filament due to Joule heating,

and the simplified heat equation follows:

dT

dt=

T−T0

τth+

V∙I

Cth (3.10)

where T0=298 K, τth is the thermal time constant, and Cth is the thermal capacitance.

Here we assume steady state, and then Rth= τth×Cth is equivalent thermal resistance

(~2000 K/W). The parameters in Eq. 3.9 and Eq. 3.10 are fitted by gradual RESET

transition curves by pulse train with different amplitudes (Figure 3.24). To capture the

stochastic RESET process, a Gaussian random number (δg) that accounts for the

randomness of oxygen ion migration is introduced:

dg = dg(ideal) + δg (3.11)

97

dg(ideal) is the gap distance calculated by Eq. 3.9, and the relative resistance

variability is given by δR R⁄ = δg g0⁄ , to match the experimentally measured variability

in Figure 3.24, δg ~0.0224 nm, leading to δR/R ~9%.

Figure 3.23 The I-V fitting of multi-level resistance states by varying the gap tunneling

distance in the compact model.

98

Figure 3.24 The gradual RESET transition curves by pulse train with different amplitudes.

With these analytical equations, we have the capability to implement the compact

model by MATLAB code or through the use of the SPICE engine in the form of

equivalent circuits [112] or Verilog-A codes for circuit/system level simulations 4.

3.3 Discussions and Summary

In this section, we discuss the comparison of our RRAM device model with other

models in the literature. Our model was developed for typical bipolar switching RRAM,

4 The RRAM compact models in the Verilog-A format can be downloaded from

https://nano.stanford.edu/model.php

https://nano.stanford.edu/model.php

99

and was calibrated with the experiments from HfOx based RRAM. We believe with

appropriate parameter fitting and adjustment, our model can also apply to other bipolar

RRAM devices, such as AlOx, ZrOx, etc. In the literature, there are many other RRAM

device models. Here we briefly review these models:

For unipolar switching, Ielmini’s group at Politecnico di Milano did a series of early

works on NiO RRAM based the thermal dissolution model [113][114][115]. The thermal

dissolution model postulates that the RESET of unipolar switching is driven by the self-

accelerated dissolution of the CF due to the Joule heating. This suggests that the lateral

shrinking of the diameter of the CF is caused by the Vo diffusion at raised temperature.

Later, the Kang group at Peking University proposed that thermal decomposition of O2-

from oxygen-rich clusters around CF and their recombination with Vo causes the rupture

of the CF in the unipolar NiO RRAM [83][116]. Ielmini’s model was the first one that

highlights the role of Joule heating in the switching, and indeed all the ionic processes are

thermally enhanced by the Joule heating in the RESET. Kang’s model revealed the role

of the oxygen source in the unipolar switching, and it could be unified with the bipolar

switching under the same general physical picture.

For bipolar switching, especially the HfOx bipolar RRAM, there are three major

models in the literature, the first one is our model (developed in collaboration with

Kang’s group at PKU), the second one is Larcher’s model from Universita di Modena

(with collaboration with SEMATECH) [117][118], and the third one is Degraeve’s model

100

from IMEC [119]. Below we discuss the common features and differences between these

three models.

For the electron conduction, our model assumes that in the HRS there is a bottleneck of

tunneling gap in a region near the top electrode with the residual CF conduction carried

by TAT; and in the LRS, the TAT and metallic conduction mechanisms co-exist.

Larcher’s model also assumes that in the HRS, TAT dominates the conduction with the

residual CF assumed to be metallic; and in the LRS, Larcher’s model assumes that the

conduction is purely metallic. Both models calculate the TAT current in the HRS, the

difference is that our model calculates the contribution of all the traps to the current,

while the Larcher’s model only considers the slowest trap/detrap process; in other words,

there is a dominant trap. For this dominant trap, the inelastic tunneling contribution

through phonon capture and emission is calculated. Degraeve’s model adopts a quantum

point contact (QPC) approach to describe conduction, in which the two electrodes are

separated by a constriction with variable width. The I-V relation in HRS and LRS are

fitted by changing the potential barrier’s parameters at the constriction. Although this

QPC approach may fit well with the I-V relation, it lacks a clear link with the physical

dimensions or shape of the CF.

For the ionic process and the interpretation of resistive switching, both our model and

Larcher’s model speculate that during the SET, Vo are generated and O2- migrate towards

the top electrode which is positively biased during SET, and during the RESET, the O2-

migrate back from the electrode/oxide interface to recombine with the Vo and rupture the

101

filament. The difference between the two is that our model predicts that the CF is

ruptured near the top electrode, because the traps near the cathode (which is the top

electrode during RESET) tend to be electron-depleted as indicated by the TAT solver

(and discussed earlier in Fig. 3.4). Thus the depleted Vo becomes more positively

charged, leading to an increase of the capture cross-section with O2-, thus the

recombination becomes more favorable in this region. In contrast, Larcher’s model

assumes that the CF is ruptured near the bottom electrode, because according to their

simulation the trap density near the bottom electrode is lower based on the argument that

the phonon energy is mostly released in the region near the top electrode when the

electrons are injected from the bottom electrode. This lower trap density leads to a higher

resistance in this region and causes a higher temperature during the RESET. Therefore,

the CF tends to rupture at this high temperature region. Degraeve’s model is very

different than the other two models. In this model, there is no generation/annihilation of

Vo, and Vo just migrate inward and outward of the constriction region, which is in the

middle of the oxide layer. SET is a lateral growth of CF and RSET is a lateral shrink of

CF. Therefore, it is referred as an “hour glass” model.

So far, there is no solid evidence to support any of these above models. Dedicated in-

suit TEM study with EELS analysis may help identify the rupture location of the CF

during the RESET. Nevertheless, we have established a platform of a KMC numerical

simulator and a compact model of RRAM, which can reproduce many experimental

observations, gain more insights about the RRAM device physics and provide guidelines

102

for future RRAM optimization. The KMC model can simulate the DC I-V sweep, pulse

switching, endurance cycling, and retention baking, etc. The tail bits in the resistance

distribution are attributed to the oxygen vacancy left over in the gap region due to a

competition between the oxygen vacancy generation and recombination. The compact

device model captures the essential device physics, but remains analytical for fast

simulations in the SPICE simulator for the circuit/system level design.

103

Chapter 4

Oxide Based Synaptic Device for

Neuromorphic Computing

4.1 Background

In the memory hierarchy of today’s von Neumann digital system, the increasing gap

between the caches and the non-volatile storage devices in terms of write/read speed has

become the performance bottleneck of the whole system. Bio-inspired neuromorphic

computing breaks this von Neumann bottleneck because it takes the advantage of massive

parallelism that comes from the distributed computing and localized storage in networks

[120][121]. Neuromorphic computing is also inherently error-tolerant, thus it is especially

attractive for applications such as image or speech recognition which involve a huge

amount of correlated input data in a changing and indeterministic enviroment [122]. The

most advanced neuromorphic computing systems today are implemented by artificial

neural netwoks in software. For example, the IBM team performed a cortical simulation

at the complexity of the cat brain on Blue Gene supercomputer, which required huge

amount of computation resources:147,456 microprocessors and 144 TB of memories

consuming a power of 1.4 MW [123]. The parallelism of a multi-core computer pales in

comparison to the highly distributed computing in 1011 neurons and 1015 synapses in the

human brain. As an alternative approach, the hardware implementation of neuromorphic

computing may physically reproduce the parallelism on chip. Previously, neuromorphic

system in hardware with both neurons and synapses was implemented by CMOS circuits

104

[124]. The scaling-up of these systems are mainly constrained by the device density and

energy consumption of the synapses since there are thousands of synapses connecting to

one neuron. Previously, each synapse was implemented with quite a few transistors, e.g.

the 8-T SRAM cells [125] that occupies a huge area (>100 F2, F is the minimum feature

size of the lithography technology) and consumes substantial static power. Recently, two-

terminal emerging memory devices that show electrically-triggered resistive switching

phenomenon have been proposed as artificial synapse [126]. These emerging memories

have the advantage of a small cell area (4F2, and 4F2/m if 3D stackable, m is the number

of 3D stack layer). In the literature, Ge2Sb2Te5 based PCRAM [127][128], Ag/a-Si [129],

Ag/Ag2S [130] based CBRAM, and TiOx [131], WOx [132] based oxide RRAM have

been reported showing synaptic behaviors. Among these candidates, oxide RRAM is

attractive for large-scale demonstration of a neuromorphic system due to a relatively

lower energy consumption (as compared to the phase change memory), the compatibility

with CMOS technology and the potential for 3D integration [64] [65]. Mb-scale to Gb-

scale prototype oxide RRAM chips have been demonstrated recently [38][39][40].

Therefore, a hybrid neuromorphic system with CMOS neurons and oxide RRAM

synapses integrated on top of CMOS neurons at the metal interconnect layers can be

envisioned. Figure 4.1 shows an analogy between the biological synapse and the artificial

oxide synaptic device: the biological synapse changes its conductance by

activating/deactivating ion channels between the membrane and the synaptic junction

when the action potential arrives, while the oxide synaptic device changes its resistance

105

by generation and migration of the Vo and O2- when the programming voltage pulse is

applied. The vision is that the parallel neural network can be emulated by the cross-point

RRAM array for large-scale integration.

Figure 4.1 An analogy between the biological synapse and the artificial oxide synaptic

device. The vision is that the parallel neural network can be emulated by the cross-point

RRAM array for large-scale integration.

4.2 Synaptic Plasticity and Learning

The neurons and synapses are the two basic computational units in the brain. Neuron

computation is performed by integrating the inputs coming from other neurons and

generating spikes as a result. The synapses contribute to the computation by changing

106

their connection strength as a result of neuronal activity, which is known as synaptic

plasticity. Synaptic plasticity is the mechanism that is believed to be underlying the

learning and memory formation of the brain [133]. The concept of synaptic plasticity

has been greatly influenced by Hebb’s postulate stating that the connection strength

between neurons are modified based on neural activities in pre-synaptic and post-synaptic

neurons [133]. The translation of the Hebbian learning rule into the hardware suggests

that the synaptic devices should serve as programmable weights between neurons [134].

In late 1990s, a form of Hebbian learning, spike-timing-dependent plasticity (STDP)

attracted tremendous interest in both experimental and computational fields of

neuroscience [134]. Figure 4.2 shows the asymmetric form of STDP which states that the

synaptic weight depends on the relative timing of arrival of the pre- and post-synaptic

spikes [135]: the synapse potentiates (increase in synaptic weight or conductance) if pre-

synaptic spike precedes post-synaptic spike repeatedly; and the synapse depresses

(decrease in synaptic weight or conductance) if post-synaptic spike precedes pre-synaptic

spike repeatedly. STDP can be generalized as a mechanism, which regulates the

connection strength of neurons depending on their co-activity and spike timing. Owing to

its simplicity, biological plausibility and computational power, STDP has been widely

used in computational neuroscience for pattern recognition, temporal sequence learning,

coincidence detection, navigation, and orientation selectivity.

107

Figure 4.2 The asymmetric form of STDP states that the synaptic weight depends on the

relative timing of pre- and post-synaptic spikes [135]. ΔG = (Gafter-Gbefore)/Gbefore. Gbefore

is the conductance before the pre and post spike pair, and Gafter is the conductance after

the pre and post spike pair.

We are interested in how to realize the synaptic plasticity in the oxide RRAM based

synaptic device. The lowest requirement is that the RRAM device can act as

programmable weights for Hebbian learning. The programmable weights should have

multi-level resistance states. A higher requirement is to emulate the STDP learning rule,

thus the timing information should be incorporated into the plasticity by an appropriate

signaling scheme. In the following, we explore the multi-level capability of our

HfOx/AlOx RRAM device for serving the programmable weights, and we design a pulse-

108

train with varying amplitudes for implementing the STDP learning. The fabrication of the

HfOx/AlOx RRAM can be referred to Section 2.2.

The practical application of synaptic devices would place the synaptic device under

spike communication. Therefore, we investigate the multi-level capability of the RRAM

under pulse programming. Figure 4.3 (a) shows the resistance evolution of the device for

the first 100 pulse cycles. The pulse widths were fixed to be 50 ns. Starting from the

LRS, the RESET process was first performed: 5 pulses with amplitudes consecutively

increased from -2.2 V to -2.6 V with -0.1 V step were applied. The RRAM resistance was

thus gradually increased from around 10 kΩ to several hundred kΩ. Then the SET

process was performed: 5 pulses with the amplitudes consecutively increased from 1.4 V

to 1.8 V with 0.1 V step were applied. The RRAM resistance was decreased from several

hundred kΩ to around 10 kΩ. After this 10-pulse sequence, the cell returned to the

original LRS. This up-down cycling could be repeated for many times. It is noted that

there are noticeable resistance variations in the cycling, and also the RESET process is in

general more gradual than the SET process (the SET has fewer intermediate states). This

observation is in agreement with the abrupt SET and gradual RESET in the DC I-V

characteristics (see Figure 2.3). Figure 4.3 (b) shows the similar up-down cycling after

the device underwent the 105- cycle endurance test (±3 V/50 ns for SET/RESET).

Although the highest resistance value tends to decrease a bit, the device still retains the

multi-level capability.

109

Figure 4.3 The up-down cycling of the HfOx/AlOx synaptic device for (a) the first 100

cycles and for (b) the 100 cycles after 105 endurance testing cycles. The pulse widths

were fixed to be 50 ns. The pulse sequence is 5 pulses increased from -2.2 V to -2.6 V

with a step -0.1 V and then 5 pulses with increased from 1.4 V to 1.8 V with a step 0.1 V.

(a)

(b)

110

In order to implement the STDP learning rule, we develop a signaling scheme that

utilizes the pulse amplitude to tune the resistance states. The generation of these specific

signals is done by the corresponding neuron circuit design in practical applications. Let

us first examine how the pulse amplitude in a single pulse can effectively tune the

resistance states. Figure 4.4 shows resistance modulation of 10 states achieved by varying

the amplitudes of positive pulses and negative pulses from an intermediate initial state

(200 kΩ ~ 300 kΩ, the shaded region in the plot). As we can see, for a linear increase of

positive pulse amplitude from 1.6 V to 2V, the resistance gradually decreases to around

30 kΩ. For a linear increase of the negative pulse amplitude from -2.4 V to -2.8 V, the

resistance gradually increases to around 3 MΩ. To exploit the capability of the pulse

amplitude modulation for STDP, the shape of the spike is designed as in Figure 4.5. The

spike consists of a series of single pulses in consecutive timeslots. A negative pulse

occupies the first timeslot; then positive pulses with decreasing amplitudes follow in

subsequent timeslots. The design principle is that the no single pulse in any timeslot can

affect the device’s resistance. Only when there is an overlap of the pre-spike and post-

spike, a programming pulse can be generated for which the amplitude is large enough to

modulate the resistance. The voltage dropped on the synapse is defined as the voltage of

the pre-spike minus the voltage of the post-spike. Thus, the negative pulse in the first

timeslot acts as an enabling function to determine which positive pulse could contribute

to the programming pulse. Closer spike timing means the negative pulse would overlap

positive pulse with larger amplitude, leading to a larger resistance change. If the pre-

111

spike precedes the post-spike, a positive programming pulse (the red one in the figure) is

produced. Otherwise, a negative one (the blue one in the figure) is produced. An example

design of the spike is outlined here: The period for each pulse in one spike is designed to

be 1 µs. The duty cycle for each positive pulse can be very short, e.g. 50 ns. However, the

duty cycle for the first negative pulse should be relatively longer, e.g. 500 ns, since it acts

as an enabling function that must tolerate the timing variability caused by the RC delay

through the interconnect. Therefore the total time for one spike is 6 µs, corresponding to

a maximum operating frequency is 167 kHz. Using the signaling scheme explained

above, the STDP-like curve could be reproduced in the RRAM based synaptic devices as

Figure 4.6. The relative conductance changes were calculated with the data in Figure 4.4.

Figure 4.4 The HfOx/AlOx synaptic device resistance modulation by the pulse amplitude

starting from an initial state with an intermediate resistance value (200 kΩ ~ 300 kΩ, the

shaded region in the plot). Each error bar consists of 50 independent tests.

112

Figure 4.5 Spike-timing-dependent plasticity (STDP) realization schemes developed with

time-division multiplexing and pulse amplitude modulation. The pulse amplitudes for the

pre-spike are -1.4 V, 1 V, 0.9 V, 0.8 V, 0.7 V, 0.6 V, consecutively, and for the post-

spike are -1V, 1.4 V, 1.3 V, 1.2 V, 1.1 V, 1V, consecutively.

Figure 4.6 The STDP-like curve calculated with the data in Figure 4.4 employing the

signaling schemes in Figure 4.5

113

4.3 Analog Synapse and Binary Synapse for WTA Network

In the previous section, it was shown that by varying the pulse amplitude, up to 10

resistance states could be obtained in HfOx/AlOx synaptic devices; however, the pulse-to-

pulse variation was significant. Here we investigate a TiN/TiOx/HfOx/TiOx/HfOx/Pt stack

as synaptic device because it has better switching uniformity. This multi-layer stacked

device is also forming-free without requiring an initial large voltage to trigger the

subsequent switching events. The device fabrication and memory performance

characterization can be referred to [111]. It was shown that with this multiple oxide layer,

the cycle-to-cycle and device-to-device variation is significantly reduced [111]. Figure

4.7 shows the DC I-V characteristic of the TiOx/HfOx/TiOx/HfOx synaptic device. Multi-

level resistance states in this measurement were achieved by varying the RESET stop

voltages. It is seen that the SET process is abrupt while the RESET process is gradual.

This is because the SET process is a positive feedback between the Vo generation rate

and the temperature and local field strength, while the RESET process is a negative

feedback. Thus we utilized the gradual RESET process as analog synapse since it can

obtain multiple immediate states as the programmable weights in the neural network.

114

Figure 4.7 The DC I-V characteristic of the TiOx/HfOx/TiOx/HfOx synaptic device.

4.3.1 Analog Synapse for Depression Learning

Since we utilize the gradual RESET process, we started the training from two initial

resistance states (~500 Ω and ~20 kΩ), and applied 400 identical RESET pulses (-1.1

V/10 ns and -1.3 V/10 ns), and the results collected from 5 different devices are shown in

Figure 4.8. During the depression learning process, the resistance gradually increases;

about 100 intermediate states were achieved although the variation is still present. To test

the device endurance, we repeated such training process for 1000 cycles (1 cycle includes

400 identical RESET pulses). After one RESET training cycle, we used DC sweep to

SET the device to ~500 Ω or ~20 kΩ by controlling the compliance current. Figure 4.9

shows the RESET training process at several milestone cycles in the endurance test. No

115

significant degradation was found after 1000 cycles. The energy consumption per spike

depends on the initial resistance states. Figure 4.10 shows the training process starting

from 3 different initial states (~500 Ω, ~3 kΩ and ~20 kΩ). The higher resistance the

initial state is, the less the energy per spike the device consumes. For an initial resistance

state of ~20 kΩ, the energy per spike drops to ~0.85 pJ. It should be mentioned that here

the maximum energy consumption at the initial stage of the training is estimated. As the

training progresses, the resistance gets higher, and the energy consumption also

decreases. As far as we know, this is the first sub-pJ synaptic device reported in the

literature.

Figure 4.8 The depression learning process by 400 consecutive identical RESET pulses

for 5 different devices. The initial resistance states are chosen to be ~500 Ω and ~20 kΩ,

respectively.

116

Figure 4.9 The depression learning process at several milestones during the 1000 cycle

endurance cycling test. 1 cycle includes 400 consecutive identical RESET pulses.

Figure 4. 10 Dependence of the energy per spike on the initial resistance states for the

training process. If starting at ~ 20 kΩ, the maximum energy per spike drops below 1 pJ.

117

To capture the gradual RESET process, we employed our compact model that has been

calibrated with this TiOx/HfOx/TiOx/HfOx RRAM (see Section 3.2). Figure 4.11 show the

simulated depression learning process starting from ~500 Ω by varying the pulse

amplitudes, mimicing the the experimental data in Figure 3.24. In Section 4.3.3, we apply

this compact model of depression learning for the system-level simulation.

Figure 4.11 Simulated depression learning process starting from ~500 Ω by varying the

pulse amplitudes, mimicing the the experimental data in Figure 3.24.

4.3.2 Binary Synapse for Stochastic Learning

In the previous section, we explored an analog synapse utilizing depression learning.

The reason why we only utilized the depression is that the gradual RESET transition

offers hundreds of intermediate states while the abrupt SET transition only offers binary

118

states. It is believed that the analog synapse generally outperforms the binary synapse for

neuromorphic computing because a limited number of synaptic states dramatically reduce

the storage capacity of a neural network [136]. If the synaptic strength cannot be changed

by an arbitrarily small amount as in the case of the binary synapse, the newly learned

patterns quickly overwrite the previously learned ones, thus the storage capacity is

limited. This problem can be overcome by a stochastic learning rule that changes only a

small fraction of synapses randomly chosen at each training cycle [136]. How can this

random choice be realized in an oxide binary synaptic device without increasing the

complexity of the CMOS neuron circuit design? Here we demonstrate that the SET

transition of the oxide RRAM becomes probabilistic under a weak programming

condition (applying a smaller voltage than the nominal switching voltage for a successful

programming event). We propose utilizing this inherent switching variability to realize

the stochastic learning rule in the binary synapse.

Figure 4.12 shows the measured SET/RESET continuous cycling with different SET

pulse amplitudes (+1.9 V/10 ns, +1.6 V/10 ns, +1.3 V/10 ns, respectively). It is seen that

with the decrease of the SET pulse amplitude, the SET success probability decreases as

well, thus the SET process becomes stochastic under weak programming conditions. The

resistive switching is inherently stochastic as suggested by our Kinetic Monte Carlo

simulation described before. The remarkable switching parameter variability is a well-

known technical challenge for the oxide RRAM array design and substantial research

efforts were spent to reduce the variability. Here we make use of this nominal

119

disadvantage (from a digital memory perspective) to realize the stochastic learning rule

for the binary synapse.

(a)

(b)

120

Figure 4.12 Measured SET/RESET continuous cycling with different SET pulse

amplitudes (a) +1.9 V/10 ns, (b) +1.6 V/10 ns, (c) +1.3 V/10 ns, respectively.

The SET process becomes stochastic under weak programming conditions.

To obtain the statistics for both cycle-to-cycle variation and device-to-device variation,

we measured the pulse amplitudes required for triggering the SET transition: In each

cycle, a strong RESET pulse (-1.9 V/10 ns) was applied to achieve a complete off-state

(~500 kΩ), then a weak SET pulse with amplitudes from +0.6 V to +3 V (with linearly

spaced steps with increasing amplitude) with a 10 ns width was applied to determine the

switching probability. Such cycle was repeated for 100 cycles for each device. Then 50

different devices on the wafer were measured. Figure 4.13 shows the measured statistical

distribution: (a) for a particular device, the pulse amplitude for a successful SET

operation roughly follows a Gaussian distribution with a standard deviation about 0.3 V;

(c)

121

(b) across various devices, the medium pulse amplitude for a successful SET operation is

centered around 1.95 V with a standard deviation about 0.15 V (a straight line in this plot

indicates a Gaussian distribution). If we design the pulse amplitude applied to the device

to be 1.6 V, then on average, around 12% SET trials will be successful. Certainly, due to

device-to-device variation, some device may have success probability higher than 12%,

while others may have success probability lower than 12%. Nevertheless, the SET

transition becomes probabilistic under this weak programming condition. In the learning

simulation that follows, we use a weak SET condition (e.g. +1.6 V/10 ns) with a strong

RESET condition (e.g. -1.9 V/10 ns) for a stochastic learning rule. Here a strong RESET

is needed to switch the device to a complete off-state to avoid any unintentional

switching under a weak SET programming condition in the next cycle.

(a)

122

Figure 4.13 Measured statistical distribution of pulse amplitude required for triggering

the SET switching from the off-state. In (a), the probability of SET switching is measured

from one representative device for 100 cycles. In (b), 50 different devices on the wafer

were measured with one type of symbol in the figure representing the data from one

device.

According to the measurement results in Figure 4.13, the stochastic switching behavior

is modeled as follows: the cycle-to-cycle variation of the binary synapse is modeled as a

Gaussian distribution of the threshold SET pulse amplitude (with a standard deviation 0.3

V); and the median value of the Gaussian distribution shifts from device to device,

reflecting the device-to-device variation, which is modeled as a Gaussian distribution

(b)

123

centred around 1.95 V with a standard deviation 0.15 V. When this model was

implemented in the following simulation, each synapse in the network is randomly

assigned with a Gaussian distribution of SET threshold with the parameters extracted

aforementioned. In Section 4.3.3, we apply this model of stochastic learning for the

system-level simulation.

4.3.3 Winner-Take-All Network Simulation

For oxide RRAM devices to be used for emulating synapses, there are several general

open questions needed to be addressed at the system-level, e.g., why does a useful

learning algorithm require the multi-level intermediate states, how many resistance states

are needed, how does the inherently stochastic device variation affect system

performance, and can binary synapse be useful if a stochastic learning rule is used? In

order to answer these questions, we perform a simulation of a two-layer winner-take-all

(WTA) neural network [137] as a toy model, comparing binary synapse and analog

synapse implementations.

Figure 4.14 shows the WTA architecture implemented by integrate-and-fire neurons

and oxide synaptic devices: Every neuron in the output layer connects with all the

neurons in the input layer through excitatory synapses based oxide RRAM devices. Every

neuron in the output layer also connects to one another through inhibitory synapses based

on fixed resistors. The unsupervised competitive learning algorithm allows such two-

layer network to perform the orientation classification function [137]. Figure 4.15 shows

a spiking scheme for implementing the unsupervised competitive learning algorithm

124

designed for (a) binary synapse and for (b) analog synapse: The input layer neurons fire

according to the light intensity of the input pattern; if the light intensity exceeds the

neuron firing threshold, the neurons send a relatively long but small positive pulse to all

the output layer neurons through the excitatory synapses. The output layer neurons sum

and integrate the input currents on the membrane capacitor independently, and the one

with the largest input current fires first (becomes the “winner”), then it discharges the

membrane capacitor of all the other output layer neurons and prevent them from firing

(“takes all”) through the inhibitory synapses. Meanwhile this winner neuron sends a short

two-phase pulse with a small negative pulse followed by a large positive pulse back to all

the input layer neurons for binary synapse, or it sends a positive pulse with amplitude of

the actual RESET programming pulse for analog synapse. Thus the excitatory synapse

strength gets modified according to the input pattern. For stochastic learning using binary

synapse under a weak programming condition (e.g. +1.6 V/10 ns), the update of the

synapse conductance map is an incremental process. For depression learning using analog

synapse (e.g. -1.1 V/10 ns), the update of the synapse conductance map is also an

incremental process. After a certain number of training images, a self-organized

conductance map is expected to emerge.

125

Figure 4.14 The WTA architecture implemented by integrate-and-fire neurons and oxide

synaptic devices.

(a)

126

Figure 4.15 A spiking scheme for implementing the unsupervised competitive learning

algorithm designed for (a) binary synapse and for (b) analog synapse.

In the following simulation, 32×32 neurons in the input layer are used and 2×2 neurons

in the output layer are used. Thus there are 4096 oxide synaptic devices between the two

layers. The neuron firing threshold is set to be 1 V, and the decay time constant of the

membrane voltage is set to be 1 µs. For binary synapse, the pre-synaptic forward spike

from the input layer neuron is designed to be a positive pulse (e.g. +0.8 V/500 ns) that is

half the amplitude of the actual SET programming pulse, and the post-synaptic backward

spike from the output layer neuron is designed to be a negative pulse (e.g. -0.8 V/10 ns)

that is half the amplitude of the actual SET programming pulse followed by a positive

pulse (e.g. +1.9 V/10 ns). For analog synapse, the pre-synaptic forward spike from the

(b)

127

input layer neuron is designed to be positive pulse (e.g. +0.55 V/500 ns) that is half

amplitude of the actual RESET programming pulse, and the post-synaptic backward

spike is designed to be a positive pulse with amplitude of the actual RESET programming

pulse (e.g. +1.1 V/10 ns).

Initially, the resistances of all the oxide synaptic devices were randomized with a

distribution centered at the on-state (~500 Ω). During the training, 200 gray-scale testing

images with 32×32 pixels were presented into the input layer neurons. The input patterns

have the shape of a 2D Gaussian bar with random orientation. The decay length of the 2D

Gaussian bar in longitude direction is 16 pixels and the decay length in latitude direction

is 4 pixels. The input layer neuron fires if the relative intensity is larger than 0.5. These

200 test images have a non-uniform distribution in 4 orientations (centered at 0o, 45o, 90o,

and 135o with a standard deviation of 7.5o). The target of the network is to converge at

these 4 dominate orientations. When the training was completed after 200 training images

by definition, standard images of 2D Gaussian bar in 24 different orientations (0o to 180

with a step of 7.5o) were used for testing the orientation selectivity of the network.

Figure 4.16 shows the evolution of the normalized conductance map between the input

layer neurons and the output layer neurons for the binary synapse with stochastic learning

(a)-(c) and the analog synapse with depression learning (d)-(f). Initially, the resistances of

all the oxide synaptic devices were randomized with a distribution centered at an on-state

(~500 Ω), see (a) for binary synapse and (d) for analog synapse. After the training, the

resistances split into groups of the on-state and the off-state. With appropriate

128

programming condition, the 4 distinct orientations emerge, see (b) for binary synapse

using +1.6 V/10 ns SET pulse and (e) for analog synapse using -1.1 V/10 ns RESET

pulse. It is noted that for the analog synapse, there are many noisy pixels caused by the

intermediate states. If the programming condition not optimized, only 3 distinct

orientations emerge, see (c) for the binary synapse using +2 V/10 ns SET pulse and (f)

for the analog synapse using -1.4 V/10 ns RESET pulse.

Figure 4.16 Simulated normalized conductance map between the input layer neurons and

the output layer neurons utilizing binary synapse with stochastic learning (a)-(c) and

analog synapse with depression learning (d)-(f). The normalization is done with respect

to a reference that is the highest conductance in the synapse array before the training.

129

To compare the system performance between the binary synapse and the analog

synapse, three metrics are used: 1) the orientation selectivity defined as the contrast of the

output layer neuron’s response intensity to the 1st preferred orientation over the 2nd

preferred orientation; 2) the orientation storage capacity defined as the number of distinct

orientations stored in the output layer (ideally, 4 distinct orientations will be detected); 3)

the energy consumed on the synaptic devices during the whole training, including the

read energy for summing the current through the synapses and the write energy for

programming the synapses. Figure 4.17 shows the average values of these metrics as a

function of programming conditions for the system with the binary synapse (a)-(b) and

the system with the analog synapse (c)-(d) through 100 independent simulation runs. The

trends can be explained as follows: For the binary synapse, increasing the SET pulse

amplitude means increasing the SET success probability. As a result, the selectivity

increases because more pixels are switched to “white” and the contrast is improved. The

orientation storage capacity can achieve the maximum value 4 at 1.6 V, thus +1.6 V/10 ns

is chosen as the optimized programming condition for the binary synapse, which

corresponds to a SET success probability ~ 12% on average. The loss of the orientation

storage capacity below 1.6 V SET pulse amplitude is due to insufficient SET success

probability, which limits the ability of the network to learn sufficient patterns for a fixed

(limited) set of training images (200 images in this case). On the other hand, the rapid

drop of the orientation storage capacity beyond 1.6 V SET pulse amplitude is due to

excessive SET success probability, which hastens the network’s forgetting process

130

(overwriting the learned patterns too frequently), thus only the final patterns are

remembered (see Fig. 4.16 (c) as an example). The total energy consumption (including

the read and write energy) increases with the increase of the SET pulse amplitude. The

energy consumption roughly follows the relationship ~ E = (V2/R)×t. For the analog

synapse, increasing the RESET amplitude means that the RESET transition becomes less

gradual and fewer intermediate states are available (see Figure 3.24). As a result, both the

selectivity and the orientation storage capacity decreases with increasing RESET pulse

amplitude (see Fig. 4.16 (f) as an example). Here -1.1 V/10 ns is chosen as the optimized

programming condition for the analog synapse. Under depression learning mode, the

learning becomes saturated as the devices quickly RESET to the completely off-state if

the number of possible intermediate states is insufficient. The write energy consumption

decreases with the increase of RESET pulse amplitude since the learning saturates faster.

The read energy has a turning point due to the competing trends of increasing voltage and

increasing resistance in the relationship ~E = V2/R×t. At the optimized programming

condition for the binary synapse and the analog synapse respectively, the same full

network storage capacity of 100% is achievable, the selectivity of the binary synapse is

14.2% and that of the analog synapse is 9.9%, and the total energy consumption of the

binary synapse is 156 µJ and the that of the analog synapse is 60 µJ. The feasibility of the

adaptive learning with either binary synapse or analog synapse is demonstrated through

this system-level simulation.

131

(a)

(b)

132

Figure 4.17 Simulated system performance metrics as a function of programming

conditions. Network orientation selectivity and orientation storage capacity for binary

synapse in (a) and for analog synapse in (c); Energy consumption of the synaptic devices

(c)

(d)

133

during the whole training (200 training images) for binary synapse in (b) and for analog

synapse in (d). The average values through 100 independent simulation runs are shown.

To study the impact of synaptic device variation on the robustness of the neuromorphic

system, we increase the variability (δR/R) for the analog synapse and perform 100

independent simulations for each variability level (for the binary synapse simulation, the

variation effect is already considered in the model). Figure 4.18 shows that up to the

experimental variability level (~9%), the system is robust, and there is only slight

degradation of the selectivity even at the variability level ~ 40%. This simulation

demonstrates the robustness of the neuromorphic computing against the device variation

at the system level thanks to the parallelism of the neural network.

Figure 4.18 Simulated orientation selectivity as a function of the analog synapse

variability δR/R. At each error bar, 100 independent simulations are performed.

134

4.4 Performance Metrics for Synaptic Device

The interest in building large-scale neuromorphic systems using synaptic devices has

motivated the device community to make significant progress in developing synaptic

devices mimicking the key characteristics of biological synapses. However, a clear

understanding of the performance metrics is still lacking, mainly due to the diversity of

the targeted application space. In this section, we discuss some general characteristics for

synaptic devices which are desirable for most of the applications. In order to evaluate

different synaptic device characteristics, a rule of thumb can be scalability to biological

levels. An ideal synaptic device should have characteristics such as size, energy

consumption, operation frequency, which are scalable to biological levels. The analysis

below is presented with that perspective in mind. Table 4.1 summarizes desirable

performance metrics for synaptic devices.

1) Synaptic Device Dimensions: The large-scale integration of neural network requires a

compact synaptic device with a small device footprint. Two terminal structures that

can enable 3D integration and potential scalability to nanometer regime are one of the

key guiding principles of synaptic devices research. The density of synapses in

human cortex is >~109/mm3 and the physical size of the synaptic cleft is <~20 nm

[133]. Thus synaptic devices with scalability down to sub-10 nm regime is preferred.

10 nm×10 nm HfOx RRAM devices has been demonstrated [26], showing a good

promise for scalability.

135

2) Energy Consumption: Among all performance metrics, energy consumption is the

most challenging and probably the most difficult to achieve considering the need for

energy efficient brain-inspired computational systems in the future. The number of

biological synapses is enormous (~1015) in the human brain [133]. As an order-of-

magnitude estimation, the synapses operate at about a frequency of 1~10 Hz [133],

and the human brain consumes a power of the order of 10 W [133]. Thus, on average

the energy consumption per synaptic event is around 1~10 fJ. Some of the HfOx

RRAM have already shown sub-pJ level energy consumption [26], while it holds the

potential to further decrease in synaptic energy consumption to the fJ level by further

lowering the programming current from tens of µA to sub-µA level.

3) Operating Speed/Programming Time: The brain operates at an average frequency of

1~10 Hz. Most of the synaptic device candidates can already operate faster than that.

The fast (~ns) programming speed of the oxide RRAM provides a lot of flexibility to

design the signaling scheme for implementing learning rules. A good strategy can be

developing plasticity schemes for synaptic devices with ns or µs programming times

to keep the synaptic energy consumption as low as possible (See the discussion about

energy-efficient programming in Section 2.4.2).

4) Multi-level States: Synaptic plasticity characteristics measured on biological synapses

show an analog-like behavior with many synaptic weight states between the highest

and the lowest conductance states [133]. In general, more multi-level states are

known to bring advantages in terms of network capacity and robustness, the precise

136

requirement for the number of conductance states remains strongly application-

specific. Our work (Section 4.3) and also other group’s work [138] on binary synapse

provide more design space with stochastic learning algorithm.

5) Dynamic Range: A biological synapse, which exhibits the STDP behavior, needs to

perform a cumulative weight change with a maximum of 100% and a minimum -50%

[133], which translates into a minimum dynamic range of 4 for the synaptic device

conductance. Hence, a dynamic range of 10 is more than sufficient to implement

biological plasticity. Most of the synaptic device candidates exhibit a wider (>100)

programmable resistance range, which provides flexibility in choosing the operation

regime. Since we have more freedom to choose the operating resistance range, for

oxide RRAM based synaptic device, the high resistance regime is found to be more

energy-efficient because the operating current is low in that regime. Besides, the high

resistance is useful for minimizing the read energy, and also helps to reduce the

neuron capacitor area. When the integrated-and-fire neuron sums the current from

hundreds of synapses, the total current charges the membrane capacitor of the neuron.

A high summated current will require a larger capacitor and, neuron circuitry will

occupy significantly large chip area.

6) Retention and Endurance: For the long-term memory, it needs to have a retention

property in the order of 10 years, to enable formation of life-long memories and to

maintain them for long periods of time. It can be envisioned that an endurance of

3x109 synaptic operations will guarantee 10 years of lifetime with 10 Hz operation

137

frequency. Most of the works published on synaptic devices has not given enough

attention to retention and endurance characteristics yet.

7) Uniformity and Variation: Poor uniformity and variation in device characteristics is a

major barrier to introducing novel nanoscale devices to CMOS logic or memory

applications. In contrast, brain-inspired or neuromorphic architectures promise

immunity against device variations. The level of variation that can be tolerated at the

system level strongly depends on the network architecture and the accuracy required

by the target application. Besides our simulation in Section 4.3.3, some works have

shown the robustness against device variations in different neural networks through

simulations [139][140]. Since the level of variation that can be tolerated is a

parameter that strongly depends on the characteristics of the neural network, we will

not specify a maximum level in this thesis.

Table 4.1 Summary of the desirable performance metrics for synaptic devices.

Performance metrics Targets

Device dimension < 10 nm

Energy consumption <10 fJ/spike

Operation frequency >>10 Hz

Multi-level states number 10~100

Dynamic range >4

Retention/Endurance >10 years/ 109 synaptic activities

138

4.5 Summary

We explore the potential of the RRAM technology as synaptic device for the hardware

implementation of neuromorphic computing. Programming schemes were proposed for

realizing spike-timing dependent plasticity (STDP) in the RRAM devices. The gradual

resistance modulation capability was explored to serve as analog synapse, and the

stochastic switching was explored to serve as binary synapse. A simulation of winner-

take-all network suggests that the orientation classification can be effectively realized

using RRAM synapses. And the system performance is found to be robust against the

RRAM device variability due to the adaptive algorithm and parallelism of the neural

network. Finally, the performance metrics of desired synaptic device were discussed.

139

Chapter 5

Conclusion and Outlook

5.1 Summary of contribution

This thesis addresses the key challenges of the RRAM technology development,

including the physics of resistive switching in oxides, the origin of the relatively poor

uniformity and large variability of the switching parameters. Additionally it also explores

the potential of using RRAM as synaptic device for neuromorphic computing. The

contributions of this thesis include:

1. A general physical picture of resistive switching phenomenon in binary oxides is

established, and is becoming a prevailing theory in the community. The resistive

switching is attributed to the conductive filament (made up of oxygen vacancies)

formation and rupture in the oxides. The physical picture was validated using the

HfOx-based RRAM devices through experiments and modeling.

2. The trap-assisted-tunneling was identified to be the dominant conduction

mechanism in oxide RRAM by various characterization techniques such as I-V

measurement at various temperature, low frequency noise measurement, AC

conductance measurement, etc.

3. An exponential voltage-time relationship was found between the switching time

and the applied voltage, revealing the physics of the oxygen ion migration

140

dynamics. Inspired by the voltage-time relationship, an energy-efficient

programming scheme was also proposed for multi-level operation.

4. For the first time, a Kinetic Monte Carlo (KMC) numerical simulator was

developed for RRAM. The idea of “discretizing” the oxide matrix and tracking

each individual oxygen vacancy and ion evolution path is a breakthrough in the

RRAM modeling as this method inherently includes stochastic nature of resistive

switching in the model and enables the model to describe the experimentally

observed variability of device characteristics. The KMC simulation can reproduce

many experimental observations in the DC I-V sweep, pulse switching, gradual

RESET, temporal fluctuation of RESET current, endurance cycling, and retention

baking, etc.

5. For the first time, the origin of the tail bits in the resistance distribution was

identified: tail bits in HRS are attributed to the oxygen vacancy left over in the

gap region due to a competition between the oxygen vacancy generation and

recombination.

6. A compact RRAM device model was developed for circuit/system-level

simulations, which has been implemented in MATLAB, SPICE macro model, and

Verilog-A and executable with the SPICE simulation engine.

7. The potential of the RRAM technology as synaptic device for the hardware

implementation of neuromorphic computing was explored. Programming schemes

141

were proposed for realizing spike-timing dependent plasticity in the RRAM

devices.

8. Pioneering works on the gradual resistance modulation capability as analog

synapse and the stochastic switching as binary synapse opens new opportunities

in the synaptic design.

9. A simulation of winner-take-all network was performed on the orientation

classification function using RRAM synapses. And the system performance is

found to be robust against the RRAM device variability due to the adaptive

algorithm and parallelism of the neural network.

5.2 Future work

This thesis presents a comprehensive study which provides a solid foundation for more

exciting future work in several directions.

With respect to the RRAM TAT modeling, the coupling of the Poisson equation with

the TAT current continuity equation is missing due to the convergence issue in numerical

solution. Ideally the Poisson equation should be added into the simulator in a self-

consistent manner.

With respect to the RRAM KMC modeling, currently the electrodes are treated as two

lines at the boundaries. In the future work, it is suggested to include the electrode into the

simulated region. By doing so, we can further study the electrode/oxide interface effect.

142

Effects due to the surface roughness can be handled as well. More aggressively, the KMC

simulator could be extended from a 2D simulator to 3D simulator to have better

estimation of the Vo density in the CF as well as a more physics-based three-dimensional

description of the current conduction paths.

With respect to the RRAM compact modeling, implementing the compact model in the

Verilog-A form in the SPICE simulator is useful to enable many circuit design. This

work is already underway and close to completion.

With respect to the synaptic device design, we have measured on a single device and

extracted the parameters for network-level simulation. The next step is to validate the

learning algorithms directly on a cross-point RRAM array.

143

Bibliography

[1] A. Szalay and J. Gray, “2020 computing: science in an exponential world,” Nature,

vol. 440, no. 7083, pp. 413–414, 2006.

[2] J. Akerman, “Toward a universal memory,” Science, vol. 743, no. 9, pp. 508–510,

2005.

[3] B. J. Zhu, “Magnetoresistive random access memory: the path to competitiveness

and scalability,” Proceedings of the IEEE, vol. 96, no. 11, pp. 1786–1798, 2008.

[4] H.-S. P. Wong, S. Raoux, S. Kim, J. Liang, J. P. Reifenberg, B. Rajendran, M.

Asheghi, and K. E. Goodson, “Phase change memory,” Proceedings of the IEEE,

vol. 98, no. 12, pp. 2201–2227, 2010.

[5] H.-S. P. Wong, H.-Y. Lee, S. Yu, Y.-S. Chen, Y. Wu, P.-S. Chen, B. Lee, F. T.

Chen, and M.-J. Tsai, “Metal–oxide RRAM,” Proceedings of the IEEE, vol. 100,

no. 6, pp. 1951–1970, 2012.

[6] C. W. Smullen, V. Mohan, A. Nigam, S. Gurumurthi, and M. R. Stan, “Relaxing

non-volatility for fast and energy-efficient STT-RAM caches,” IEEE International

Symposium on High Performance Computer Architecture, pp. 50–61, 2011.

144

[7] G. W. Burr, B. N. Kurdi, J. C. Scott, C. H. Lam, K. Gopalakrishnan, and R. S.

Shenoy, “Overview of candidate device technologies for storage-class memory,”

IBM Journal of Research and Development, vol. 52, no. 4/5, pp. 449–464, 2008.

[8] Y. Xie, “Modeling, architecture, and applications for emerging memory

technologies,” IEEE Design & Test of Computers, vol. 28, no. 1, pp. 44–51, 2011.

[9] T. W. Hickmott, “Low-frequency negative resistance in thin anodic oxide films,”

Journal of Applied Physics, vol. 33, no. 9, pp. 2669–2682, 1962.

[10] W. W. Zhuang, W. Pan, B. D. Ulrich, J. J. Lee, L. Stecker, A. Burmaster, D. R.

Evans, S. T. Hsu, A. Shimaoka, K. Inoue, T. Naka, N. A. K. Sakiyama, Y. Wang,

S. Q. Liu, N. J. Wu, and A. Ignatiev, “Novell colossal magnetoresistive thin film

nonvolatile resistance random access memory (RRAM),” IEEE International

Electron Devices Meeting, pp. 193–196, 2002.

[11] A. Asamitsu, Y. Tomioka, H. Kuwahara, and Y. Tokura, “Current switching of

resistive states in magnetoresistive manganites,” Nature, vol. 388, no. 3, pp. 1995–

1997, 1997.

[12] A. Beck, J. G. Bednorz, C. Gerber, C. Rossel, and D. Widmer, “Reproducible

switching effect in thin oxide films for memory applications,” Applied Physics

Letters, vol. 77, no. 1, pp. 139–141, 2000.

145

[13] Y. Watanabe, J. G. Bednorz, A. Bietsch, C. Gerber, D. Widmer, A. Beck, and S. J.

Wind, “Current-driven insulator–conductor transition and nonvolatile memory in

chromium-doped SrTiO3 single crystals,” Applied Physics Letters, vol. 78, no. 23,

pp. 3738–3740, 2001.

[14] I. G. Baek, M. S. Lee, S. Sco, M. J. Lee, D. H. Seo, D.-S. Suh, J. C. Park, S. O.

Park, H. S. Kim, I. K. Yoo, U.-I. Chung, and J. T. Moon, “Highly scalable non-

volatile resistive memory using simple binary oxide driven by asymmetric unipolar

voltage pulses,” IEEE International Electron Devices Meeting, pp. 587–590, 2004.

[15] S. Seo, M. J. Lee, D. H. Seo, E. J. Jeoung, D.-S. Suh, Y. S. Joung, I. K. Yoo, I. R.

Hwang, S. H. Kim, I. S. Byun, J.-S. Kim, J. S. Choi, and B. H. Park,

“Reproducible resistance switching in polycrystalline NiO films,” Applied Physics

Letters, vol. 85, no. 23, pp. 5655–5657, 2004.

[16] B. J. Choi, D. S. Jeong, S. K. Kim, C. Rohde, S. Choi, J. H. Oh, H. J. Kim, C. S.

Hwang, K. Szot, R. Waser, B. Reichenberg, and S. Tiedke, “Resistive switching

mechanism of TiO2 thin films grown by atomic-layer deposition,” Journal of

Applied Physics, vol. 98, no. 3, p. 033715, 2005.

[17] A. Chen, S. Haddad, Y. J. Wu, T. Fang, Z. Lan, S. Avanzino, M. Buynoski, M.

Rathor, W. D. Cai, N. Tripsas, C. Bill, M. Vanbuskirk, and M. Taguchi, “Non-

146

volatile resistive switching for advanced memory applications,” IEEE

International Electron Devices Meeting, pp. 746 - 749, 2005.

[18] C. Lin, C. Wu, C. Wu, T. Tseng, and C. Hu, “Modified resistive switching

behavior of ZrO2 memory films based on the interface layer formed by using Ti

top electrode,” Journal of Applied Physics vol. 102, no. 9, p. 094101, 2007.

[19] N. Xu, L. Liu, X. Sun, X. Liu, D. Han, Y. Wang, R. Han, J. Kang, and B. Yu,

“Characteristics and mechanism of conduction/set process in TiN∕ZnO∕Pt

resistance switching random-access memories,” Applied Physics Letters, vol. 92,

no. 23, p. 232112, 2008.

[20] H. Y. Lee, P. S. Chen, T. Y. Wu, Y. S. Chen, C. C. Wang, P. J. Tzeng, C. H. Lin, F.

Chen, C. H. Lien, and M. Tsai, “Low power and high speed bipolar switching with

a thin reactive Ti buffer layer in robust HfO2 based RRAM,” IEEE International


[21] Z. Wei, Y. Kanzawa, K. Arita, Y. Katoh, K. Kawai, S. Muraoka, S. Mitani, S. Fujii,

K. Katayama, M. Iijima, T. Mikawa, T. Ninomiya, R. Miyanaga, Y. Kawashima,

K. Tsuji, A. Himeno, T. Okada, R. Azuma, K. Shimakawa, H. Sugaya, T. Takagi,

R. Yasuhara, K. Horiba, H. Kumigashira, and M. Oshima, “Highly reliable TaOx

ReRAM and direct evidence of redox reaction mechanism,” IEEE International


147

[22] W. Kim, S. I. Park, Z. Zhang, Y. Y. Liauw, D. Sekar, H.-S. P. Wong, and S. S.

Wong, “Forming-free nitrogen-doped AlOx RRAM with sub-μA programming

Current,” IEEE Symposium on VLSI Technology, pp. 22-23, 2011.

[23] R. Waser, and M. Aono, “Nanoionics-based resistive switching memories,” Nature

Materials, vol. 6, pp. 833–840, 2007.

[24] I. Valov, R. Waser, J. R. Jameson, and M. N. Kozicki, “Electrochemical

metallization memories--fundamentals, applications, prospects,” Nanotechnology,

vol. 22, no. 25, p. 254003, 2011.

[25] S. Yu, and H.-S. P. Wong, “A phenomenological model for the reset mechanism of

metal oxide RRAM,” IEEE Electron Device Letters, vol. 31, no. 12, pp. 1455–

1457, 2010.

[26] B. Govoreanu, G. S. Kar, Y. Chen, V. Paraschiv, S. Kubicek, A. Fantini, I. P. Radu,

L. Goux, S. Clima, R. Degraeve, N. Jossart, O. Richard, T. Vandeweyer, K. Seo, P.

Hendrickx, G. Pourtois, H. Bender, L. Altimime, D. J. Wouters, J. A. Kittl, and M.

Jurczak, “10×10nm2 Hf /HfOx crossbar resistive RAM with excellent performance ,

reliability and low-energy operation,” IEEE International Electron Devices

Meeting, pp. 729–732, 2011.

[27] J. S. Choi, J.-S. Kim, I. R. Hwang, S. H. Hong, S. H. Jeon, S.-O. Kang, B. H. Park,

D. C. Kim, M. J. Lee, and S. Seo, “Different resistance switching behaviors of NiO

148

thin films deposited on Pt and SrRuO3 electrodes,” Applied Physics Letters, vol. 95,

no. 2, p. 022109, 2009.

[28] C. Rohde, B. J. Choi, D. S. Jeong, S. Choi, J.-S. Zhao, and C. S. Hwang,

“Identification of a determining parameter for resistive switching of TiO2 thin

films,” Applied Physics Letters, vol. 86, no. 26, p. 262907, 2005.

[29] C. Yoshida, K. Tsunoda, H. Noshiro, and Y. Sugiyama, “High speed resistive

switching in Pt∕TiO2∕TiN film for nonvolatile memory application,” Applied

Physics Letters, vol. 91, no. 22, p. 223510, 2007.

[30] W.-Y. Chang, Y.-C. Lai, T.-B. Wu, S.-F. Wang, F. Chen, and M.-J. Tsai,

“Unipolar resistive switching characteristics of ZnO thin films for nonvolatile

memory applications,” Applied Physics Letters, vol. 92, no. 2, p. 022110, 2008.

[31] N. Xu, L. F. Liu, X. Sun, C. Chen, Y. Wang, D. D. Han, X. Y. Liu, R. Q. Han, J. F.

Kang, and B. Yu, “Bipolar switching behavior in TiN/ZnO/Pt resistive nonvolatile

memory with fast switching and long retention,” Semiconductor Science and

Technology, vol. 23, no. 7, p. 075019, 2008.

[32] C. Lin, C. Wu, C. Wu, T. Lee, and F. Yang, “Effect of top electrode material on

resistive switching properties of ZrO2 film memory devices,” IEEE Electron

Device Letters, vol. 28, no. 5, pp. 366–368, 2007.

149

[33] Y.-M. Kim and J.-S. Lee, “Reproducible resistance switching characteristics of

hafnium oxide-based nonvolatile memory devices,” Journal of Applied Physics,

vol. 104, no. 11, p. 114115, 2008.

[34] L. Goux, Y.-Y. Chen, L. Pantisano, X.-P. Wang, G. Groeseneken, M. Jurczak, and

D. J. Wouters, “On the gradual unipolar and bipolar resistive switching of

TiN\HfO2\Pt memory systems,” Electrochemical and Solid-State Letters, vol. 13,

no. 6, p. G54, 2010.

[35] K. M. Kim, B. J. Choi, B. W. Koo, S. Choi, D. S. Jeong, and C. S. Hwang,

“Resistive switching in Pt∕Al2O3∕TiO2∕Ru stacked structures,” Electrochemical and

Solid-State Letters, vol. 9, no. 12, p. G343, 2006.

[36] C.-Y. Lin, C.-Y. Wu, C.-Y. Wu, C. Hu, and T.-Y. Tseng, “Bistable resistive

switching in Al2O3 memory thin films,” Journal of The Electrochemical Society,

vol. 154, no. 9, p. G189, 2007.

[37] M.-J. Lee, C. B. Lee, D. Lee, S. R. Lee, M. Chang, J. H. Hur, Y.-B. Kim, C.-J.

Kim, D. H. Seo, S. Seo, U.-I. Chung, I.-K. Yoo, and K. Kim, “A fast, high-

endurance and scalable non-volatile memory device made from asymmetric Ta2O5-

x/TaO2-x bilayer structures,” Nature Materials, vol. 10, pp. 625–630, 2011.

[38] S.-S. Sheu, M.-F. Chang, K.-F. Lin, C.-W. Wu, Y.-S. Chen, P.-F. Chiu, C.-C. Kuo,

Y.-S. Yang, P.-C. Chiang, W.-P. Lin, C.-H. Lin, H.-Y. Lee, P.-Y. Gu, S.-M. Wang,

150

F. T. Chen, K.-L. Su, C.-H. Lien, K.-H. Cheng, H.-T. Wu, T.-K. Ku, M.-J. Kao,

and M.-J. Tsai, “A 4Mb embedded SLC resistive-RAM macro with 7.2ns read-

write random-access time and 160ns MLC-access capability,” IEEE International

Solid-State Circuits Conference, pp. 200–202, 2011.

[39] A. Kawahara, R. Azuma, Y. Ikeda, K. Kawai, Y. Katoh, K. Tanabe, T. Nakamura,

Y. Sumimoto, N. Yamada, N. Nakai, S. Sakamoto, Y. Hayakawa, K. Tsuji, S.

Yoneda, A. Himeno, K. Origasa, K. Shimakawa, T. Takagi, T. Mikawa, and K.

Aono, “An 8Mb multi-layered cross-point ReRAM macro with 443MB/s write

throughput,” IEEE International Solid-State Circuits Conference, pp. 432–434,

2012.

[40] T.-Y. Liu, T. H. Yan, R. Scheuerlein, Y. Chen, J. K. Lee, G. Balakrishnan, G. Yee,

H. Zhang, A. Yap, J. Ouyang, T. Sasaki, S. Addepalli, A. Al-Shamma, C.-Y. Chen,

M. Gupta, G. Hilton, S. Joshi, A. Kathuria, V. Lai, D. Masiwal, M. Matsumoto, A.

Nigam, A. Pai, J. Pakhale, C. H. Siau, X. Wu, R. Yin, L. Peng, J. Y. Kang, S.

Huynh, H. Wang, N. Nagel, Y. Tanaka, M. Higashitani, T. Minvielle, C. Gorla, T.

Tsukamoto, T. Yamaguchi, M. Okajima, T. Okamura, S. Takase, T. Hara, H. Inoue,

L. Fasoli, M. Mofidi, R. Shrivastava, K. Quader, “A 130.7mm2 2-layer 32Gb

ReRAM memory device in 24nm technology” IEEE International Solid-State

Circuits Conference, pp. 210–212, 2013.

151

[41] W.-C. Chien, M.-H. Lee, Y.-Y. Lin, and K.-Y. Hsieh, “Multi-level 40nm WOx

resistive memory with excellent reliability,” IEEE International Electron Devices

Meeting, pp. 725-728, 2011.

[42] S. R. Lee, Y.-B. Kim, M. Chang, K. M. Kim, C. B. Lee, J. H. Hur, D.-S. Park, D.

Lee, M.-J. Lee, C. J. Kim, U. Chung, I.-K. Yoo, and K. Kim, “Multi-level

switching of triple-layered TaOx RRAM with excellent reliability for storage class

memory,” IEEE Symposium on VLSI Technology, pp. 71-72, 2012.

[43] J. Liang and H.-S. P. Wong, “Cross-point memory array without cell selectors—

device characteristics and data storage pattern dependencies,” IEEE Transactions

on Electron Devices, vol. 57, no. 10, pp. 2531–2538, 2010.

[44] J. Liang, S. Yeh, S. S. Wong, and H.-S. P. Wong, “Effect of wordline/bitline

scaling on the performance, energy consumption, and reliability of cross-point

memory array,” ACM Journal on Emerging Technologies in Computing Systems,

vol. 9, no. 1, p. 9, 2013.

[45] Y. Deng, P. Huang, B. Chen, X. Yang, B. Gao, J. Wang, L. Zeng, G. Du, J. F. Kang,

and X. Y. Liu, “RRAM crossbar array with cell selection device : a device and

circuit interaction study,” IEEE Transactions on Electron Devices, vol. 60, no. 2,

pp. 719–726, 2013.

152

[46] M.-J. Lee, Y. Park, B.-S. Kang, S.-E. Ahn, C. Lee, K. Kim, W. Xianyu, G.

Stefanovich, J.-H. Lee, S.-J. Chung, Y.-H. Kim, C.-S. Lee, J.-B. Park, I.-G. Baek,

and I.-K. Yoo, “2-stack 1D-1R cross-point structure with oxide diodes as switch

elements for high density resistance RAM applications,” IEEE International


[47] G. H. Kim, J. H. Lee, Y. Ahn, W. Jeon, S. J. Song, J. Y. Seok, J. H. Yoon, K. J.

Yoon, T. J. Park, and C. S. Hwang, “32×32 crossbar array resistive memory

composed of a stacked Schottky diode and unipolar resistive memory,” Advanced

Functional Materials, vol. 23, no. 11, pp. 1440–1449, 2013.

[48] J.-J. Huang, Y.-M. Tseng, W.-C. Luo, C.-W. Hsu, and T.-H. Hou, “One selector-

one resistor (1S1R) crossbar array for high-density flexible memory applications,”

IEEE International Electron Devices Meeting, pp. 733–736, 2011.

[49] W. Lee, J. Park, J. Shin, J. Woo, S. Kim, G. Choi, S. Jung, S. Park, D. Lee, E. Cha,

H. D. Lee, S. G. Kim, S. Chung, and H. Hwang, “Varistor-type bidirectional

switch (JMAX>107A/cm2, selectivity~104) for 3D bipolar resistive memory arrays,”

IEEE Symposium on VLSI Technology, pp. 37-38, 2012.

[50] M.-J. Lee, Y. Park, D.-S. Suh, E.-H. Lee, S. Seo, D.-C. Kim, R. Jung, B.-S. Kang,

S.-E. Ahn, C. B. Lee, D. H. Seo, Y.-K. Cha, I.-K. Yoo, J.-S. Kim, and B. H. Park,

153

“Two series oxide resistors applicable to high speed and high density nonvolatile

memory,” Advanced Materials, vol. 19, no. 22, pp. 3919–3923, 2007.

[51] M. Son, J. Lee, J. Park, J. Shin, G. Choi, S. Jung, W. Lee, S. Kim, S. Park, and H.

Hwang, “Excellent selector characteristics of nanoscale VO2 for high-density

bipolar ReRAM applications,” IEEE Electron Device Letters, vol. 32, no. 11, pp.

1579–1581, 2011.

[52] C. Ko and S. Ramanathan, “Observation of electric field-assisted phase transition

in thin film vanadium oxide in a metal-oxide-semiconductor device geometry,”

Applied Physics Letters, vol. 93, no. 25, p. 252101, 2008.

[53] X. Liu, S. M. Sadaf, M. Son, J. Shin, J. Park, J. Lee, S. Park, and H. Hwang,

“Diode-less bilayer oxide (WOx-NbOx) device for cross-point resistive memory

applications.,” Nanotechnology, vol. 22, no. 47, p. 475702, 2011.

[54] S. Kim, X. Liu, J. Park, S. Jung, W. Lee, J. Woo, J. Shin, G. Choi, C. Cho, S. Park,

D. Lee, E. Cha, B.-H. Lee, H. D. Lee, S. G. Kim, S. Chung, and H. Hwang,

“Ultrathin Nb2O5/NbO2 hybrid memory with both memory and selector

characteristics for high density 3D vertically stackable RRAM applications,” IEEE

Symposium on VLSI Technology, pp. 155–156, 2012.

[55] K. Gopalakrishnan, R. S. Shenoy, C. T. Rettner, K. Virwani, D. S. Bethune, R. M.

Shelby, G. W. Burr, A. Kellock, R. S. King, K. Nguyen, A. N. Bowers, M. Jurich,

154

B. Jackson, A. M. Friz, T. Topuria, P. M. Rice, and B. N. Kurdi, “Highly-scalable

novel access device based on mixed ionic electronic conduction (MIEC) materials

for high density phase change memory (PCM) arrays,” IEEE Symposium on VLSI

Technology, pp. 205–206, 2010.

[56] A. Goda, K. Parat, and M. T. January, “Scaling directions for 2D , 3D NAND

Flash cells,” IEEE International Electron Devices Meeting, pp. 13–16, 2012.

[57] H. Tanaka, M. Kido, K. Yahashi, M. Oomura, R. Katsumata, M. Kito, Y.

Fukuzumi, M. Sato, Y. Nagata, Y. Matsuoka, Y. Iwata, H. Aochi, and A.

Nitayama, “Bit cost scalable technology with punch and plug process for ultra high

density Flash memory,” IEEE Symposium on VLSI Technology, pp. 14–15, 2007.

[58] W. Kim, S. Choi, J. Sung, T. Lee, C. Park, H. Ko, J. Jung, I. Yoo, and Y. Park,

“Multi-layered vertical gate NAND Flash overcoming stacking limit for terabit

density storage,” IEEE Symposium on VLSI Technology, pp. 188–189, 2009.

[59] J. Jang, H. Kim, W. Cho, H. Cho, J. Kim, S. Il Shim, Y. Jang, J. Jeong, B. Son, D.

W. Kim, K. Kim, J. Shim, J. S. Lim, K. Kim, S. Y. Yi, J. Lim, D. Chung, H. Moon,

S. Hwang, J. Lee, Y. Son, U. Chung, and W. Lee, “Vertical cell array using TCAT

(terabit cell array transistor) technology for ultra high density NAND Flash

memory,” IEEE Symposium on VLSI Technology, pp. 192–193, 2009.

155

[60] M.-J. Lee, S. I. Kim, C. B. Lee, H. Yin, S.-E. Ahn, B. S. Kang, K. H. Kim, J. C.

Park, C. J. Kim, I. Song, S. W. Kim, G. Stefanovich, J. H. Lee, S. J. Chung, Y. H.

Kim, and Y. Park, “Low-temperature-grown transition metal oxide based storage

materials and oxide transistors for high-density non-volatile memory,” Advanced

Functional Materials, vol. 19, no. 10, pp. 1587–1593, 2009.

[61] H. S. Yoon, I. Baek, J. Zhao, H. Sim, M. Y. Park, H. Lee, G. Oh, J. C. Shin, I. Yeo,

and U. Chung, “Vertical cross-point resistance change memory for ultra-high

density non-volatile memory applications,” IEEE Symposium on VLSI Technology,

pp. 26–27, 2009.

[62] I. G. Baek, C. J. Park, H. Ju, D. J. Seong, H. S. Ahn, J. H. Kim, M. K. Yang, S. H.

Song, E. M. Kim, S. O. Park, C. H. Park, C. W. Song, G. T. Jeong, S. Choi, H. K.

Kang, and C. Chung, “Realization of vertical resistive memory (VRRAM) using

cost effective 3D process,” IEEE International Electron Devices Meeting, pp.

737–740, 2011.

[63] W. C. Chien, F. M. Lee, Y. Y. Lin, M. H. Lee, S. H. Chen, C. C. Hsieh, E. K. Lai,

H. H. Hui, Y. K. Huang, C. C. Yu, C. F. Chen, H. L. Lung, K. Y. Hsieh, and C.-Y.

Lu, “Multi-layer sidewall WOx resistive memory suitable for 3D ReRAM,” IEEE


156

[64] H.-Y. Chen, S. Yu, B. Gao, P. Huang, J. F. Kang, and H.-S. P. Wong, “HfOx based

vertical resistive random access memory for cost-effective 3D cross-point

architecture without cell selector,” IEEE International Electron Devices Meeting,

pp. 497–500, 2012.

[65] S. Yu, H. Chen, Y. Deng, B. Gao, Z. Jiang, J. F. Kang, and H.-S. P. Wong, “3D

vertical RRAM – scaling limit analysis and demonstration of 3D array operation,”

IEEE Symposium on VLSI Technology, pp. 158–159, 2013.

[66] P.-F. Chiu, M.-F. Chang, C.-W. Wu, C.-H. Chuang, S.-S. Sheu, Y.-S. Chen, and

M.-J. Tsai, “Low store energy, low VDDmin, 8T2R nonvolatile latch and SRAM

with vertical-stacked resistive memory (memristor) devices for low power mobile

applications,” IEEE Journal of Solid-State Circuits, vol. 47, no. 6, pp. 1483–1496,

2012.

[67] Y. Y. Liauw, Z. Zhang, W. Kim, A. E. Gamal, and S. S. Wong, “Nonvolatile 3D-

FPGA with monolithically stacked RRAM-based configuration memory,” IEEE

International Solid-State Circuits Conference, pp. 406–408, 2012.

[68] T. Fujii, M. Kawasaki, A. Sawa, H. Akoh, Y. Kawazoe, and Y. Tokura,

“Hysteretic current–voltage characteristics and resistance switching at an epitaxial

oxide Schottky junction SrRuO3∕SrTi0.99Nb0.01O3,” Applied Physics Letters, vol. 86,

no. 1, p. 012107, 2005.

157

[69] Y. Xia, W. He, L. Chen, X. Meng, and Z. Liu, “Field-induced resistive switching

based on space-charge-limited current,” Applied Physics Letters, vol. 90, no. 2, p.

022907, 2007.

[70] R. Waser, R. Dittmann, G. Staikov, and K. Szot, “Redox-based resistive switching

memories - nanoionic mechanisms, prospects, and challenges,” Advanced

Materials, vol. 21, no. 25–26, pp. 2632–2663, 2009.

[71] J. Y. Son and Y.-H. Shin, “Direct observation of conducting filaments on resistive

switching of NiO thin films,” Applied Physics Letters, vol. 92, no. 22, p. 222106,

2008.

[72] D.-H. Kwon, K. M. Kim, J. H. Jang, J. M. Jeon, M. H. Lee, G. H. Kim, X.-S. Li,

G.-S. Park, B. Lee, S. Han, M. Kim, and C. S. Hwang, “Atomic structure of

conducting nanofilaments in TiO2 resistive switching memory.,” Nature

Nanotechnology, vol. 5, no. 2, pp. 148–53, 2010.

[73] X. Cartoixa, R. Rurali, and J. Sune, “Transport properties of oxygen vacancy

filaments in metal/crystalline or amorphous HfO2/metal structures,” Physical

Review B, vol. 86, no. 16, p. 165445, 2012.

[74] P. Calka, E. Martinez, V. Delaye, D. Lafond, G. Audoit, D. Mariolle, N. Chevalier,

H. Grampeix, C. Cagli, V. Jousseaume, and C. Guedj, “Chemical and structural

158

properties of conducting nanofilaments in TiN/HfO2-based resistive switching

structures.,” Nanotechnology, vol. 24, no. 8, p. 085706, 2013.

[75] S. Privitera, G. Bersuker, B. Butcher, A. Kalantarian, S. Lombardo, C. Bongiorno,

R. Geer, D. C. Gilmer, and P. D. Kirsch, “Microscopy study of the conductive

filament in HfO2 resistive switching memory devices,” Microelectronic

Engineering, vol. 109, pp. 75–78, 2013.

[76] C.-Y. Lin, S.-Y. Wang, D.-Y. Lee, and T.-Y. Tseng, “Electrical properties and

fatigue behaviors of ZrO2 resistive switching thin films,” Journal of The

Electrochemical Society, vol. 155, no. 8, p. H615, 2008.

[77] Q. Liu, W. Guan, S. Long, R. Jia, M. Liu, and J. Chen, “Resistive switching

memory effect of ZrO2 films with Zr+ implanted,” Applied Physics Letters, vol. 92,

no. 1, p. 012117, 2008.

[78] H. Y. Lee, P. Chen, T. Wu, Y. S. Chen, F. Chen, C. Wang, P. Tzeng, C. H. Lin, M.

Tsai, and C. Lien, “HfOx bipolar resistive memory with robust endurance using

AlCu as buffer electrode,” IEEE Electron Device Letters, vol. 30, no. 7, pp. 703–

705, 2009.

[79] S. Yu, X. Guan, and H.-S. P. Wong, “Conduction mechanism of TiN/HfOx/Pt

resistive switching memory: A trap-assisted-tunneling model,” Applied Physics

Letters, vol. 99, no. 6, p. 063507, 2011.

159

[80] G.-H. Buh, I. Hwang, and B. H. Park, “Time-dependent electroforming in NiO

resistive switching devices,” Applied Physics Letters, vol. 95, no. 14, p. 142101,

2009.

[81] M. Fujimoto, H. Koyama, M. Konagai, Y. Hosoi, K. Ishihara, S. Ohnishi, and N.

Awaya, “TiO2 anatase nanolayer on TiN thin film exhibiting high-speed bipolar

resistive switching,” Applied Physics Letters, vol. 89, no. 22, p. 223509, 2006.

[82] G. Bersuker, D. C. Gilmer, D. Veksler, J. Yum, H. Park, S. Lian, L. Vandelli, A.

Padovani, L. Larcher, K. Mckenna, A. Shluger, V. Iglesias, M. Porti, M. Nafria, W.

Taylor, P. D. Kirsch, and R. Jammy, “Metal oxide RRAM switching mechanism

based on conductive filament microscopic properties,” IEEE International


[83] B. Gao, J. F. Kang, Y. S. Chen, F. F. Zhang, B. Chen, P. Huang, L. F. Liu, X. Y.

Liu, Y. Y. Wang, X. A. Tran, Z. R. Wang, H. Y. Yu, and A. Chin, “Oxide-based

RRAM: unified microscopic principle for both unipolar and bipolar switching,”

IEEE International Electron Devices Meeting, pp. 417–420, 2011.

[84] B. Gao, H. W. Zhang, S. Yu, B. Sun, L. F. Liu, X. Y. Liu, Y. Wang, R. Q. Han, J.

F. Kang, B. Yu, and Y. Y. Wang, “Oxide-based RRAM: uniformity improvement

using a new material-oriented methodology,” IEEE Symposium on VLSI

Technology, pp. 30–31, 2009.

160

[85] W. J. Zhu, T.-P. Ma, T. Tamagawa, J. Kim, and Y. Di, “Current transport in

metal/hafnium oxide/silicon structure,” IEEE Electron Device Letters, vol. 23, no.

2, pp. 97–99, 2002.

[86] K. Xiong, J. Robertson, M. C. Gibson, and S. J. Clark, “Defect energy levels in

HfO2 high-dielectric-constant gate oxide,” Applied Physics Letters, vol. 87, no. 18,

p. 183505, 2005.

[87] H. Takeuchi, D. Ha, and T.-J. King, “Observation of bulk HfO2 defects by

spectroscopic ellipsometry,” Journal of Vacuum Science & Technology A: Vacuum,

Surfaces, and Films, vol. 22, no. 4, pp. 1337-1341, 2004.

[88] X. A. Tran, H. Y. Yu, Y. C. Yeo, L. Wu, W. J. Liu, Z. R. Wang, Z. Fang, K. L.

Pey, X. W. Sun, A. Y. Du, B. Y. Nguyen, and M. F. Li, “A High-yield HfOx-based

unipolar resistive RAM employing Ni electrode compatible with Si-diode selector

for crossbar integration,” IEEE Electron Device Letters, vol. 32, no. 3, pp. 396–

398, 2011.

[89] D. S. Jeong and C. S. Hwang, “Tunneling-assisted Poole-Frenkel conduction

mechanism in HfO2 thin films,” Journal of Applied Physics, vol. 98, no. 11, p.

113701, 2005.

161

[90] M. Nardone, V. Kozub, I. Karpov, and V. Karpov, “Possible mechanisms for 1/f

noise in chalcogenide glasses: a theoretical description,” Physical Review B, vol.

79, no. 16, p. 165206, 2009.

[91] J.-K. Lee, H. Y. Jeong, I.-T. Cho, J. Y. Lee, S.-Y. Choi, H.-I. Kwon, and J.-H. Lee,

“Conduction and low-frequency noise analysis in Al/α-TiOx/Al bipolar switching

resistance random access memory devices,” IEEE Electron Device Letters, vol. 31,

no. 6, pp. 603–605, 2010.

[92] S. B. Lee, S. Park, J. S. Lee, S. C. Chae, S. H. Chang, M. H. Jung, Y. Jo, B. Kahng,

B. S. Kang, M.-J. Lee, and T. W. Noh, “Large 1/f noise of unipolar resistance

switching and its percolating nature,” Applied Physics Letters, vol. 95, no. 12, p.

122112, 2009.

[93] G. Jegert, A. Kersch, W. Weinreich, U. Schroder, and P. Lugli, “Modeling of

leakage currents in high-κ dielectrics: three-dimensional approach via kinetic

Monte Carlo,” Applied Physics Letters, vol. 96, no. 6, p. 062113, 2010.

[94] C. Walczyk, C. Wenger, R. Sohal, M. Lukosius, A. Fox, J. Dabrowski, D.

Wolansky, B. Tillack, H.-J. Mussig, and T. Schroeder, “Pulse-induced low-power

resistive switching in HfO2 metal-insulator-metal diodes for nonvolatile memory

applications,” Journal of Applied Physics, vol. 105, no. 11, p. 114103, 2009.

162

[95] J. T. S. Irvine, D. C. Sinclair, and A. R. West, “Electroceramics: characterization

by impedance spectroscopy,” Advanced Materials, vol. 2, no. 3, pp. 132–138,

1990.

[96] Y.-H. You, B.-S. So, J.-H. Hwang, W. Cho, S. S. Lee, T.-M. Chung, C. G. Kim,

and K.-S. An, “Impedance spectroscopy characterization of resistance switching

NiO thin films prepared through atomic layer deposition,” Applied Physics Letters,

vol. 89, no. 22, p. 222105, 2006.

[97] M. H. Lee, K. M. Kim, G. H. Kim, J. Y. Seok, S. J. Song, J. H. Yoon, and C. S.

Hwang, “Study on the electrical conduction mechanism of bipolar resistive

switching TiO2 thin films using impedance spectroscopy,” Applied Physics Letters,

vol. 96, no. 15, p. 152909, 2010.

[98] A. K. Jonscher, “Dielectric relaxation in solids,” Journal of Physics D: Applied

Physics, vol. 32, p. R57, 1999.

[99] S. R. Elliott, “A.C. conduction in amorphous chalcogenide and pnictide

semiconductors,” Advances in Physics, vol. 36, no. 2, pp. 135–217, 1987.

[100] P. Gonon, M. Mougenot, C. Vallee, C. Jorel, V. Jousseaume, H. Grampeix, and F.

E. Kamel, “Resistance switching in HfO2 metal-insulator-metal devices,” Journal

of Applied Physics, vol. 107, no. 7, p. 074507, 2010.

163

[101] B. Gao, J. F. Kang, L. F. Liu, X. Y. Liu, and B. Yu, “A physical model for bipolar

oxide-based resistive switching memory based on ion-transport-recombination

effect,” Applied Physics Letters, vol. 98, no. 23, p. 232108, 2011.

[102] T. Tsuruoka, K. Terabe, T. Hasegawa, and M. Aono, “Forming and switching

mechanisms of a cation-migration-based oxide resistive memory,”

Nanotechnology, vol. 21, no. 42, p. 425205, 2010.

[103] D. B. Strukov and R. S. Williams, “Exponential ionic drift: fast switching and low

volatility of thin-film memristors,” Applied Physics A, vol. 94, no. 3, pp. 515–519,

2008.

[104] B. Gao, J. F. Kang, H. W. Zhang, B. Sun, B. Chen, L. F. Liu, X. Y. Liu, R. Q. Han,

Y. Y. Wang, B. Yu, Z. Fang, H. H. Yu, and D.-L. Kwong, “Oxide-based RRAM:

physical based retention projection,” IEEE European Solid-State Device Research

Conference, pp. 392–395, 2010.

[105] J. McPherson, J.-Y. Kim, A. Shanware, and H. Mogul, “Thermochemical

description of dielectric breakdown in high dielectric constant materials,” Applied

Physics Letters, vol. 82, no. 13, pp. 2121-2123, 2003.

[106] B. Gao, B. Sun, H. W. Zhang, L. F. Liu, X. Y. Liu, R. Q. Han, J. F. Kang, and B.

Yu, “Unified physical model of bipolar oxide-based resistive switching memory,”

IEEE Electron Device Letters, vol. 30, no. 12, pp. 1326–1328, 2009.

164

[107] N. Xu, B. Gao, L. F. Liu, B. Sun, X. Y. Liu, R. Q. Han, J. F. Kang, and B. Yu, “A

unified physical model of switching behavior in oxide-based RRAM,” IEEE


[108] K. Kinoshita, K. Tsunoda, Y. Sato, H. Noshiro, S. Yagaki, M. Aoki, and Y.

Sugiyama, “Reduction in the reset current in a resistive random access memory

consisting of NiOx brought about by reducing a parasitic capacitance,” Applied

Physics Letters, vol. 93, no. 3, p. 033506, 2008.

[109] Y. S. Chen, H. Y. Lee, P. S. Chen, P. Y. Gu, C. W. Chen, W. P. Lin, W. H. Liu, Y.

Y. Hsu, S. S. Sheu, P. C. Chiang, W. S. Chen, F. T. Chen, C. H. Lien, and M.-J.

Tsai, “Highly scalable hafnium oxide memory with improvements of resistive

distribution and read disturb immunity,” IEEE International Electron Devices

Meeting, pp. 105–108, 2009.

[110] H. Y. Lee, Y. S. Chen, P. S. Chen, P. Y. Gu, Y. Y. Hsu, S. M. Wang, W. H. Liu, C.

H. Tsai, S. S. Sheu, P. C. Chiang, W. P. Lin, C. H. Lin, W. S. Chen, F. T. Chen, C.

H. Lien, and M.-J. Tsai, “Evidence and solution of over-RESET problem for HfOx

based resistive memory with sub-ns switching speed and high endurance,” IEEE

International Electron Devices Meeting, pp. 460–461, 2010.

[111] Z. Fang, H. Y. Yu, X. Li, N. Singh, G. Q. Lo, and D. L. Kwong,

“HfOx/TiOx/HfOx/TiOx multilayer-based forming-free RRAM devices with

165

excellent uniformity,” IEEE Electron Device Letters, vol. 32, no. 4, pp. 566–568,

2011.

[112] X. Guan, S. Yu, and H.-S. P. Wong, “A SPICE compact model of metal oxide

resistive switching memory with variations,” IEEE Electron Device Letters, vol.

33, no. 10, pp. 1405–1407, 2012.

[113] U. Russo, D. Ielmini, C. Cagli, A. L. Lacaita, “Self-accelerated thermal dissolution

model for reset programming in unipolar resistive-switching memory (RRAM)

devices,” IEEE Transactions on Electron Devices, vol. 56, no. 2, pp. 193–200,

2009.

[114] C. Cagli, F. Nardi, and D. Ielmini, “Modeling of set/reset operations in NiO-based

resistive-switching memory devices,” IEEE Transactions on Electron Devices, vol.

56, no. 8, pp. 1712–1720, 2009.

[115] U. Russo, D. Ielmini, C. Cagli, A. L. Lacaita, “Filament conduction and reset

mechanism memory (RRAM) devices,” IEEE Transactions on Electron Devices,

vol. 56, no. 2, pp. 186–192, 2009.

[116] Y. S. Chen, J. F. Kang, B. Chen, B. Gao, L. F. Liu, X. Y. Liu, Y. Y. Wang, L. Wu,

H. Y. Yu, J. Y. Wang, Q. Chen, and E. G. Wang, “Microscopic mechanism for

unipolar resistive switching behaviour of nickel oxides,” Journal of Physics D:

Applied Physics, vol. 45, no. 6, p. 065303, 2012.

166

[117] G. Bersuker, D. C. Gilmer, D. Veksler, P. Kirsch, L. Vandelli, A. Padovani, L.

Larcher, K. McKenna, A. Shluger, V. Iglesias, M. Porti, and M. Nafria, “Metal

oxide resistive memory switching mechanism based on conductive filament

properties,” Journal of Applied Physics, vol. 110, no. 12, p. 124518, 2011.

[118] L. Vandelli, A. Padovani, L. Larcher, G. Broglia, G. Ori, M. Montorsi, G.

Bersuker, and P. Pavan, “Comprehensive physical modeling of forming and

switching operations in HfO2 RRAM devices,” IEEE International Electron

Devices Meeting, pp. 421–424, 2011.

[119] R. Degraeve, A. Fantini, S. Clima, B. Govoreanu, L. Goux, Y. Y. Chen, D. J.

Wouters, P. Roussel, G. S. Kar, G. Pourtois, S. Cosemans, J. A. Kittl, G.

Groeseneken, M. Jurczak, and L. Altimime, “Dynamic ‘hour glass’ model for set

and reset in HfO2 RRAM,” IEEE Symposium on VLSI Technology, pp. 75–76,

2012.

[120] C. Mead, “Neuromorphic electronic systems,” Proceedings of the IEEE, vol. 78,

no. 10, pp. 1629–1636, 1990.

[121] C.-S. Poon, and K. Zhou, “Neuromorphic silicon neurons and large-scale neural

networks: challenges and opportunities.,” Frontiers in Neuroscience, vol. 5, p. 108,

2011.

167

[122] Q. V. Le, M. A. Ranzato, M. Devin, G. S. Corrado, and A. Y. Ng, “Building high-

level features using large scale unsupervised learning,” International Conference

in Machine Learning, 2012.

[123] R. Ananthanarayanan, S. K. Esser, H. D. Simon, and D. S. Modha, “The cat is out

of the bag : cortical simulations with 109 neurons, 1013 synapses,” Conference on

High Performance Computing Networking, Storage and Analysis, 2009.

[124] G. Indiveri, E. Chicca, and R. Douglas, “A VLSI array of low-power spiking

neurons and bistable synapses with spike-timing dependent plasticity,” IEEE

Transactions on Neural Networks, vol. 17, no. 1, pp. 211–21, 2006.

[125] P. Merolla, J. Arthur, F. Akopyan, N. Imam, R. Manohar, and D. S. Modha, “A

digital neurosynaptic core using embedded crossbar memory with 45pJ per spike

in 45nm,” IEEE Custom Integrated Circuits Conference, 2011.

[126] G. S. Snider, “Self-organized computation with unreliable, memristive

nanodevices,” Nanotechnology, vol. 18, no. 36, p. 365202, 2007.

[127] O. Bichler, M. Suri, D. Querlioz, D. Vuillaume, B. DeSalvo, and C. Gamrat,

“Visual pattern extraction using energy-efficient ‘2-PCM synapse’ neuromorphic

architecture,” IEEE Transactions on Electron Devices, vol. 59, no. 8, pp. 2206–

2214, 2012.

168

[128] D. Kuzum, R. G. D. Jeyasingh, B. Lee, and H.-S. P. Wong, “Nanoelectronic

programmable synapses based on phase change materials for brain-inspired

computing,” Nano Letters, vol. 12, no. 5, pp. 2179–2186, 2012.

[129] S. H. Jo, T. Chang, I. Ebong, B. B. Bhadviya, P. Mazumder, and W. Lu,

“Nanoscale memristor device as synapse in neuromorphic systems,” Nano letters,

vol. 10, no. 4, pp. 1297–1301, 2010.

[130] T. Ohno, T. Hasegawa, T. Tsuruoka, K. Terabe, J. K. Gimzewski, and M. Aono,

“Short-term plasticity and long-term potentiation mimicked in single inorganic

synapses.,” Nature Materials, vol. 10, no. 8, pp. 591–595, 2011.

[131] K. Seo, I. Kim, S. Jung, M. Jo, S. Park, J. Park, J. Shin, K. P. Biju, J. Kong, K. Lee,

B. Lee, and H. Hwang, “Analog memory and spike-timing-dependent plasticity

characteristics of a nanoscale titanium oxide bilayer resistive switching device,”

Nanotechnology, vol. 22, no. 25, p. 254023, 2011.

[132] T. Chang, S.-H. Jo, and W. Lu, “Short-term memory to long-term memory

transition in a nanoscale memristor,” ACS Nano, vol. 5, no. 9, pp. 7669–7676,

2011.

[133] E. Kandel, J. Schwartz, and T. Jessell, Principles of Neural Science, Fourth

Edition. McGraw-Hill Medical, 2000.

169

[134] S. Song, K. D. Miller, and L. F. Abbott, “Competitive Hebbian learning through

spike-timing-dependent synaptic plasticity,” Nature Neuroscience, vol. 3, no. 9, pp.

919–926, 2000.

[135] G. Q. Bi, and M. M. Poo, “Synaptic modifications in cultured hippocampal

neurons: dependence on spike timing, synaptic strength, and postsynaptic cell type,”

Journal of Neuroscience, vol. 18, no. 24, pp. 10464–10472, 1998.

[136] W. Senn, and S. Fusi, “Convergence of stochastic learning in perceptrons with

binary synapses,” Physical Review E, vol. 71, no. 6, p. 061907, 2005.

[137] C. Zamarreno-Ramos, L. A Camunas-Mesa, J. A Perez-Carrasco, T. Masquelier, T.

Serrano-Gotarredona, and B. Linares-Barranco, “On spike-timing-dependent-

plasticity, memristive devices, and building a self-learning visual cortex,”

Frontiers in Neuroscience, vol. 5, p. 26, 2011.

[138] M. Suri, O. Bichler, D. Querlioz, G. Palma, E. Vianello, D. Vuillaume, C. Gamrat,

and B. DeSalvo, “CBRAM devices as binary synapses for low-power stochastic

neuromorphic systems: auditory (cochlea) and visual (retina) cognitive processing

applications,” IEEE International Electron Devices Meeting, pp. 235–238, 2012.

[139] D. Querlioz, O. Bichler, and C. Gamrat, “Simulation of a memristor-based spiking

neural network immune to device variations,” International Joint Conference on

Neural Networks, pp. 1775–1781, 2011.

170

[140] D. Kuzum, R. G. D. Jeyasingh, S. Yu, and H.-S. P. Wong, “Low-energy robust

neuromorphic computation using synaptic devices,” IEEE Transactions on

Electron Devices, vol. 59, no. 12, pp. 3489–3494, 2012.

171

Author’s Biography

Shimeng Yu received the B.S. degree in microelectronics from Peking University in

2009, and the M.S. degree and Ph.D. degree in electrical engineering from Stanford

University in 2011, and in 2013, respectively. He joined Arizona State University as

assistant professor of computer engineering and electrical engineering in the fall of 2013.

He did summer internships in IMEC, Belgium in 2011, and IBM TJ Watson Research

Center, Yorktown Heights, New York, in 2012.

He has been working on the fabrication, characterization, and modeling of oxide based

resistive random access memory (RRAM) since 2008. He has also pioneered the use of

artificial synaptic device design for neuromorphic applications. His works have been

widely recognized in this field. By the end of his PhD study, his publications have been

cited over 500 times, with h-index 14.

His general research interests are the fundamental physics exploration of the emerging

device technologies and their applications for future computing paradigms, such as 3D

memory integration, embedded memory, logic-in-memory, reconfigurable computing,

bio-inspired neuromorphic computing, etc.

He has been the recipients of numerous awards due to his outstanding research

achievements, including Stanford Graduate Fellowship (SGF) from 2009 to 2012, IEEE

Electron Device Society Masters Student Fellowship in 2010 and PhD Student

Fellowship in 2012, Chinese Government Fellowship for Outstanding Self-Financed

Students Abroad in 2012, Qualcomm Fellowships for the 2013 Telluride Workshop, etc.

172

Publications

Book chapters

1) S. Yu, B. Lee, and H.-S. P. Wong, “Metal oxide resistive switching memory”,

Functional Metal Oxide Nanostructures, Springer, 2011, invited review.

Journal papers:

1) S. Yu, B. Gao, Z. Fang, H. Y. Yu, J. F. Kang, and H.-S. P. Wong, “A low energy

oxide-based electronic synaptic device for neuromorphic visual system with

tolerance to device variation”, Adv. Mater., vol. 25, no. 12, pp. 1774-1779, 2013.

2) S. Yu, H.-Y. Chen, B. Gao, J. F. Kang, and H.-S. P. Wong, “A HfOx based

vertical resistive switching random access memory for bit-cost-effective three-

dimensional cross-point architecture”, ACS Nano, vol. 7, no. 3, pp. 2320-2325,

2013.

3) H. Tian, H.-Y. Chen, B. Gao, S. Yu, J. Liang, Y. Yang, D. Xie, J. Kang, T.-L.

Ren, Y. Zhang, and H.-S. P. Wong, “Monitoring oxygen movement by Raman

spectroscopy of resistive random access memory with a graphene-inserted

electrode”, Nano Lett., vol. 13, no. 2, pp. 651-657, 2013.

4) D. Kuzum, S. Yu, and H.-S. P. Wong, “Synaptic electronics: materials, devices

and applications”, Nanotechnology, 2013, invited review.

5) S. Yu, R. Jeyasingh, Y. Wu, and H.-S. P. Wong, “Characterization of low-

frequency noise in the resistive switching of transition metal oxide HfO2”, Phys.

Rev. B, vol. 85, 045324, 2012.

6) X. Guan, S. Yu, and H.-S. P. Wong, “On the switching parameter variation of

metal oxide RRAM - part I: physical modeling and simulation methodology”,

IEEE Trans. Electron Devices, vol. 59, no. 4, pp. 1172-1182, 2012.

173

7) S. Yu, X. Guan, and H.-S. P. Wong, “On the switching parameter variation of

metal oxide RRAM - part II: model corroboration and device design strategy”,

IEEE Trans. Electron Devices, vol. 59 no. 4, pp. 1183-1189, 2012.

8) S. Yu, Y. Y. Chen, X. Guan, H.-S. P. Wong, J. A. Kittl, “A Monte Carlo study of

the low resistance state retention of HfOx based resistive switching memory”,

Appl. Phys. Lett., vol. 100, 043507, 2012.

9) X. Guan, S. Yu, and H.-S. P. Wong, “A SPICE compact model of metal oxide

resistive switching memory with variations”, IEEE Electron Device Lett., vol.

33, no. 10, pp. 1405-1407, 2012.

10) D. Kuzum, R. Jeyasingh, S. Yu, and H.-S. P. Wong, “Low energy, robust

neuromorphic computation using synaptic devices”, IEEE Trans. Electron

Devices, vol. 59, no. 12, pp. 3489-3494, 2012.

11) H.-S. P. Wong, H.-Y. Lee, S. Yu, Y.-S. Chen, Y. Wu, P.-S. Chen, B. Lee, F. T.

Chen, and M.-J. Tsai, “Metal oxide RRAM”, Proc. IEEE, vol. 100, no. 6, pp.

1951-1970, 2012, invited review.

12) S. Yu, R. Jeyasingh, Y. Wu, and H.-S. P. Wong, “AC conductance measurement

and analysis of the conduction processes in HfOx based resistive switching

memory”, Appl. Phys. Lett., vol. 99, 232105, 2011.

13) S. Yu, X. Guan, and H.-S. P. Wong, “Conduction mechanism of TiN/HfOx/Pt

resistive switching memory: a trap-assisted-tunneling model”, Appl. Phys. Lett.,

vol. 99, 063507, 2011.

14) S. Yu, Y. Wu, and H.-S. P. Wong, “Investigating the switching dynamics and

multilevel capability of bipolar metal oxide resistive switching memory”, Appl.

Phys. Lett., vol. 98, 103514, 2011.

15) S. Yu, Y. Wu, R. Jeyasingh, D. Kuzum, and H.-S. P. Wong, “An electronic

synapse device based on metal oxide resistive switching memory for

neuromorphic computation”, IEEE Trans. Electron Devices, vol. 58, no. 8, pp.

2729-2737, 2011.

174

16) S. Yu, and H.-S. P. Wong, “Compact modeling of conducting bridge random

access memory (CBRAM)”, IEEE Trans. Electron Devices, vol. 58, no. 5, pp.

1352-1360, 2011.

17) Y. Wu, S. Yu, B. Lee, and H.-S. P. Wong, “Low-power TiN/Al2O3/Pt resistive

switching device with sub-20 µA switching current and gradual resistance

modulation”, J. Appl. Phys. vol. 110, 094104, 2011.

18) Y. Chai, Y. Wu, K. Takei, H.-Y. Chen, S. Yu, P. C. H. Chan, A. Javey, and H.-S.

P. Wong, “Nanoscale bipolar and complementary resistive switching memory

based on amorphous carbon”, IEEE Trans. Electron Devices, vol. 58, no. 11, pp.

3933-3939, 2011.

19) S. Yu, and H.-S. P. Wong, “A phenomenological model for the reset mechanism

of metal oxide RRAM”, IEEE Electron Device Lett. vol. 31, no. 12, pp.1455-

1457, 2010.

20) S. Yu, J. Liang, Y. Wu, and H.-S. P. Wong, “Read/write schemes analysis for the

novel complementary resistive switches in passive crossbar memory arrays”,

Nanotechnology, vol. 21, 465202, 2010.

Conference papers:

1) S. Yu, H.-Y. Chen, Y. Deng, B. Gao, Z. Jiang, J. F. Kang, and H.-S. P. Wong,

“3D vertical RRAM - scaling limit analysis and demonstration of 3D array

operation”, Symposium on VLSI Technology 2013, pp. 158-159, Kyoto, Japan.

2) D. Kuzum, R. J. D. Jeyasingh, S. B. Eryilmaz, S. Yu, and H.-S. P. Wong,

“Programming phase change synaptic devices for neuromorphic computation”,

MRS Spring Meeting 2013, San Francisco, USA, invited.

3) S. Yu, and H.-S. P. Wong, “Characterization and modeling of the conduction and

switching mechanism of HfOx based RRAM”, MRS Fall Meeting 2013, Boston,

USA, invited.

175

4) Y. Wu, S. Yu, H.-Y. Chen, J. Liang, Z. Jiang, and H.-S. P. Wong, "Resistive

switching random access memory (RRAM): materials, device, scaling, and array

design”, 60th International Symposium of the American Vacuum Society (AVS)

2013, Long Beach, CA, USA, invited.

5) H.-Y. Chen, S. Yu, Y. Wu, and H.-S. P. Wong, “3D vertical RRAM architecture

and electrode/oxide interface engineering for next generation mass

storage”, International Conference on Solid State Devices and Materials (SSDM)

2013, Fukuoka, Japan, invited.

6) C.-S. Lee, S. Yu, X. Guan, J. Luo, L. Wei, and H.-S. P. Wong, “Compact models

of emerging devices”, International Conference of Electron Devices and Solid-

State Circuits (EDSSC) 2013, Hong Kong, invited.

7) S. Yu, B. Gao, Z. Fang, H. Y. Yu, J. F. Kang, and H.-S. P. Wong, “A

neuromorphic visual system using RRAM synaptic devices with sub-pJ energy

and tolerance to variability: experimental characterization and large-scale

modeling”, IEEE International Electron Devices Meeting (IEDM) 2012, pp. 239-

242, San Francisco, USA.

8) S. Yu, X. Guan, and H.-S. P. Wong, “Understanding metal oxide RRAM current

overshoot and reliability using Kinetic Monte Carlo simulation”, IEEE

International Electron Devices Meeting (IEDM) 2012, pp. 585-588, San

Francisco, USA.

9) H.-Y. Chen, S. Yu, B. Gao, P. Huang, J. F. Kang, and H.-S. P. Wong, “HfOx

based vertical RRAM for cost-effective 3D cross-point architecture without cell

selector”, IEEE International Electron Devices Meeting (IEDM) 2012, pp. 497-

500, San Francisco, USA.

10) H.-Y. Chen, H. Tian, B. Gao, S. Yu, J. Liang, J. Kang, Y. Zhang, T.-L. Ren, and

H.-S. P. Wong, “Electrode/oxide interface engineering by inserting single-layer

graphene: application for HfOx-based resistive random access memory”, IEEE

176

International Electron Devices Meeting (IEDM) 2012, pp. 489-492, San

Francisco, USA.

11) Y. Wu, S. Yu, H.-S. P. Wong, Y.-S. Chen, H.-Y. Lee, S.-M. Wang, P.-Y. Gu, F.

Chen, and M.-J. Tsai, “AlOx-based resistive switching device with gradual

resistance modulation for neuromorphic device application”, IEEE International

Memory Workshop (IMW) 2012, pp. 111-114, Milan, Italy.

12) H.-S. P. Wong, X. Guan, D. Kuzum, R. Jeyasingh, and S. Yu, “Variability in

emerging memory devices: physical understanding, modeling, and mitigation”,

IEEE Workshop on Variability Modeling and Characterization (VMC) 2012, San

Jose, USA, invited.

13) Y. Wu, J. Liang, S. Yu, X. Guan, and H.-S. P. Wong, “Resistive switching

random access memory - materials, device, interconnects, and scaling

considerations”, IEEE International Integrated Reliability Workshop (IIRW)

2012, Lake Tahoe, USA, invited.

14) Y. Wu, S. Yu, X. Guan, and H.-S. P. Wong, “Recent progress of resistive

switching random access memory (RRAM)”, IEEE Silicon Nanoelectronics

Workshop (SNW) 2012, Hawaii, USA, invited.

15) X. Guan, S. Yu, and H.-S. P. Wong, “On the variability of HfOx RRAM: from

numerical simulation to compact modeling”, IEEE Workshop on Compact

Modeling (WCM) 2012, Santa Clara, CA, USA, invited.

16) S. Yu, R. Jeyasingh, Y. Wu, and H.-S. P. Wong, “Understanding the conduction

and switching mechanism of metal oxide RRAM through low frequency noise

and AC conductance measurement and analysis”, IEEE International Electron

Devices Meeting (IEDM) 2011, pp. 275-278, Washington DC, USA.

17) S. Yu, X. Guan, and H.-S. P. Wong, “On the stochastic nature of resistive

switching in metal oxide RRAM: physical modeling, Monte Carlo simulation, and

experimental characterization”, IEEE International Electron Devices Meeting

(IEDM) 2011, pp. 413-416, Washington DC, USA.

177

18) S. Yu, Y. Wu, Y. Chai, J. Provine and H.-S. P. Wong, “Characterization of

switching parameters and multilevel capability in HfOx/AlOx bi-layer RRAM

devices”, International Symposium on VLSI Technology, Systems and

Applications (VLSI-TSA) 2011, pp. 106-107, Hsinchu, Taiwan.

19) Y. Wu, Y. Chai, H.-Y. Chen, S. Yu, and H.-S. P. Wong, “Resistive switching

AlOx-based memory with CNT electrode for ultra-low switching current and high

density memory application”, Symposium on VLSI Technology 2011, pp. 26-27,

Kyoto, Japan.

20) H.-S. P. Wong, S. Kim, B. Lee, M. A. Caldwell, J. Liang, Y. Wu, R. Jeyasingh,

and S. Yu, “Recent progress of phase change memory (PCM) and resistive

switching random access memory (RRAM)”, IEEE International Memory

Workshop (IMW) 2011, pp. 10-14, Monterey, USA, invited.

21) S. Yu, and H.-S. P. Wong, “Modeling the switching dynamics of programmable-

metallization cell (PMC) memory and its application as synapse device for a

neuromorphic computation system”, IEEE International Electron Devices

Meeting (IEDM) 2010, pp. 520-523, San Francisco, USA.

22) S. Yu, and H.-S. P. Wong, “A phenomenological model of oxygen ion transport

for metal oxide resistive switching memory”, IEEE International Memory

Workshop (IMW) 2010, pp. 54-57, Seoul, Korea.

23) Y. Chai, Y. Wu, K. Takei, H.-Y. Chen, S. Yu, P. C. H. Chan, A. Javey, and H.-S.

P. Wong, “Resistive switching of carbon-based RRAM with CNT electrodes for

ultra-dense memory”, IEEE International Electron Devices Meeting (IEDM)

2010, pp. 214-217, San Francisco, USA.

24) H.-S. P. Wong, S. Kim, B. Lee, M. A. Caldwell, J. Liang, Y. Wu, R. Jeyasingh,

and S. Yu, “Recent progress of phase change memory (PCM) and resistive

switching random access memory (RRAM)”, IEEE International Conference on

Solid-State and Integrated Circuits Technology (ICSICT) 2010, pp. 1055-1060,

Shanghai, China, invited.

178

25) Y. Wu, S. Yu, B. Lee and H.-S. P. Wong, “Gradual set and reset in TiN/Al2O3/Pt

resistive switching device with sub-20 µA current”, MRS Fall Meeting 2010,

paper K4.2, Boston, USA.

Date post:	08-Jun-2020
Category:	Documents
Upload:	others
View:	3 times
Download:	0 times

RESISTIVE SWITCHING MEMORY FOR NON-VOLATILE … › file › druid:xh257vm1382...Conventional...

Documents