0% found this document useful (0 votes)
74 views

Perspectives of Racetrack Memory Based On Current-Induced Domain Wall Motion: From Device To System

zhang

Uploaded by

janet
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
74 views

Perspectives of Racetrack Memory Based On Current-Induced Domain Wall Motion: From Device To System

zhang

Uploaded by

janet
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Perspectives of Racetrack Memory Based on CurrentInduced Domain Wall Motion: From Device to System

Yue Zhang1, Chao Zhang3, Jacques-Olivier Klein1, Dafine Ravelosona1, Guangyu Sun3, Weisheng Zhao1,2
Email: [email protected] [email protected]
Institut dElectronique Fondamentale, Univ. Paris-Sud/UMR 8622 CNRS, Orsay, France
2
Spintronics Interdisciplinary Center, Beihang University, Beijing, China
3
Center for Energy-Efficient Computing and Applications, Peking University, Beijing, China
1

AbstractCurrent-induced domain wall motion (CIDWM) is


regarded as a promising way towards achieving emerging highdensity, high-speed and low-power non-volatile devices.
Racetrack memory is an attractive concept based on this
phenomenon, which can store and transfer a series of data along
a magnetic nanowire. Although the first prototype has been
successfully fabricated, its advancement is relatively arduous
caused by certain technique and material limitations.
Particularly, the storage capacity issue is one of the most serious
bottlenecks hindering its application for practical systems. In this
paper, we present two alternative solutions to improve the
capacity of racetrack memory: magnetic field assistance and
chiral domain wall (DW) motion. The former one can lower the
current density for DW shifting; the latter one can utilize
materials with low resistivity. Both of them are able to increase
the nanowire length and allow higher feasibility of large-capacity
racetrack memory. Furthermore, system level simulation shows
that a racetrack memory based cache can improve system
performance by about 15.8% and significantly reduces the
energy consumption, compared to the SRAM counterpart.

Firstly, in order to overcome the issue of thermal stability,


materials with perpendicular magnetic anisotropy (PMA) have
being intensively studied and can offer various other
performance improvements compared with those with in-plane
magnetic anisotropy, such as lower DW nucleation critical
current and higher DW shifting speed [7-8]. Secondly, it was
demonstrated that, due to Walker breakdown effect, magnetic
field could trigger the CIDWMs below intrinsic current
threshold [9]. Magnetic field can thus be one of alternative
solutions to handle the problem of high DW shifting current.
Thirdly, chiral DW motions have been discovered to allow a
high efficiency in terms of magnetization switching and DW
shifting, compared with conventional spin transfer torque (STT)
based CIDWMs [10-14]. This phenomenon is derived from
spin-orbit torque (SOT). Thanks to low resistivity of material
inducing SOI, chiral DW motions can overcome the
aforementioned high resistivity drawback.
In this paper, we present two racetrack memory device
structures based on new mechanisms: magnetic field
assistance and chiral DW motions. These two implementations
can improve the capacity of racetrack memory. By using
SPICE compatible electrical model [15-17], mixed simulations
and performance analyses are performed. Beyond the device
and circuit design, the system level investigations are also
performed. It demonstrates that a racetrack memory based
cache can outperform traditional SRAM based cache, in
respect of performance and energy consumption.
In section II, we introduce briefly the basic structure and
theory of racetrack memory. Two capacity-improving
solutions, magnetic field assistance and chiral DW motions,
will be presented in section III. The system-level study is then
detailed. At last, concluding remarks and perspectives are
discussed in section V.

Keywords Racetrack memory; Magnetic field assistance; Chiral


domain wall motion; L2 cache.

I.

INTRODUCTION

Recent progress of technique and material makes magnetic


domain wall (DW) motions be regarded as a promising way to
achieve non-volatile logic and memory devices [1]. In
particular, due to the prospects of facility, high speed and low
power, current-induced domain wall motion (CIDWM) draws
further attentions from academics and industries. Racetrack
memory is one of the most attractive applications based on this
phenomenon [2-3]. In the basic structure, a series of magnetic
domains move along magnetic nanowires, driven by a spin
polarized current. Thanks to the well-defined nanowires,
racetrack memory demonstrates an obvious potential in term of
scalability. However, the path of its realization is arduous. For
example, the prototype firstly fabricated was based on the
materials with in-plane magnetic anisotropy, which cannot
provide a sufficient thermal stability for further miniaturization
[4]. Furthermore, the high values of DW shifting critical
current and material resistivity limit the capacity of racetrack
memory [5-6]. These challenges incent the motivations of both
physical and material scientists.

978-1-4799-8391-9/15/$31.00 2015 IEEE

II.

RACETRACK MEMORY INTRODUCTION

As shown in Fig. 1, a basic cell of racetrack memory is


composed of three parts, i.e. write head, read head and storage
nanowire. The write and read heads can be constructed by
magnetic tunnel junctions (MTJs), which facilitate integration
with CMOS writing and reading circuits. Data, in the form of
magnetic domains, are stored along the magnetic nanowire.
Different magnetic domains are separated by DWs, which can

381

! 0 + X! / = H + u / + f pin
X! ! 0 = v sin 2 0 + u

be pinned by artificial potentials or constrictions when no


current is applied. As the distance between adjacent DWs can
be extremely short, this concept can achieve a considerably
high storage density. With respect to structural parts, three
circuits are used to generate currents accordingly. Iw and Ir
execute DW nucleation and detection respectively. Ish drives
the DWs or data from write head to read head.
Magnetic Field Assistance

Output

Inputb

Vddp

MN3

SA

MTJ0

constant

Input

MN1 MN2

Inputb

comes from the hard-axis magnetic anisotropy

. u is spin current velocity. Moreover, numbers of

experimental parameters (Co/Ni as an example shown in Tab.I)


have also been involved. By using this electrical model, ones
can design and analyze more complex logic and memory
circuits [18].
Beyond the thermal stability issue, the capacity limitation is
the most fatal bottleneck hindering the application of racetrack
memory. Indeed, high current density required for CIDWMs
and high resistivity of magnetic nanowire are two key reasons.
In the following, we will present two alternative solutions to
improve the capacity.

MTJ1

is the

gyromagnetic ratio, f pin is the pinning force. The velocity

Iref

Ku

is the angle that the

correction to the STT, H is the external field,

Hy
ENp

(2)

DW magnetization forms with the easy plane. is the width


of DW, is the Gilbert damping constant, is the dissipative

SOT based Chiral DW Motions

Vdda
Input MP1 MP2

where X is the position of a DW, and

(1)

Gnd

Domain wall motion

Gnd

Fig. 1.
Racetrack memory based on CIDWM, which is composed of one
write head (MTJ0), one read head (MTJ1) and one magnetic nanowire.
Writing circuit generates Iw to nucleate data or magnetic domain, propagation
circuit generates Ish to induce DW motion and sense amplifier (SA) generates
Ir to detect the magnetization direction. (the direction of Ish for the case of
SOT should be opposite to that for STT shown in this figure)

III.

Thanks to its multi-bit ability and scalability potential,


considerable efforts from academics and industries have been
put into the investigation of racetrack memory. The first
prototype has been successively fabricated despite of its limited
capacity of 256 bits [4]. Meanwhile, this prototype was based
on NiFe material with in-plane magnetic anisotropy, which
cannot provide a sufficient thermal stability for long data
retention in advanced technology nodes below 40 nm. Aiming
to tackle this problem, materials with PMA were observed to
offer higher energy barrier and show other advantages in terms
of current, speed and power consumption.

MAGNETIC FIELD ASSISTED AND CHIRAL DOMAIN


WALL MOTION BASED RACETRACK MEMORIES

Recent experimental progress showed that magnetic field


would trigger DW motions below intrinsic current threshold
due to Walker breakdown effect [9]. This observation offers a
promising orientation to relieve the pressure of high current
density, i.e. magnetic field assisted racetrack memory.
Through adding metal lines or coils, an approximately global
magnetic field is generated (the magnetic field Hy shown in
the red dash square in Fig. 1) [5-6].

TABLE I. PARAMETERS AND VARIABLES INTEGRATED IN RACETRACK


MEMORY ELECTRICAL MODEL (C O/NI)
Parameter

P
Ms
Hw

Ku
TMR
Jc_nucleation

Description
Gilbert damping constant
Nonadiabatic coefficient
Spin polarization rate
Saturation magnetization
Walker breakdown field
Gyromagnetic ratio
DW width
Uniaxial anisotropy
TMR of write head MTJ
DW Nucleation critical current density

Default Value
0.045
0.02
0.49
0.66 MA/m
4.4 mT
0.176 THz/T
10 nm
0.41 MJ/m3
120%
57 GA/m2
Fig. 2.
Dependence of critical current density required and shifting energy
versus different number of bits per nanowire in magnetic field assisted
racetrack memory.

With this background, racetrack memory based on


magnetic nanowire with PMA (e.g. CoFeB/MgO, Co/Ni, etc.)
was proposed [16]. In order to confirm the functionality of this
concept and extend it to other optimizations, we develop a
SPICE-compatible electrical model. It integrates several critical
physics, for example, one-dimension model to describe DW
motions shown as follows:

Due to this field, the current threshold for DW motions in


Co/Ni material could be reduced to 3.2 x 1011 Am-2 rather than
the intrinsic one 4.5 x 1011 Am-2. By fixing the voltage supply

382

Iw

M2(414.5uA)

IV.

Ish

To demonstrate the benefits of using racetrack memory in


system level, we replace a SRAM based L2 cache with a
racetrack memory based cache. We use gem5 [22], a cycle
accurate full system simulator, to simulate PARSEC [23]
benchmark suite for performance and power evaluation. All
benchmarks are fully executed and we count the overall
execution time of the system to evaluate the performance. For
energy evaluation, we collect the number of operations of each
benchmark and calculate the energy with parameters from a
circuit level simulation framework [24]. The system is
configured with 4 simple cores, private 32KB I/D cache, and a
shared L2 cache. In order to compare an on situ replacement,
we conduct an iso-area comparison. We use the 4M SRAM (~
6.5mm2) to compare with 64MB racetrack memory
(~6.2mm2). The detailed configurations are shown in Table II.

1 State of write head

0 0 0 0

M0(12.7ns)

1 State of read head


M1(166.1ns)

50

100

150
time (ns)

200

250

300

Fig. 3.
Transient simulation of 16-bit racetrack memory based on chiral
DW motions. Iw is used for inputting data, Ish is used for DW shifting.
12

Pt
Pd
CoFeB
(STT)

Nanowire length (um)

10

TABLE II. SYSTEM LEVEL EXPERIMENT SETUP DETAILS

Component
Processor
L1 I/D
Cache

L2
Cache

Main
Memory

0
0.5

1.5

2.5

3.5

Current density (1012 Am2)

4.5

SYSTEM LEVEL EVALUATION

Configuration
4 simple cores, 2GHz, 1-way issue
32/32KB, 2-way, 64B line, private, LRU
SRAM, 1/1 cycle, 6.2/2.3pJ
16-way, 64B, shared, LRU
SRAM: 2MB, 10/10 cycle, 0.57/0.54nJ, 3438mW
RM: 64MB, 9/20/5 cycle, 0.50/0.55/0.50nJ, 1062mW
8GB, DDR3, 1600MHz, 120cycle, 12.8GB/s

RM

1.2
1
0.8
0.6
0.4
0.2
0

Fig. 4.
Dependence of nanowire length versus DW shifting current density
for Pt and Pd. Dash line shows the case for the conventional STT based
CoFeB racetrack memory.

On the other hand, the SOT based chiral DW motions


demonstrates an unprecedented efficient magnetization
switching and a high shifting speed. Opposite to STT based
CIDWMs, the direction of chiral DW motion is the same with
that of applied current. Although it still needs a relatively high
current density for DW motions, the nanowire length is
increased thanks to the low resistivity of current-flowing nonmagnetic materials (e.g. Pt, Pd, etc.) [19-20]. Therefore, chiral
DW motions could be another promising solution to improve
the capacity of racetrack memory. By adding a non-magnetic
layer underneath the magnetic storage nanowire, the concept
of racetrack memory based on chiral DW motions is illustrated

SRAM

average

V (V)

in Fig. 1 with the green dash square [21]. We validate its


functionality with transient simulation of a 16-bit example. As
shown in Fig. 3, the data inputted at write head can totally be
detected by read head. In order to reveal its prospect for
capacity improvement, we compare two non-magnetic
materials, Pt and Pd, with conventional STT based CoFeB
case. From the result shown in Fig. 4, we find that the
proposed racetrack memory design can provide longer
nanowire than conventional one with the same current density.
Especially, Pt, with the smallest resistivity (1.56 x 10-7 .m),
exhibits an obviously high limit among all.

blackscholes
bodytrack
canneal
dedup
facesim
ferret
fluidanimate
freqmine
streamcluster
swaptions
vips
x264

I (uA)

4.5

V (V)

500
0
-500

I (mA)

to 3V and the distance between two adjacent DWs to 40 nm,


the limit value of bits per nanowire can exceed 100 and reach
up to 150. However, the generation of magnetic field
deteriorate overall power consumption. Fig. 2 shows our
analyses about the relation among storage capacity, critical
shifting current and power consumption. We can find that
there is a tradeoff when improving the storage capacity: more
bits stored per racetrack memory require lower critical current
and more energy consumption.

Fig. 5.
Comparison of system overall execution time. (RM: racetrack
memory)

383

The overall system performance is shown in Fig. 5. Due to


the increase of cache capacity, some benchmarks (e.g. canneal)
show significant performance improvement. But some
benchmarks (e.g. ferret) suffer from performance degradation
due to extra shift latency, which induces about 7.5% overhead.
The maximum performance improvement can be up to 38.3%,
and the average improvement is 15.8%.
The results of energy consumption are shown in Fig. 6.
Generally, racetrack memory reduces the energy consumed in
L2 cache. Even though the dynamic energy for racetrack
memory is higher than SRAM, the static leakage power of
racetrack memory is much lower. Thus racetrack memory
reduces the L2 cache energy up to 79.3%, and 73.0% on
average. Thus, the replacement will finally reduce the system
power consumption.

REFERENCES
[1]
[2]

[3]
[4]
[5]

[6]

[7]

RM

[8]

[9]

[10]

Fig. 6.

[11]

Average

blackscholes
bodytrack
canneal
dedup
facesim
ferret
fluidanimate
freqmine
streamcluster
swaptions
vips
x264

1.2
1
0.8
0.6
0.4
0.2
0

SRAM

[12]
[13]
[14]

Comparison of L2 cache energy. (RM: racetrack memory)

V.

CONCLUSION

[15]

This paper presents the prospect of racetrack memory


through the viewpoints from device design to system
evaluation. Firstly, two alternative ways to improve the
capacity have been proposed. Magnetic field assistance that
takes advantage of Walker breakdown effect could reduce
required current density. Chiral DW motions provide a new
mechanism to achieve racetrack memory and the related
materials have lower resistivity compared with conventional
STT based CIDWMs. The improvement of capacity could
make racetrack memory device more attractive for those
systems requiring large on-chop memory. By implementing
system-level evaluations, racetrack memory demonstrates its
assets in terms of system execution performance and energy
consumption.

[16]
[17]

[18]

[19]

[20]
[21]

ACKNOWLEDGMENT

[22]

The authors wish to acknowledge financial support from the


European FP7 program MAGWIRE (257707) and the French
national projects ANR-MARS, ANR-DIPMEM.

[23]
[24]

384

C. Chappert, A. Fert and F. Nguyen Van Dau, The emergence of spin


electronics in data storage, Nat Mater, vol. 6, pp. 813-823, 2007.
M. Hayashi, L. Thomas, R. Moriya, C. Rettner, and S.S.P. Parkin,
Current-controlled magnetic domain-wall nanowire shift register,
Science, vol. 320, pp. 209, 2008.
S. S. P. Parkin, M. Hayashi and L. Thomas, Magnetic Domain-Wall
Racetrack Memory, Science, vol. 320, pp. 190 -194, 2008.
A.J. Annunziata et al., Racetrack memory cell array with integrated
magnetic tunnel junction readout, Procs. of IEDM, pp.24.2.1, 2011.
Y. Zhang, W.S. Zhao, J.-O. Klein, C. Chappert and D. Ravelosona,
Current induced perpendicular-magnetic-anisotropy racetrack memory
with magnetic field assistance, Appl. Phys. Lett., vol. 104, pp. 032409,
2014.
Y. Zhang, W.S. Zhao, J.-O. Klein, C. Chappert and D. Ravelosona,
Implementation of magnetic field assistance to current-induced
perpendicular-magnetic-anisotropy racetrack memory, J. of Appl.
Phys., vol. 115, pp. 17D509, 2014.
D. Ravelosona, D. Lacour, J.A. Katine, B.D. Terris and C. Chappert,
Nanometer scale observation of high efficiency thermally assisted
current-driven domain wall depinning, Phys. Rev. Lett., vol. 95, pp.
117203, 2005.
S. Fukami et al., Current-induced domain wall motion in
perpendicularly magentized CoFeB nanowire, Appl. Phys. Lett., vol.
98, pp. 082504, 2011.
T. Koyama et al., Current-induced magnetic domain wall motion
below intrinsic threshold triggered by Walker breakdown, Nature
Nanotechnology, vol. 7, pp. 635-639, 2012.
I.M. Miron et al., Perpendicular switching of a single ferromagnetic
layer induced by in-plane current injection, Nature, vol. 476, pp.189193, 2011.
S. Emori, U. Bauer, S.-M. Ahn, E. Martinez and G.S.D. Beach,
Current-driven dynamics of chiral ferromagnetic domain walls, Nature
Materials, vol. 12, pp. 611-616, 2013.
P.P.J. Haazen et al., Domain wall depinning governed by the spin hall
effect, Nature Materials, vol. 12, pp. 299-303, 2013.
K.-S. Ryu, L. Thomas, S.-H. Yang and S.S.P. Parkin, Chiral spin
torque at magnetic domain walls, Nature Nano., vol. 8, pp. 527, 2013.
O.J. Lee et al., Central role of domain wall depinning for perpendicular
magnetization switching driven by spin troque from the spin Hall
effect, Phys. Rev. B, vol. 89, pp. 024418, 2014.
Y. Zhang et al., A Compact Model of Perpenticular Magnetic
Anistropy Magnetic Tunnel Junction , IEEE Trans. on Elect. Dev.,vol.
59, pp. 819-826, 2012.
Y. Zhang et al., Perpendicular-magnetic-anisotropy CoFeB racetrack
memory, J. of Appl. Phys, vol. 111, pp. 093925, 2012.
Y. Zhang et al., Electrical modeling of stochastic spin transfer torque
writing in magnetic tunnel junctions for memory and logic applications,
IEEE Trans. Magnetics, vol. 49, pp. 4375-4378, 2013.
Y. Zhang, W.S. Zhao, J.-O. Klein, D. Ravelosona and C. Chappert,
Ultra-high density content addressable memory based on current
induced domain wall motion in magnetic track, IEEE Trans. Magnetics,
vol. 48, pp. 3219-3222, 2012.
T. Kimura, Y. Otani, T. Sato, S. Takahashi and S. Maekawa, Roomtemperature reversible spin hall effect, Phys. Rev. Lett., vol. 98, pp.
156601, 2007.
Z. Tang et al., Temperature dependence of spin hall angle of
palladium, Appl. Phys. Exp. Vol. 6, pp. 083001, 2013.
Y. Zhang et al., Peristaltic perpendicualr-magnetic-anisotropy racetrack
memory based on chiral domain wall motions, submitted to J. of Phys.
D: Appl. Phys.
N. Binkert et al., The gem5 simulator, SIGARCH Comput. Archit.
News 39, 2 (August 2011), 1-7.
C. Bienia, Benchmarking Modern Multiprocessors, Ph.D. Thesis.
Princeton University, January 2011.
C.Zhang,G.Sun et al., Aquantitative modeling of racetrack memory, a
tradeoff among area, performance, and power, in ASP-DAC, Jan 2015.

You might also like