A Fully Static True-Single-Phase-Clocked Dual-Edge-Triggered Flip-Flop For Near-Threshold Voltage Operation in Iot Applications
A Fully Static True-Single-Phase-Clocked Dual-Edge-Triggered Flip-Flop For Near-Threshold Voltage Operation in Iot Applications
March 6, 2020.
Digital Object Identifier 10.1109/ACCESS.2020.2976773
ABSTRACT A Dual-Edge-Triggered (DET) flip-flop (FF) that can reliably operate at low voltage is pro-
posed in this paper. Unlike the conventional Single-Edge-Triggered (SET) flip-flops, DET-FFs can improve
energy efficiency by latching input data at both clock edges. When combined with aggressive voltage scaling,
significant efficiency improvement is expected. However, prior DET-FF designs were susceptible to Process,
Voltage and Temperature (PVT) variations, limiting their operation at low voltage regimes. A fully static true-
single-phase-clocked DET-FF is proposed to achieve reliable operation at voltages as low as a near-threshold
regime. Instead of the two-phase or pulsed clocking scheme in conventional DET-FFs, a True-Single-Phase-
Clocking (TSPC) scheme is adopted to overcome clock overlap issues and enable low-power operation.
Fully static implementation also enables robust operation in a low voltage regime. The proposed DET-FF
is designed in 28nm CMOS technology, and a comprehensive analysis including post-layout Monte Carlo
simulation for wide PVT ranges is performed to validate the design approaches. Extensive analysis and
comparison with prior-art DET-FFs confirmed that the proposed DET-FF can operate at the lowest voltage
of 0.28 V for a temperature range of −40 ◦ C to 120 ◦ C while maintaining nearly-best energy efficiency and
power-delay-product.
INDEX TERMS Flip-flop, dual-edge-triggered (DET), single phase, low power, near-threshold voltage.
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see http://creativecommons.org/licenses/by/4.0/
40232 VOLUME 8, 2020
Y. Lee et al.: Fully Static True-Single-Phase-Clocked DET-FF for Near-Threshold Voltage Operation in IoT Applications
B. PULSED-LATCH-TYPE DET-FF
As alternatives to the Latch-Multiplexer-type DET-FF, many
pulse-triggered DET FFs have been proposed [9]–[18]. In this
topology, a pulse generator generates a clock-synchronized
FIGURE 2. Pulsed-Latch-type DET-FF. (a) Dual-Edge Conditional
pulse at both the rising and falling clock edges. These clock- Pre-charge (DECP) FF [17] (b) Sense-Amplifier (SA) DET FF [11].
synchronized pulses are used as triggers for the latching
operation of the pulsed latches. Pulsed-latch-type DET-FFs require pulse-generating cir-
Pulsed-Latch-type DET FFs (PL-DET FFs) consist of cuits. Since the transparency window of pulsed latches
a pulse generator and a pulsed latch and can be classi- is determined by the pulse length, timing is very impor-
fied into two types – implicit pulse [13]–[18] and explicit tant. However, since typical pulse-generating circuits rely
pulse [9]–[12]. Fig. 2 shows a PL-DET FF using an on transistor delay to control the pulse length, the pulse
implicit pulse (Fig. 2(a)) and one using an explicit pulse characteristics can be easily affected by Process, Volt-
(Fig. 2(b)) [11], [17]. The Dual-edge Conditional Pre-charge age and Temperature (PVT) variations. Therefore, careful
flip-flop (DECP FF) [17] shown in Fig. 2(a) utilizes control pulse-generation circuit design is required to make it robust
signals generated by the input CLK. The delayed version against PVT variations. Such careful design often requires
of CLK (CK4), the inverted version of CLK (CKD), and large margins for variation tolerance, resulting in large power
the CLK signal generate a short transparency window after and area overhead.
both the rising and falling edge of the CLK. During these
windows, the input data is forwarded to the output node Q C. C-ELEMENT-TYPE DET-FF
in the pulsed latch. This implicit pulse-type DET-FF requires Another type of recently designed DET-FF includes a circuit
a robust timing window for reliable operation, so the careful element called a ‘C-element’ [23], [24]. As illustrated in
sizing of transistors is an important design issue. Fig. 3(a), when the two inputs of a C-element are the same,
An example of a pulsed-latch-type DET-FF with explicit the output is updated to the input value, whereas the earlier
pulse is the Sense-Amplifier DET flip-flop (SA-DET FF) output value is retained when the two input values differ from
[11] shown in Fig. 2(b). This DET-FF utilizes pulses gen- each other. Thanks to the ability of C-element-type DET-
erated by complementary two-phase clock signals, namely FFs to latch new data or retain data depending on the input
CLK1 and CLK2. When these signals become equal for a conditions, many different topologies have been proposed to
very short period of time due to their generation circuit struc- implement them.
ture (series-connected inverters), a pulse generator circuit Fig. 3(b), (c) show examples of DET-FF designs using
generates a short pulse signal, which is used to trigger the C-elements. The New low-power C-element Dual Data Rate
latching operation in a pulsed latch. Since the pulses are flip-flop (NCDDR FF) [23] shown in Fig. 3(b) includes two
generated whenever CLK1 and CLK2 are both ‘1’ or both latches that store the input values in opposite phases. The
‘0’, the data is latched at both CLK edges. latched input is stored in nodes A and B, which are used as
targeted for IoT applications, Low-Power (LP) process, rather This approach also includes the use of non-minimum-length
than General Purpose (GP), is chosen. transistors for reliable pulse lengths in pulsed-latch-type
Fig. 8 presents the layout of the 7 DET-FFs designed DET-FFs.
for post-layout analysis. All DET-FF designs are carefully
sized to guarantee reliable operation at room temperature A. STANDARD SUPPLY VOLTAGE OPERATION
and standard supply voltage (0.95 V) while keeping the size Fig. 9 shows the simulation setup for fair data measurement
as small as possible for energy efficiency. RC parasitics and comparison, which is similar to the one used in [16], [37].
are extracted for the layouts shown in Fig. 8 so that the Two inverters are utilized as input drivers for the input
post-layout simulations can be performed at various operation clock (CLK) and data (D). To precisely model the power con-
conditions. sumption of the DET-FFs, the power consumed for driving
Among the 3 types of DET-FFs, the pulsed-latch-type each pin is estimated by measuring the power consumed on
DET-FF (DECP FF [17], SA-DET FF [11]) and the these drivers. For example, the power consumed for driving
C-element-type DET-FF (NCDDR FF [23], FN_C-DET the CLK input (PCLK ) can be calculated as
FF [24]) are more susceptible to PVT variations than the
PCLK = PDriver_CLK − PDriver_CLK _int (10)
Latch-MUX-type DET-FF (SDET FF [20], TSPC-SDET
FF [27] and FS-TSPC-DET FF) since the pulsed-latch-type where PDriver_CLK is the total measured power consump-
DET-FF requires precise pulse timing and the C-element-type tion of the power supply of the CLK-driving inverter, and
DET-FF has contention with the weak inverters. Therefore, PDriver_CLK _int is the CLK-driving inverter’s intrinsic power
these DET-FFs require proper transistor sizing for stable consumption, which can be measured by measuring the
operation. However, sizing up transistors to ensure a mini- power consumption of the CLK-driving inverter without the
mum reliable pulse length for pulsed-latch-type DET-FFs or DET-FF’s CLK pin loading. Similarly, the power consumed
making a weak inverter for C-element-type DET-FFs would for driving D input (PD ) can be calculated as
incur greater power and area overhead. Due to this trade-off,
the transistor sizes are carefully chosen for these two types PD = PDriver_D − PDriver_D_int (11)
of DET-FF designs so that stable operation at the standard
supply voltage and room temperature is guaranteed while and the power consumed for the internal switching of the flip-
keeping the area compact and energy efficiency reasonable. flop (PFlip−Flop ) can be measured by the power consumption
FIGURE 8. Layout of 7 DET-FFs. (a) SDET FF (b) TSPC-SDET FF (c) DECP FF (d) SA-DET FF (e) NCDDR FF (f) FN_C-DET FF (g) FS-TSPC-DET FF.
FIGURE 11. CLK-to-Q delay versus D-to-CLK delay. (a) all output
transitions at both clock edges in the proposed FS-TSPC-DET FF. (b) setup
time comparison with 7 DET-FFs.
FIGURE 14. Test pattern and truth table for Monte Carlo simulation.
FIGURE 15. Monte Carlo simulation results with 4 DET-FFs with aggressive voltage and temperature scaling.
data transition is assumed so that only the functional failures represents ‘pass’ in 10k Monte Carlo simulations and a red
not related to setup or hold-time violations are checked. If a box represents ‘fail’ for failing at least 1 test case. Among
DET-FF operates with no functional failure, the output Q is the 4 evaluated DET-FFs, the SDET FF and FN_C-DET FF
expected as shown Fig. 14, synchronizing the input data at have limited functionality at low voltage and low temper-
each marked point (a–f). ature. This is because these DET-FFs utilize a two-phase
Monte Carlo simulations were performed for a supply clocking scheme, where the clock overlap issue can be worse
voltage range of 0.2 V to 0.5 V with 20-mV steps and a with PVT variations. Furthermore, in the FN_C-DET FF,
temperature range of −40 ◦ C to 120 ◦ C with 10 ◦ C steps. the strong and weak C-elements have contention on the data
For each condition, 10k Monte Carlo points were simulated storage node, making it susceptible to PVT variations. Thanks
for accurate evaluation. The Shmoo plots for the 4 DET-FFs to their single-phase clocking scheme, TSPC-SDET FF and
are shown in Fig. 15. In these Shmoo plots, a green box FS-TSPC-DET FF can operate without failure at the lower
voltages and temperatures. However, TSPC-SDET FF cannot prior-art designs at a given supply voltage but also can operate
function below 0.3 V due to the weak pull-up/pull-down (at with the lowest supply voltage. These properties make the
node XP or XN in Fig. 1(b)). Thanks to the fully static imple- proposed FF an excellent candidate as an energy-efficient
mentation, the proposed FS-TSPC-DET FF has the lowest sequential circuit element for energy/cost-sensitive IoT appli-
functional supply voltage and temperature – it could operate cations with aggressive dynamic voltage scaling down to a
without failure at 0.28 V for the entire tested temperature near-threshold voltage of 0.28 V.
range.
Table 1 summarizes the evaluation results with standard
supply voltage conditions (VDD = 0.95 V, input clock fre- V. CONCLUSION
quency = 1 GHz, temp. = 27 ◦ C at TT corner). The nor- A dual-edge-triggered flip-flop for high energy efficiency
malized areas of the 7 DET-FFs are compared based on the and robust near-threshold operation is proposed in this paper.
layout sizes shown in Fig. 8 (FS-TSPC-DET FF = ‘1’). Unlike most of the conventional DET-FFs, such as the
The area is not directly proportional to the transistor counts Pulsed-latch-type or Latch-MUX-type DET-FF using a com-
since some designs require careful transistor sizing for robust plementary two-phase clock, the proposed FS-TSPC-DET
operation, especially when they have contention. Thanks to FF utilizes a true single-phase clock and implements a fully
their symmetric and static structure, Latch-MUX-type DET- static structure without floating state and contention. The
FFs (the SDET FF, TSPC-SDET FF and FS-TSPC-DET FF) comprehensive post-layout analysis of the DET FFs imple-
can have smaller layout areas compared to other DET-FF mented in 28nm CMOS technology showed that the proposed
designs despite the use of a large number of transistors. FS-TSPC-DET FF achieves excellent power-delay trade-off
The proposed FS-TSPC-DET FF has low power con- and high tolerance against PVT variation, thanks to the simple
sumption comparable to the best performing FN_C-DET FF. and fully static implementation. Extensive comparison with
In addition, it also has low CLK-to-Q delay comparable to the prior-art DET-FFs confirmed that the proposed DET-FF can
best performing TSPC-SDET FF, resulting in the lowest PDP operate at the lowest voltage of 0.28 V for a temperature
for α = 10% and 50% and second lowest PDP for α = 25%. range of −40 ◦ C to 120 ◦ C, making it an excellent candidate
Besides the excellent trade-off between power and perfor- as an energy-efficient flip-flop for energy/cost-sensitive IoT
mance, the proposed FS-TSPC-DET FF also exhibits robust applications.
operation at low voltage. Thanks to its fully static design
without contention, the minimum operational supply voltage
was as low as 0.28 V. Such high energy efficiency and wide ACKNOWLEDGMENT
voltage scalability suggest that the proposed FS-TSPC-DET The authors would like to thank the IC Design Education
FF not only consumes relatively lower power compared to Center (IDEC), South Korea, for providing EDA tool.
GICHEOL SHIN (Student Member, IEEE) YOONMYUNG LEE (Senior Member, IEEE)
received the B.S. degree in electrical engineering received the B.S. degree in electronic and electrical
from Sungkyunkwan University, Suwon, South engineering from the Pohang University of Sci-
Korea, in 2017, where he is currently pursuing the ence and Technology (POSTECH), Pohang, South
joint M.S./Ph.D. degree in electronic, electrical, Korea, in 2004, and the M.S. and Ph.D. degrees
and computer engineering. His research inter- in electrical engineering from the University of
ests include ultra-low-power circuit and hardware Michigan, Ann Arbor, in 2008 and 2012, respec-
accelerator design for deep learning applications. tively.
From 2012 to 2015, he was a Research Faculty
with the University of Michigan and performed
research on ultra-low-power circuit design for mm-scale sensor platforms.
In 2013, he co-founded CubeWorks Inc., a start-up company specialized in
mm-scale sensor platforms. In 2015, he joined Sungkyunkwan University,
Suwon, South Korea, where he is currently an Associate Professor. His cur-
rent research interests include energy-efficient integrated circuits design for
low-power high-performance VLSI systems and millimeter-scale wireless
sensor systems. He has been serving as a Technical Program Committee
Member for A-SSCC, since 2017. He has received numerous awards and
scholarships, including the Distinguished Undergraduate Scholarship from
the Korea Foundation for Advanced Studies, in 2001, the Samsung Scholar-
ship, in 2005, the Best Paper Award in ISLPED, in 2009, the DAC/ISSCC
Student Design Contest, in 2009 and 2011, the Intel Ph.D. Fellowship,
in 2011, and the Samsung Human Tech Thesis Contest Silver Award, in 2012.
He has been serving as an Associate Editor for the IEEE TRANSACTIONS ON
VERY LARGE SCALE INTEGRATION SYSTEMS, since 2019.