Clockgating Fpga 2
Clockgating Fpga 2
Abstract: We propose a novel merged clock gating Normally, the clock gating solution can be
architecture to design low power digital clock which groups implemented by identifying groups of registers which are
all clock gating signals together into a single clock gating controlled by a common enable signal. There are two
signal, then uses one DEMUX gate to process and split the clock gating styles. The AND logic gate is used to
single clock gating signal into many different clocks. We
multiply the enable signal and clock. Alternatively, Latch
compare the proposed technique with the conventional clock
gating technique and no-clock gating technique in term of gate can also be used to gate register clocks [8-9]. By
clock power, dynamic power and total power consumption. inserting the additional gates to circuit, the power penalty
The simulation results in Spartan-3E shows total is added to total power consumption. We have to consider
consumption power that the proposed architecture can save carefully the clock gating cells not to generate the power
3.45%, 26.53%, 50.69%, 53.15%, 53.13% compared to the overhead so much. To overcome this problem, we
no-clock gating and 1.19%, 1.85%, 11.83%, 14.76%, propose an optimum method of merged clock gating
15.67% compared to the conventional clock gating technique control.
in operation frequency of 100MHz, 1GHz, 10GHz, 100GHz
and 1THz, respectively. Moreover, the proposed technique In this paper, the author proposes a merged clock
can reduce to 20.88% and 5.28% compared to the no-clock gating technique to reduce the dynamic power for digital
gating and the conventional clock gating in term of the clock circuit design application. As the result, the power
created operation temperature, respectively. The number of consumption is reduced compared to the conventional
LUTs also decreases to 175 instead of 179 in the other clock clock gating technique. The author uses this innovative
gating techniques.
clock gating technique to build an integrated digital clock
Keywords: Clock Gating, Low Power, Dynamic Power, application with timer of seconds, minutes, hours, days of
Clock Power, Digital Clock. the month, and months of year in consideration of a leap
I. INTRODUCTION year. Here, in order to compare the effect of the proposed
clock gating method, we implemented the proposed clock
Reducing power consumption on the chip has become gating technique in comparison with the conventional
a major concern in the design of low consumption clock gating and non-clock gating techniques in term of
applications, particularly in battery powered applications. power dissipation, operation temperature and overhead
The consumed dynamic power when the circuit performs area, thereby providing the best possible evaluation to the
the switching function is the main component of the total proposed technique.
power consumption in the circuit [1]. The formula for
calculating the dynamic power is by II. DIGITAL CLOCK CIRCUIT DESIGN
1 clk_s s[5:0]
save dynamic power consumption. When integrating or dk_ms dk1 dk2 dk3
not integrating this low power clock gating technique, the Mod_50M Mod_60s q_next_en_mi Mod_60mi q_next_en_h Mod_24h
first digital clock circuit block still operates the same CLK
clk_s
CLK
clk_mi
CLK
clk_h
CLK
counting function. This digital clock circuit block counts
the seconds, minutes, hours, days, months, years, days
dk_d dk_mo
including the number of days per month in concern to
leap year, corresponding to the registers of s[5:0], m[5:0], q_next_en_d
Mod_days
q_next_en_mo
Mod_12m
q_next_en_y
Mod_y
h[5:0], d[5:0], mo[5:0], y [5:0], dotw [5:0]. Here, clk_d
CLK
clk_m
CLK
clk_y
CLK
283
283
2018 International Conference on Advanced Technologies for Communications (ATC)
2018 International Conference on Advanced Technologies for Communications
month, the circuit determines how many days are in a one clock gating block. By doing so, the clock network is
month. In February, temporary value for days is feb signal decreased very much in the control load, reducing the
which is handled as illustrated in Figure 5. number of buffers and reducing the switching power in
the clock network. Furthermore, reducing logic gates lead
29
to reduce the resource and switching power caused by
0
feb these logic gates. Therefore, the consumption power of
28
this improved clock gating method will be more efficient
x than using conventional clock gating technique.
next_enable[4:0] next_enable_temp[4:0]
D Q
Latch
-
q_next_y
>>2 <<2 G clk_s
clk_mi
1xxxx
q_reg_enable clk_h
Fig. 5: Processing leap year with 28 days in February =1'b1 D Q
x1xxx
q_next_enable
next_enable[5] clk_d
xx1xx
In Figure 5, the inputs show the year value register of clk_enable
clk clk_mo
q_next_y signal. In theory, the leap year is divisible by 4. CLK
xxx1x
The divisor is designed by two shift registers. If the year xxxx1 clk_y
284
284
2018 International Conference on Advanced Technologies for Communications (ATC)
2018 International Conference on Advanced Technologies for Communications
285
285
2018 International Conference on Advanced Technologies for Communications (ATC)
The operation temperature is created by circuit power presents a novel clock gating solution by grouping all
dissipation. The conventional clock gating technique is clock gating signals together into a single clock gating
lower than no-clock gating technique in term of the signal, then using one DEMUX gate to process and split
operation temperature. Specifically, it will be decreased the signal into many different clocks. By doing so, the
by 0.36%, 3.47%, 21.69%, 20.88%, at operation proposed design can reduce the number of inserted logic
frequencies of 100 MHz, 1 GHz, 10 GHz, 100 GHz, gates and reduce ~ 15.67% of the total power to the
respectively. There is no drop in temperature at 1THz digital clock design compared to using the conventional
frequency as shown in Table 4 because the operation clock gating technique at 1THz operation frequency. As a
frequency is too high and it reaches the level specified in result, the number of LUTs decreases to 175 instead of 179 in
the power simulation tool, thus, the operating temperature the other clock gating techniques, and the operating
is the same among three clock gating techniques. The new temperature is also reduced by using this merged clock
merged clock gating technique is lower than the gating technique.
conventional gating clock technique in term of the
REFERENCES
dissipated operation temperature. Specifically, it will be
decreased by 1.75%, 5.26% according to 10 GHz, 100 [1] Semiconductor Industry Assoc., ITRS, 2003 update;
GHz frequency, respectively. http://public.itrs.net
[2] Vo Minh Huan, Tong Van On, “Solutions to minimize the
TABLE 5 COMPARISON IN RESOURSE USAGE
power consumption in nanometer designs”, ISEE Proceedings,
Resource usage FF LUT International Symposium on Electrical & Electronics Engineering pp
64-72, Oct. 2007.
No-Clock gating 75 179
[3] Ya-Ting Shyu, et al, “Effective and Efficient Approach for
Conventional Clock gating 75 179 Power Reduction by Using Multi-Bit Flip-Flops”, IEEE Transactions On
Very Large Scale Integration (VLSI) Systems, Vol. 21, No. 4, April
The proposed clock gating 75 175 2013.
From Table 5 above, the usage resources of the three [4] Huan Minh Vo, Chul-Moon Jung, Eun-Sub Lee, and
schemes are approximately equal. The number of Flip Kyeong-Sik Min, “Carry select adder with sub-block power gating for
Flops is equal in all three cases. The number of FFs and reducing active-mode leakage in sub-32-nm VLSIs,” IEICE Electronics
Express, vol. 8, no. 16, pp. 1322-1329, Aug. 2011.(SCIE).
logic resources of the LUT are the same in cases of the
absence of clock gating and the conventional clock gating [5] Shmuel Wimer and Israel Koren, “ Design flow for flipflop
grouping in Data-Driven gating”, IEEE transactions on VLSI systems,
technique because they share the same circuit. For novel pp. 771-778, vol.2, 2014.
improved clock gating technique, the block_design is
designed the same counting function among the three [6] Xiaoxiao Ahang, et al, “ 32bit X32bit multiprecision razor
based dynamic voltage scaling multiplier with operands scheduler”,
techniques. So, it should be equal in term of logic IEEE transactions on VLSI systems, vol.22, pp. 759-770, April 2014.
resource usage. In case of the conventional clock gating
technique, it occupies 6 Flip Flops for 6 signals of clk_s, [7] Bishwajeet Pandey; Jyotsana Yadav; M. Pattanaik; Nitish
Rajoria, “Clock gating based energy efficient ALU design and
clk_mi, clk_h, clk_d, clk_mo, clk_y and they are all loads implementation on FPGA”, 2013 International Conference on Energy
of system clock. In case of the proposed technique, it uses Efficient Technologies for Sustainability (ICEETS), pp. 93-97, 2013.
a Flip Flop as the general clock gating block, 5 latches for [8] Endri Bezati , Simone Casale-Brunet , Marco Mattavelli ,
conditional signal processing, so the number of FFs is the and Jorn W. Janneck, “Clock-Gating of Streaming Applications for
same between the conventional clock gating technique Energy Efficient Implementations on FPGAs”, IEEE Transactions on
and the merged clock gating technique. However, the Computer-Aided Design of Integrated Circuits and Systems, pp. 699 –
703, Vol. 36, Issue 4, April 2017.
proposed technique has only one clock load. It is
explained in the previous analysis that the dissipation [9] H. Li, S. Bhunia, Y. Chen, K. Roy, and T. Vijaykumar, “Dcg:
power saving of the proposed technique is better than the deterministic clock-gating for low-power microprocessor design,” IEEE
Trans.Very Large Scale Integr. (VLSI) Syst., vol. 12, no. 3, pp. 245–
conventional one. In terms of the LUT's resources, the 254, 2004.
proposed improved clock gating technique results in
[10] Elio Consoli, et al, “Novel Class of Energy-Efficient Very
reduction of 4 LUTs for the digital clock circuit system. High-Speed Conditional Push–Pull Pulsed Latches”, IEEE Transactions
IV. CONCLUSION on VLSI systems, Vol. 22 No. 7 pp. 1593 1605, July 2014.
286
286