CAM basepaper
CAM basepaper
4, APRIL 2015
Abstract— We propose a low-power content-addressable [3], [4], database accelerators, image processing, parametric
memory (CAM) employing a new algorithm for associativity curve extraction [5], Hough transformation [6], Huffman cod-
between the input tag and the corresponding address of the ing/decoding [7], virus detection [8] Lempel–Ziv compres-
output data. The proposed architecture is based on a recently
developed sparse clustered network using binary connections that sion [9], and image coding [10].
on-average eliminates most of the parallel comparisons per- Due to the frequent and parallel search operations, CAMs
formed during a search. Therefore, the dynamic energy con- consume a significant amount of energy. CAM architectures
sumption of the proposed design is significantly lower compared typically use highly capacitive search lines (SLs) causing them
with that of a conventional low-power CAM design. Given an not to be energy efficient when scaled. For example, this
input tag, the proposed architecture computes a few possibilities
for the location of the matched tag and performs the comparisons power inefficiency has constrained TLBs to be limited to
on them to locate a single valid match. TSMC 65-nm CMOS tech- no more than 512 entries in current processors. In Hitachi
nology was used for simulation purposes. Following a selection SH-3 and StrongARM embedded processors, the fully asso-
of design parameters, such as the number of CAM entries, the ciative TLBs consume about 15% and 17% of the total
energy consumption and the search delay of the proposed design chip power, respectively [11]–[13]. Consequently, the main
are 8%, and 26% of that of the conventional NAND architecture,
respectively, with a 10% area overhead. A design methodology research objective has been focused on reducing the energy
based on the silicon area and power budgets, and performance consumption without compromising the throughput. Energy
requirements is discussed. saving opportunities have been discovered by employing either
Index Terms— Associative memory, content-addressable circuit-level techniques [14], [15], architectural-level [16], [17]
memory (CAM), low-power computing, recurrent neural techniques, or the codesign of the two, [18], some of which
networks, sparse clustered networks (SCNs). have been surveyed in [19]. Although dynamic CMOS circuit
techniques can result in low-power and low-cost CAMs, these
I. I NTRODUCTION
designs can suffer from low noise margins, charge sharing,
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
JAROLLAHI et al.: ALGORITHM AND ARCHITECTURE FOR A LOW-POWER CAM 643
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
644 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 23, NO. 4, APRIL 2015
Fig. 2. Classical BCAM cell types. (a) 10T NOR. (b) 9T NAND.
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
JAROLLAHI et al.: ALGORITHM AND ARCHITECTURE FOR A LOW-POWER CAM 645
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
646 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 23, NO. 4, APRIL 2015
result(s). On the other hand, no length reduction leads to the Var(λ) = (M − 1)(1/2q )(1 − 1/2q ). (4)
generation of no ambiguities, but a higher level of hardware If the input pattern is correlated in a sense that certain bits
complexity in the SCN-based classifier, since more neurons repeat their values among all of the CAM, (3) can be modified
are required. to the following:
M −1
E c (λ) = (5)
C. Data Distribution 2q−k
where k is the number of similar bits in q, and E c is the
The number of ambiguities, generated in PII is dependent on
expected value of the number of matched entries. We can
the correlation factor of the tag pattern, that is the number of
consider more complex models, such as when the reduced tags
similar repeating bits in the subset of tags. A higher degree of
are obtained using a Bernoulli distribution with parameter α.
similarities results in a higher number of ambiguous neurons.
We then partition reduced tags depending on the number of
If the tag pattern is previously known, it is possible to select
ones, i, they contain. In such a case, we obtain
the reduced-length tag bits among those that have a lower
q
level of similarity likelihood. Otherwise, SCN-CAM detects q
the valid match but with a higher cost of power consumption. E G (λ) = (M − 1) (α i (1 − α)q−i )2 . (6)
i
Assuming a random distribution for the CAM entries, the i=0
expected number of possible matches is the actual match In particular, we verify that for α = 1/2, E G (λ) corresponds
plus the expected number of ambiguities. The number of to the independent identically distributed uniform case. Fig. 6
ambiguities given a random and uniformly distributed input shows simulation results based on one million random and
pattern can be estimated using (2), where λ is a random uniformly distributed reduced-length tags and two different
variable representing the number of ambiguities. Pλ is the CAM sizes. It shows how the expected value of the number of
probability that exactly λ ambiguities occur using q bits of possible matches (E(λ)) is decreased to only one by increasing
the tag. Therefore, it follows a binomial distribution as shown the value of the number of bits in the reduced-length tag
in (2). Software simulations on numerical data samples verify as shown in (2) and (3). The algorithm of SCN-CAM is
the validity of (2). Therefore, according to the binomial law similar to that of the precomputation-based CAM (PB-CAM)
the expected value of λ, E(λ), and its variance, Var(λ), [16], [17]. A drawback of such methods, unlike SCN-CAM,
can be calculated as shown in (3) and (4), respectively. is that as the length of the tags is increased, the cycle time
If q = log2 (M), only one ambiguity is achieved on average and the circuit complexity of the precomputation stage are
leading to the activation of one extra neuron in PII in addition dramatically increased. Furthermore, we will show that unlike
to the actual match. Consequently, this value of q, and λ the PB-CAMs, SCN-CAM can potentially narrow down the
is used throughout this paper as a starting point to estimate search procedure to only one comparisons with a simple
the cycle time and the energy consumption of the proposed computational complexity that does not grow with the increase
CAM design. of the tag length.
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
JAROLLAHI et al.: ALGORITHM AND ARCHITECTURE FOR A LOW-POWER CAM 647
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
648 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 23, NO. 4, APRIL 2015
Fig. 7. Simplified schematic of the SCN-based classifier generating compare-enable signals for the CAM array.
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
JAROLLAHI et al.: ALGORITHM AND ARCHITECTURE FOR A LOW-POWER CAM 649
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
650 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 23, NO. 4, APRIL 2015
Fig. 14. Total estimated energy consumption per search and match for
SCN-CAM, and comparing them with the conventional low-power
NAND -type CAM design for various truncated tag bits (q).
Fig. 12. Total estimated dynamic energy consumption per search for
SCN-CAM, and comparing them with the conventional low-power
NAND -type CAM design for various values of the reduced-length tag (q).
Fig. 13. Estimated energy consumption per bit per search of the proposed
NOR architecture, and the conventional NAND -type CAM for various values Fig. 15. Simulation results for SCN-CAM based on reference design
of word lengths (N). parameters in Table I.
activity in their bit lines. Therefore, we have divided the energy SCN-based classifier can be estimated considering the area of
consumption of the SRAMs into the corresponding states. the decoders, the SRAM arrays, the precharge devices, the read
Furthermore, the CAM portion of the energy model consists and write circuits, the interconnections, and the standard cells.
of match (E CAMmatch ) and mismatch (E CAMmismatch ) portions, Similarly, the area of the CAM array can be estimated by
whose values depend on the number of ambiguities discussed considering the gaps between the CAM sub-blocks, pass-
in Section IV-B. Static energy consumption of the idle CAMs gate transistors, read and write circuits. The area overhead
(E CAMstat ) has also been included due to the presence of is estimated to be 10.1% higher than that of the conventional
leakage current occurring in advanced technologies. The esti- CAM design, for design selections in Table I.
mation of the number of required transistors follows a similar In the simulations for measuring the energy consumption
model. and the cycle time (/bit/search), on average half of the data
bits were assumed to mismatch in case of a word mismatch.
In Fig. 12, the relationship between the dynamic energy
B. Area Estimation and Simulation Results consumption of SCN-CAM and the tag length is depicted for
Fig. 11 shows the estimated overhead of the number of various number of entries of the CAM in comparison with
transistors in SCN-CAM for various number of entries of the the conventional CAMs. The estimated energy consumption
CAM in comparison with the conventional design. For design is obtained based on (6), and the extracted values for energy
selections in Table I, this overhead is only 3.4% compared consumption using HSPICE simulations. As the value of q
with that of the conventional CAM. The silicon area of the is increased, the energy consumption is decreased as well
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
JAROLLAHI et al.: ALGORITHM AND ARCHITECTURE FOR A LOW-POWER CAM 651
TABLE II
R ESULT C OMPARISONS
since the number of comparisons is reduced but up to a point implementation of the conventional NAND-type and NOR-type
until the energy consumption of the SCN-based classifier itself CAMs. The technology-scaled version (to 65-nm CMOS) of
would dominate that of the CAM array. Therefore, the energy these results are evaluated according to the method described
consumption of the SCN-based classifier is not dependent on in [18]. Unless otherwise indicated, the reported results are
the original tag length, and rather on the number of entries in based on simulations. The energy consumption of SCN-CAM
the CAM array. is 4.08%, 8.02%, 3.59%, 66.7%, 53.89%, 30.5%, and 3.69%
Fig. 13 shows the effect of the word length on the energy of that of the referenced NOR-based, referenced NAND-based,
consumption in comparison with the conventional design. [1], [12], [15], [16], and [32], respectively. On the other
The original tag length (N) does not change the architecture hand, the cycle time of SCN-CAM is 132%, 31.4%, 224%,
and the energy consumption of the SCN-based classifier. 67.2%, 169%, 48.4%, and 107% of that of the referenced NOR,
Furthermore, due to the small size of the sub-blocks, the referenced NAND, [1], [12], [15], [16], and [32], respectively.
search power of SCN-CAM is much smaller compared with Although the energy consumption of the CAM presented in
that of the conventional. Consequently, as N is increased, [15] is small compared with most of the others, the cycle time
the energy consumption per-bit-per-search is decreased in is significantly increased (5.2×) when the search-data patterns
SCN-CAM while it stays constant in the conventional CAM. are correlated, whereas the cycle time of SCN-CAM remains
It also implies an advantage of SCN-CAM over PB-CAMs, unchanged in a similar situation.
such as in [16] and [17], where longer tag lengths increase The required silicon area of SCN-CAM is estimated to
the energy consumption as well as the precomputation delay. be 10.1% larger than that of the conventional NAND-type
This is because longer tags will increase the complexity of counterpart mainly due to the existence of the gaps between
the adders and the number of comparisons. Fig. 14 shows the SRAM blocks of the SCN-based classifier. Consequently,
the effect of the correlation in the entries of the CAM on the the silicon area can be reduced if fewer sub-blocks are used
energy consumption for various lengths of the reduced-length with the cost of energy consumption.
tag. The expected value of the number of sub-blocks has been
calculated according to (5). The correlation effect is applied
VII. C ONCLUSION
on the inputs by creating similarities within their contents.
For example, a 10% correlation means that 10% of the bit In this paper, the algorithm and the architecture of a
values are similar in all input patterns. It is therefore observed low-power CAM are introduced. The proposed architecture
that larger correlation costs energy consumption although it (SCN-CAM) employs a novel associativity mechanism based
does not affect the performance as in [15]. Fig. 15 shows on a recently developed family of associative memories based
simulation results for measuring the cycle time of SCN-CAM on SCNs.
for the selected design parameters shown in Table I. It shows SCN-CAM is suitable for low-power applications, where
the worst-case cycle time, where the last input of the AND/ OR frequent and parallel look-up operations are required.
gates are pulled up and under SS corner. It also shows the SCN-CAM employs an SCN-based classifier, which is
wave-pipelining method of clk1 and clk2 signals as shown in connected to several independently compare-enabled CAM
Fig. 7. clk2 is simply a delayed version of clk1 . sub-blocks, some of which are enabled once a tag is pre-
The cycle time is measured by the maximum reliable sented to the SCN-based classifier. By using independent
frequency of operation in the worst-case cycle time (SS) nodes in the output part of SCN-CAM’s training network,
scenario. Table II summarizes the comparisons of the cycle simple and fast updates can be achieved without retraining the
time and the energy consumption between SCN-CAM and network entirely. With optimized lengths of the reduced-length
a collection of other related work including our own tags, SCN-CAM eliminates most of the comparison operations
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
652 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 23, NO. 4, APRIL 2015
given a uniform distribution of the reduced-length inputs. [18] P.-T. Huang and W. Hwang, “A 65 nm 0.165 fJ/Bit/Search 256 × 144
Depending on the application, nonuniform inputs may result in TCAM macro design for IPv6 lookup tables,” IEEE J. Solid-State
Circuits, vol. 46, no. 2, pp. 507–519, Feb. 2011.
higher power consumptions, but does not affect the accuracy [19] K. Pagiamtzis and A. Sheikholeslami, “Content-addressable memory
of the final result. In other words, a few false-positives may be (CAM) circuits and architectures: A tutorial and survey,” IEEE J. Solid-
generated by the SCN-based classifier, which are then filtered State Circuits, vol. 41, no. 3, pp. 712–727, Mar. 2006.
[20] V. Gripon and C. Berrou, “Sparse neural networks with large learning
by the enabled CAM sub-blocks. Therefore, no false-negatives diversity,” IEEE Trans. Neural Netw., vol. 22, no. 7, pp. 1087–1096,
are ever generated. Jul. 2011.
Conventional NAND-type and NOR-type architectures were [21] V. Gripon and C. Berrou, “Nearly-optimal associative memories based
on distributed constant weight codes,” in Proc. ITA Workshop, Feb. 2012,
also implemented in the same process technology to com- pp. 269–273.
pare SCN-CAM against, along with other recently developed [22] H. Jarollahi, N. Onizawa, V. Gripon, and W. J. Gross, “Architecture
CAM architectures. It has been estimated that for a case and implementation of an associative memory using sparse clustered
networks,” in Proc. IEEE ISCAS, Seoul, South Korea, May 2012,
study design parameter, the energy consumption and the pp. 2901–2904.
cycle time of SCN-CAM are 8.02%, and 28.6% of that of [23] H. Jarollahi, N. Onizawa, V. Gripon, and W. J. Gross, “Reduced-
the conventional NAND-type architecture, respectively, with complexity binary-weight-coded associative memories,” in Proc. IEEE
ICASSP, May 2013, pp. 2523–2527.
a 10.1% area overhead. Future work includes investigating [24] H. Jarollahi, N. Onizawa, and W. J. Gross, “Selective decoding in
sparse compression techniques for the matrix storing the associative memories based on sparse-clustered networks,” in Proc.
connections in order to further reduce the area overhead. IEEE Global Conf. Signal Inf. Process., Dec. 2013, pp. 1270–1273.
[25] J. J. Hopfield, “Neural networks and physical systems with emergent
collective computational abilities,” Proc. Nat. Acad. Sci. USA, vol. 79,
R EFERENCES no. 8, pp. 2554–2558, Apr. 1982.
[26] H. Jarollahi, V. Gripon, N. Onizawa, and W. J. Gross, “A low-power
[1] A. Agarwal et al., “A 128×128 b high-speed wide-and match-line content-addressable memory based on clustered-sparse networks,” in
content addressable memory in 32 nm CMOS,” in Proc. ESSCIRC, Proc. 24th IEEE Int. Conf. ASAP, Jun. 2013, pp. 305–308.
Sep. 2011, pp. 83–86. [27] H. Noda et al., “A cost-efficient high-performance dynamic TCAM
[2] Y.-J. Chang and M.-F. Lan, “Two new techniques integrated for energy- with pipelined hierarchical searching and shift redundancy architecture,”
efficient TLB design,” IEEE Trans. Very Large Scale Integr. (VLSI) Syst., IEEE J. Solid-State Circuits, vol. 40, no. 1, pp. 245–253, Jan. 2005.
vol. 15, no. 1, pp. 13–23, Jan. 2007. [28] K. Pagiamtzis and A. Sheikholeslami, “Pipelined match-lines and hier-
[3] H. Chao, “Next generation routers,” Proc. IEEE, vol. 90, no. 9, archical search-lines for low-power content-addressable memories,” in
pp. 1518–1558, Sep. 2002. Proc. IEEE Custom Integr. Circuits Conf., Sep. 2003, pp. 383–386.
[4] N.-F. Huang, W.-E. Chen, J.-Y. Luo, and J.-M. Chen, “Design of multi- [29] K. Pagiamtzis and A. Sheikholeslami, “A low-power content-
field IPv6 packet classifiers using ternary CAMs,” in Proc. IEEE Global addressable memory (CAM) using pipelined hierarchical search
Telecommun. Conf., vol. 3. 2001, pp. 1877–1881. scheme,” IEEE J. Solid-State Circuits, vol. 39, no. 9, pp. 1512–1519,
[5] M. Meribout, T. Ogura, and M. Nakanishi, “On using the CAM concept Sep. 2004.
for parametric curve extraction,” IEEE Trans. Image Process., vol. 9, [30] H. Noda et al., “A 143 MHz 1.1 W 4.5 Mb dynamic TCAM with
no. 12, pp. 2126–2130, Dec. 2000. hierarchical searching and shift redundancy architecture,” in Proc. IEEE
[6] M. Nakanishi and T. Ogura, “A real-time CAM-based Hough transform ISSCC, vol. 1. Feb. 2004, pp. 208–523.
algorithm and its performance evaluation,” in Proc. 13th Int. Conf. [31] C. Zukowski and S.-Y. Wang, “Use of selective precharge for low-
Pattern Recognit., vol. 2. Aug. 1996, pp. 516–521. power on the match lines of content-addressable memories,” in Proc.
[7] L.-Y. Liu, J.-F. Wang, R.-J. Wang, and J.-Y. Lee, “CAM-based VLSI Int. Workshop Memory Technol., Des. Test., Aug. 1997, pp. 64–68.
architectures for dynamic Huffman coding,” IEEE Trans. Consum. [32] J.-S. Wang, H.-Y. Li, C.-C. Chen, and C. Yeh, “An AND-type match-
Electron., vol. 40, no. 3, pp. 282–289, Aug. 1994. line scheme for energy-efficient content addressable memories,” in IEEE
[8] C.-C. Wang, C.-J. Cheng, T.-F. Chen, and J.-S. Wang, “An adaptively ISSCC Dig. Tech. Papers, vol. 1. Feb. 2005, pp. 464–610.
dividable dual-port BiTCAM for virus-detection processors in mobile [33] M. Motomura, J. Toyoura, K. Hirata, H. Ooka, H. Yamada, and
devices,” IEEE J. Solid-State Circuits, vol. 44, no. 5, pp. 1571–1581, T. Enomoto, “A 1.2-million transistor, 33-MHz, 20-b dictionary search
May 2009. processor (DISP) ULSI with a 160-kb CAM,” IEEE J. Solid-State
[9] B. Wei, R. Tarver, J.-S. Kim, and K. Ng, “A single chip Lempel–Ziv Circuits, vol. 25, no. 5, pp. 1158–1165, Oct. 1990.
data compressor,” in Proc. IEEE ISCAS, May 1993, pp. 1953–1955. [34] K. Schultz and P. Gulak, “Fully parallel integrated CAM/RAM using
[10] S. Panchanathan and M. Goldberg, “A content-addressable memory preclassification to enable large capacities,” IEEE J. Solid-State Circuits,
architecture for image coding using vector quantization,” IEEE Trans. vol. 31, no. 5, pp. 689–699, May 1996.
Signal Process., vol. 39, no. 9, pp. 2066–2078, Sep. 1991. [35] K. Pagiamtzis and A. Sheikholeslami, “Pipelined match-lines and hier-
[11] T. Juan, T. Lang, and J. Navarro, “Reducing TLB power requirements,” archical search-lines for low-power content-addressable memories,” in
in Proc. Int. Symp. Low Power Electron. Des., Aug. 1997, pp. 196–201. Proc. IEEE Custom Integr. Circuits Conf., Sep. 2003, pp. 383–386.
[12] Y.-J. Chang and Y.-H. Liao, “Hybrid-type CAM design for both power
and performance efficiency,” IEEE Trans. Very Large Scale Integr. (VLSI)
Syst., vol. 16, no. 8, pp. 965–974, Aug. 2008.
[13] Z. Lei, H. Xu, D. Ikebuchi, H. Amano, T. Sunata, and M. Namiki,
“Reducing instruction TLB’s leakage power consumption for embedded Hooman Jarollahi (S’09) received the B.A.Sc. and
processors,” in Proc. Int. Green Comput. Conf., Aug. 2010, pp. 477–484. M.A.Sc. degrees in electronics engineering from
[14] S.-H. Yang, Y.-J. Huang, and J.-F. Li, “A low-power ternary content Simon Fraser University, Burnaby, BC, Canada, in
addressable memory with Pai-Sigma matchlines,” IEEE Trans. Very 2008 and 2010, respectively. He is currently pursu-
Large Scale Integr. (VLSI) Syst., vol. 20, no. 10, pp. 1909–1913, ing the Ph.D. degree with the Department of Elec-
Oct. 2012. trical and Computer Engineering, McGill University,
[15] N. Onizawa, S. Matsunaga, V. C. Gaudet, and T. Hanyu, “High- Montreal, QC, Canada.
throughput low-energy content-addressable memory based on self-timed He was a Visiting Scholar with the Research
overlapped search mechanism,” in Proc. Int. Symp. Asynchron. Circuits Institute of Electrical Communication, Tohoku Uni-
Syst., May 2012, pp. 41–48. versity, Sendai, Japan, from 2012 to 2013. His cur-
[16] C.-S. Lin, J.-C. Chang, and B.-D. Liu, “A low-power precomputation- rent research interests include design and hardware
based fully parallel content-addressable memory,” IEEE J. Solid-State implementation of energy-efficient and application-specific VLSI systems,
Circuits, vol. 38, no. 4, pp. 654–662, Apr. 2003. such as associative memories and content-addressable memories.
[17] S.-J. Ruan, C.-Y. Wu, and J.-Y. Hsieh, “Low power design of Mr. Jarollahi was a recipient of the Teledyne DALSA Award in 2010,
precomputation-based content-addressable memory,” IEEE Trans. Very for which he presented a patented architecture of a power and area-efficient
Large Scale Integr. (VLSI) Syst., vol. 16, no. 3, pp. 331–335, Mar. 2008. SRAM.
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.
JAROLLAHI et al.: ALGORITHM AND ARCHITECTURE FOR A LOW-POWER CAM 653
Vincent Gripon received the M.S. degree from Warren J. Gross (SM’10) received the B.A.Sc.
École Normale Supérieure of Cachan, Cachan, degree in electrical engineering from the University
France, and the Ph.D. degree from Télécom of Waterloo, Waterloo, ON, Canada, and the M.A.Sc.
Bretagne, Brest, France. and Ph.D. degrees from the University of Toronto,
He is a Permanent Researcher with Institut Mines- Toronto, ON, in 1996, 1999, and 2003, respectively.
Télécom, Télécom Bretagne. His intent is to propose He is currently an Associate Professor with the
models of neural networks inspired by information Department of Electrical and Computer Engineer-
theory principles, what could be called informa- ing, McGill University, Montral, QC, Canada. His
tional neurosciences. He is also the Co-Creator and current research interests include the design and
Organizer of an online programming contest named implementation of signal processing systems and
TaupIC, which targets the French top undergraduate custom computer architectures.
students. His current research interests include information theory, neuro- Dr. Gross is currently the Chair of the IEEE Signal Processing Society
science, and theoretical and applied computer science. Technical Committee on Design and Implementation of Signal Processing
Systems. He served as a Technical Program Co-Chair of the IEEE Workshop
on Signal Processing Systems in 2012, and as the Chair of the IEEE ICC 2012
Workshop on Emerging Data Storage Technologies. He served as an Associate
Editor of the IEEE T RANSACTIONS ON S IGNAL P ROCESSING. He has served
on the Program Committees of the IEEE Workshop on Signal Processing
Naoya Onizawa (M’09) received the B.E., M.E., Systems, the IEEE Symposium on Field-Programmable Custom Computing
and D.E. degrees in electrical and communication Machines, and the International Conference on Field-Programmable Logic and
engineering from Tohoku University, Sendai, Japan, Applications, and has served as the General Chair of the 6th Annual Analog
in 2004, 2006, and 2009, respectively. Decoding Workshop. He is a licensed Professional Engineer in the Province
He was a Post-Doctoral Fellow with Tohoku Uni- of Ontario.
versity from 2009 to 2011, the University of Water-
loo, Waterloo, ON, Canada, in 2011, and the McGill
University, Montreal, QC, Canada, from 2011 to
2013. He is currently an Assistant Professor with
the Frontier Research Institute for Interdisciplinary
Sciences, Tohoku University. His current research
interests include the energy-efficient VLSI design based on asynchronous
circuits and multiple-valued circuits, and their applications, such as LDPC
decoders, associative memories, and network-on-chips.
Dr. Onizawa was a recipient of the Best Paper Award at the IEEE Computer
Society Annual Symposium on VLSI in 2010.
Authorized licensed use limited to: QIS College Of Engineering & Technology Autonomous. Downloaded on February 08,2023 at 08:49:28 UTC from IEEE Xplore. Restrictions apply.