SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 239
ASIC Implementation for SOBEL Accelerator
Prema C L1, Dr Siva Yellampalli2
1PG student, VTU Extn. Centre, UTL Technologies Ltd, Bengaluru
2Principal, VTU Extn. Centre, UTL Technologies Ltd, Bengaluru
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Due to the Rapid progress in the field of VLSI
technology the device are faster than before .This proposed
accelerator circuit is implemented in such a way that we can
increase the circuit speed . If implement the design using ASIC
the single IC can be programmed for several application in
different times and it increases the performance or speed of the
device. such as video surveillance and monitoring system in
security system. so using ASIC implementation on the design
will get the fault free chip so it won’t cause harm to
lives.Implement the accelerator to remaining edge detection
algorithm to make the high volume production for asic chip to
image processing application and make that chips as cost
effective.
Key Words Accelerator, BIST,ASIC design, Sobel
operartor,ATPG,Fault coverage
1.INTRODUCTION
In modern days, to keep speed with the fast
progressing of technologies, we have to make everything
faster, that’s why accelerator comes into the technology.
Accelerators[1] which increases the device performance.
Accelerators which used in edge detection which speeds up
the edge detection process.
Detection of edges in Video images[[2]can be done
more rapidly than it is achievable with software which are
embedded on a core processor. It is used to achieve faster
detection of edge in video images. It refer to a method of
detecting and locate the quick changes in the intensity ofthe
frequency of the image. The abrupt changes in the intensity
of the pixel can be represented by discontinuityintheimage.
Because the improvement in the fabrication
technology and thus the increase within the logic FPGAlogic
block density, the consumption of FPGA is notconstricted to
any longer than debug and prototyping of digital circuits
which leads to increment in the logic blocks in the FPGA
.Whenever the time to point is important for some
applications ASIC(Application Specific IntegratedCircuit)[5]
is the solution.
The Application Specific Integrated Circuits (ASIC) design of
SOBEL accelerator has two major phases: logical orfrontend
design and physical or backend design.Thestepsinvolvedin
frontend design are verification of results in pre-synthesis
simulation, compile, specifying design constraints,synthesis
of design with ASIC technology libraries, Automatic Test
Pattern Generation (ATPG), insertion of Design For Test
(DFT)) creation and verification of results in post synthesis
simulation. The physical layout design involves floor plan,
power plan, placement, routing, various timing analysis and
verifications of completed design, and verification of results
in post layout simulation. In this project the ASIC
implementation of Sobel accelerator is explained in detail.
This document is template. We ask that authors followsome
simple guidelines. In essence,weask youtomakeyourpaper
look exactly like this document. The easiest way to do this is
simply to download the template, and replace(copy-paste)
the content with your own material. Number the reference
items consecutively in square brackets (e.g. [1]). However
the authors name can be used along with the reference
number in the running text. The order of reference in the
running text should match with the list of references at the
end of the paper.
2.Accelerators for edge detection in video images
An accelerator for edgedetection[2]invideoimages
which is compromise between what a real worldaccelerator
,might do.Edge detection is important part of analysing a
scene in video images and has applications in many areas
such as security monitoring and computer vision
application[3]. It involves identifying places in image where
there is abrupt change in intensity.Those places usually
occur at the boundaries of objects. subsequent analysis of
the edge can be recognizing what the objects are.
Assume Monochrome images of 640x480
pixels[1,6],each of 8 bits ,stored row by row in memory with
successive pixels,left to right in a row,at successive
addresses.Pixels values are interpreted as unsigned integer
ranging from 0(black) to 255(white) .will use a algorithm
called sobel edge detector .It works by computing the
derivatives of the intensity signal in each of the x and y
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 240
directions and looking for maxima and minima in the
derivatives.These are the places where the intensity is
changing most rapidly.The sobel method approximates the
derivatives in each direction for each pixel by a process
called convolution. This involves adding the pixels and its
eight neighbours ,each multiplied by a coefficient. The
coefficients are represented in a 3x3 convolution mask. The
sobel convolutions masks,Gx and Gy ,for the derivatives in
the x and y directions respectively. the derivatives images
being computed by centering each of the convolution masks
over successive pixels in the original image.after multiplying
the coefficient in each mask by the intensity values of the
underlying pixel and sum thenineproductstogethertoform
two partial derivatives for the derivative image,Dx and Dy
Ideally will compute the magnitude of the derivative image
pixel as
GX GY
Fig 2.1 Sobel Convolution mask
|D|= sqrt (Dx2 + Dy2)
3 PROPOSED Accelerator DESIGN for Implementation
3.1 Working principle
Basically it is a pipeline structure, where with pixel
data read from the original image entering into the registers
at the top right, flowing through the 3x3 multiplier array on
the left, then down through the adders to the Dx and Dy
registers, then through the absolute value circuitsandadder
to the |D| register, and finally into the register at the bottom
left.The resulting derivative pixelsarethenwrittenfromthat
register to memory. (While a right-to-left data flow is
opposite to usual practice, in this case, it has the advantage
of preserving the same arrangement of pixels as that in an
image.) We will describe the operation of the pipeline
assuming initially that it is full of data. We will then discuss
how to deal with starting it up at the beginning of an image
row and draining it at the end of the row.
The pipeline generates the derivative pixels for a
given row in groups of four. Theacceleratorreadsfourpixels
from each of the preceding, current, and next rows in
memory into the three 32-bit registers at the top right of the
figure. Each register consist of four 8-bitpixel registers.Over
the four subsequent clock cycles, pixels are shiftedouttothe
left, one pixel at a time, into the multiplier array. Each cell in
the array contains a pixel register and one or two circuits
that multiply the stored pixel by a constant coefficientvalue.
Since the coefficients are all +1, -1, +2, or -2, the circuits are
not full-blown multipliers. Instead, multiplying by -1 is
simply a negator, multiplying by +1 is a through connection
with no circuitry, multiplying by -2 is a left shift of the result
of a negator, and multiplying by +2 is simply a left shift. On
each clock cycle, the array provides thepartial productsfora
single derivative pixel, and the partial products are added
and stored in the Dx and Dy registers. Also, on each clock
cycle, the Dx and Dy values for the preceding pixel havetheir
absolute values computed and added and stored in the |D|
register. The resulting derivativepixel valuesareshiftedinto
the result row register.
When four result pixels are ready in the register,
they are subsequently written to memory. In the steady
state, during processing of a row, the accelerator needs to
write the pixels to memory from the result register before it
can shift new pixels into the multiplier array and the Dx, Dy
and |D| registers. Otherwise, the result values would be
overwritten. Having written four pixels, the accelerator can
push four more pixels through the pipeline, thus emptying
the read registers and filling the result register. It can then
write those result pixels and read in three more groups of
four pixels,and repeat the process.assuminga Wishbonebus
connection with 32-bit-wide data signals and a 100MHz
clock, as suggested earlier. Since the accelerator is one of
-1 0 +1
-2 0 +2
-1 0 +1
+1 +2 +1
0 0 0
-1 -2 -1
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 241
several masters on the memory bus, it must request use of
the bus for the writes and reads and wait until granted
access by the bus arbiter. We assume that the arbiter gives
the accelerator sufficiently high priority that it can use the
memory bandwidth it needs.
Fig 3.1 Sobel Accelerator Architecture
4. ASIC DESIGN OF SOBEL Accelerator
As compared to FPGA which has limited number
of logics, the ASIC supports integration of larger number of
logic gates (higher in density). ASICs are designed to fit a
certain application. ASIC implementation supports digital
or mixed-signal circuit in the design. The ASIC design
saves cost, fabrication time for mass production of the
system.
There are two methods of design flow in ASIC: frontend
and backend design flow. The front end design involves
inclusion of technology library files, design compilation,
elaboration, creation of design constraints, synthesize,
Design For Test (DFT) insertion and Logic Equivalence
Checking (LEC).
The backend physicallayoutdesigninvolvesfloorplanningof
blocks, power planning,placement,layoutroutingandtiming
verifications. The ASIC design of CCG has been carried out
with the help Cadence tools. The ASIC design aspects are
described in the following sections
4.1. ASIC FRONTEND DESIGN
Compile and simulation of design
The Verilog design files and test bench modules are
compiled, elaborated, simulated for timing verification
[10]. The simulation waveforms of pre-synthesize are as
shown in fig-5.
Creating design constraints for Synthesis Clocks are used to
provide timing and synchronization information for digital
design. The design constraints are specified for optimizing
synthesize tool to meet timing requirements.Theconstraints
[11] are identified in this design are master clock signal and
global reset signal which are propagated all the parts of the
design. These signals require larger fan-out and should have
lowest skew in timing.
Fig 4.1 Initialisation of sobel
Fig 4.2 The timings are verified successfully as per design
specification
The derived clock used in clock driver section is also
included in constraint list. All the design constraints are
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 242
specified with the help of RTL Compiler before synthesize
the design.
Synthesis of Design
Synthesis [13] process maps the HDL design with standard
cells specified in technology driven library files. This
library file has timing, power and area related information.
The equivalent physical library file has information about
size, internal delays of cells and input output signals of
standard cells. The physical library files are Library
Exchange Format (LEF), Quantus RC (QRC) file or
Capacitance table are required for backend ASIC design.
These library files are provided by the chip manufacturer.
In order to reduce the effort in backend design, the
physical library files [5] are used as a part of synthesize
process. This method is used for generating floor planning
information as Design Exchange Format (DEF) file
required for backend design. Based on SDC constraints,
the RTL compiler generated optimized hardware in the
form of gate level netlist based on standard cells to meet
design constraint requirements.
Generation of report files and backend import files
After synthesize with technology library, the report files
are generated for the mapped gate level netlist. The fig-
shows report of power and area for the synthesized
design.
Fig-4 Synthesize Report from RTL Compiler
Further, the synthesized reports based on PLE are
generated to simplify the burden of design import to
physical layout design.
Insertion of Design For Test (DFT)
The DFT adds additional hardware for the design which
includes testability features to the verified design. The
added hardware validates the product hardware from
manufacturing defects.
In this design the simple method of scan chain DFT with
Full scan mode has been designed. The scan chain test
patterns are used to access the internal nodes of the chip
by shifting. The synthesize steps are repeated to include
DFT along with design. The additional hardware pins as
the result of DFT insertions are Scan In (SI), Scan Out (SO)
and Scan Enable (SE).
Automatic Test Pattern Generation (ATPG) are generated
with test vectors in Verilog form for scan test and logic test
mode in order to verify fault and global coverage of the
design. The ATPG engine used for generating test patterns
to test the structural integrity of the design.
6. ASIC BACKEND DESIGN
During frontend design, the reports and timings are
analyzed and verified before importing the design to
layout. The following are the major steps involved during
backend ASIC design.
Fig 6.1 Floor planning and power planning
Power and ground rings are added with vertical and
horizontal metal layers placed around the core. The main
aim of the power planning [15] is to provide equal amount
of supply to all standard cells without any supply loss. The
change in supply may adversely affect threshold voltage of
standard cells, setup or hold timing violations of the
design.
In order to provide equal amount of power to the entire
design, the power strips are created vertically and
horizontally, as mesh inside the core. For ease of signal
routing, top layer has been selected for power and ground
nets.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 243
Fig 6.2 Analysis of floor plan
Timing analysis
Timing analysis [15] is carried out to check setup or hold
timing violations. The timing analysis of design requires
timing library files for slow, fast and Signal Integrity (SI)
files. The set of library files are created for min and
maximum timing analysis. The design constraints created
during synthesize process also included for layout timing
analysis. The timing analysis is carried out in three
different phases of ASIC layout design: Pre Clock Tree
Synthesize (CTS) post CTS and post route.
The additional delays are introduced by metal layers and
coupling capacitances analyzed after post routing.
Capacitance table library file has been included for post
route analysis.
Placing CTS
Clock tree or clock buffer has higher driving capability and
are usually placed in design based on larger fan-out
requirements. The clock buffer uniformly distributes the clock
signals with lowest skew in timing. The clock tree buffers are
identified from the standard cells. The identified cells are
placed inside the core.
Layout Routing
Layout routing [17] is classified as global and detailed routing.
The global routing generates estimated delays according to
statics by observing previous manufactured chips. The
detailed routing generates actual wire delays that can be
obtained by several timing optimizations.
Clock tree or clock buffer has higher driving capability and
are usually placed in design based on larger fan-out
requirements. The clock buffer uniformly distributes the
clock signals with lowest skew in timing. The clock tree
buffers are identified from the standard cells. The
identified cells are placed inside the core.
Post route timing analysis
The post route timing has been extracted for analysis. The
values of Worst Negative Slack (WNS) and Total Negative
Slack (TNS) are positive.
Fig 6.3 Post route timing analysis
Post route verification
The following verifications arecarriedoutafterpostrouting:
Verification on Geometry, Design Rule Checks (DRC),
connectivity, Bus guide, metal via and process antenna. All
the verifications are passed with zero violations.
TABLE – I POST ROUTE VERIFICATIONS
Verification on Violations
Bus guide Zero
Geometry Zero
Connectivity Zero
Process antenna Zero
DRC Zero
SUMMARY OF TOOLS USED
The following tool sets from Cadence are used for completing
this project.
TABLE -2 Tools Used
Cadence tool name Used for
NCLAUNCH Presynthesis
NCSIM Simulation
RTL COMPILER Synthesis
ENCOUNTER TEST DFT
7.Conclusion
The ASIC Implementation for Sobel Accelerator
completed successfully which compiles all the design
requirements. Thetimingin PreCTS,PostCTSandPost-Route
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 244
analysis are verified successfully. The ASIC post layout
verifications carried out on geometry, DRC, connectivity,
metal density, process antenna. The GLN and SDF file are
extracted from the layout forsimulation.Sobel Acceleratoris
implemented using ASIC flow. Area, power, timingandgates
reports analysed.In the ASIC flow have also Implemented
SDC files,DFT insertion, Physical design and verified with
timing violations. The Sobel accelerator implementation in
ASIC has been completed with Taiwan Semiconductor
Manufacturing Technology (TSMC)90nmtechnologylibrary
files. The core die size is 0.11mm² and 6 metal layers are
used for the layout design to complete. After all the
verifications, the GDS file has been generated which can be
supplied to chip manufacturer for fabrication.
Future work
ASIC implementation done for Sobel acceleratorfor
video images in medical and defence application which
depend on sobel edge detection algorithm but various
algorithms are used for edge detection such as
prewitt,canny,Robert algorithms so by developing
accelerator for those algorithms and implement that to into
ASIC flow to make their own chip in large volume for
corresponding application.
References:
[1]. P.J. Ashenden, Digital Design An Embedded
System Approach Using Verilog, Morgan Kaufmann, 2008.
[2]. An FPGA based Hardware Accelerator for Real
Time Video Segmentation System ICACSIS 2011 ISBN: 978-
979-1421-11-9
[3].Hardware Description of Multi-Directional Fast
Sobel Edge Detection Processor by VHDL for Implementing
on FPGA , Volume 47– No.25, June 2012
[4].FPGA based Image Edge Detection and
Segmentation, Vol.9.No.2, 2011,p.187-192.
[5]. Michael John Sebastian Smith, “Application
Specific Integrated Circutis”, Pearsion Education Inc, 12th
impression, 2013
[6]. I.Yasri, N.H.Hamid, V.V.Yap, “Implementationof
an FPGA based Sobel Edge Detection Operator”,IGCES,2008.
[7].Shukor, Lo HaiHiung, Patrick Sebastian3, 2007.
―Implementation of Real-time Simple Edge Detection on
FPGA‖ pp. 1404-1405,IEEE.
[8].Cadence NCLaunch User Guide, ProductVersion
14.1, June 2014
[9].Cadence RTL Compiler User Guide, Product
Version 12.2, August
[10].Setting Constraints and Performing Timing
Analysis Using Encounter RTL Compiler, Product Version
12.2, August 2013
[11]. Design with RTL Compiler Physical, Product
Version 12.2, May 2013
[12].Design For Test Encounter RTL Compiler,
Product Version 12.2, August 2013
[13].A. Chandra, S. Chebiyam, and R. Kapur
Synopsys, Inc., “A Case Study on Implementing Compressed
DFT Architecture” , IEEE 23rd Asian Test Symposium, 2014,
pp. 336 - 341
[14] Message Reference for Encounter RTL
Compiler, Product Version 12.2, May 2013
[5] Cadence EDI User Guide, ProductVersion14.20,
October 2014
[16] Cadence I/O Planner: Application Note,
Product Version 16.2, November 2008
[17] Plato NanoRoute User‟s Guide, Version 2.5,
Rev. D
[19] SDF Timing Annotation, Product Version 14.2,
January 2015

More Related Content

What's hot (20)

PDF
FPGA FIR filter implementation (Audio signal processing)
Hocine Merabti
 
PDF
IRJET-Hardware Co-Simulation of Classical Edge Detection Algorithms using Xil...
IRJET Journal
 
PPTX
9.design of high speed area efficient low power vedic multiplier using revers...
nareshbk
 
PDF
Mini Project on 4 BIT SERIAL MULTIPLIER
j naga sai
 
PPT
Fpga 11-sequence-detector-fir-iir-filter
Malik Tauqir Hasan
 
PPT
B Eng Final Year Project Presentation
jesujoseph
 
PDF
IRJET- Design of 16 Bit Low Power Vedic Architecture using CSA & UTS
IRJET Journal
 
PDF
Bivariatealgebraic integerencoded arai algorithm for
eSAT Publishing House
 
PDF
A New Cross Diamond Search Motion Estimation Algorithm for HEVC
IJERA Editor
 
DOCX
Fpga implementation of high speed 8 bit vedic multiplier using barrel shifter(1)
Karthik Sagar
 
PDF
Power and Delay Analysis of Logic Circuits Using Reversible Gates
RSIS International
 
PDF
Modified approximate 8-point multiplier less DCT like transform
IJERA Editor
 
PPTX
FPGA Implementation of a GA
Hocine Merabti
 
PDF
Review on Multiply-Accumulate Unit
IJERA Editor
 
PDF
Bit Serial multiplier using Verilog
BhargavKatkam
 
PDF
A Configurable and Low Power Hard-Decision Viterbi Decoder in VLSI Architecture
IRJET Journal
 
PDF
COUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSOR
IJNSA Journal
 
PDF
HEVC 2D-DCT architectures comparison for FPGA and ASIC implementations
TELKOMNIKA JOURNAL
 
PDF
E0364025031
theijes
 
PDF
IRJET - A Speculative Approximate Adder for Error Recovery Unit
IRJET Journal
 
FPGA FIR filter implementation (Audio signal processing)
Hocine Merabti
 
IRJET-Hardware Co-Simulation of Classical Edge Detection Algorithms using Xil...
IRJET Journal
 
9.design of high speed area efficient low power vedic multiplier using revers...
nareshbk
 
Mini Project on 4 BIT SERIAL MULTIPLIER
j naga sai
 
Fpga 11-sequence-detector-fir-iir-filter
Malik Tauqir Hasan
 
B Eng Final Year Project Presentation
jesujoseph
 
IRJET- Design of 16 Bit Low Power Vedic Architecture using CSA & UTS
IRJET Journal
 
Bivariatealgebraic integerencoded arai algorithm for
eSAT Publishing House
 
A New Cross Diamond Search Motion Estimation Algorithm for HEVC
IJERA Editor
 
Fpga implementation of high speed 8 bit vedic multiplier using barrel shifter(1)
Karthik Sagar
 
Power and Delay Analysis of Logic Circuits Using Reversible Gates
RSIS International
 
Modified approximate 8-point multiplier less DCT like transform
IJERA Editor
 
FPGA Implementation of a GA
Hocine Merabti
 
Review on Multiply-Accumulate Unit
IJERA Editor
 
Bit Serial multiplier using Verilog
BhargavKatkam
 
A Configurable and Low Power Hard-Decision Viterbi Decoder in VLSI Architecture
IRJET Journal
 
COUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSOR
IJNSA Journal
 
HEVC 2D-DCT architectures comparison for FPGA and ASIC implementations
TELKOMNIKA JOURNAL
 
E0364025031
theijes
 
IRJET - A Speculative Approximate Adder for Error Recovery Unit
IRJET Journal
 

Viewers also liked (20)

PDF
IRJET-A Review on Two Stroke Single Cylinder Compressed Air Engine
IRJET Journal
 
PDF
IRJET-A Survey on Stealthy Denial of Service Strategy in Cloud Computing
IRJET Journal
 
PDF
IRJET-Design of Pile Foundation for site in Sangli district of Maharashtra: C...
IRJET Journal
 
PDF
IRJET- Missing Value Evaluation in SQL Queries: A Survey
IRJET Journal
 
PDF
IRJET-A Blind Watermarking Algorithm
IRJET Journal
 
PDF
IRJET- Assessment of Environmental Impacts during Operational Phase of a T...
IRJET Journal
 
PDF
IRJET-Solar Power Generation with Capacitor Based Seven Level Inverter System
IRJET Journal
 
PDF
IRJET-Comparative Analysis of DCT and DWT based novel methods for Watermarking
IRJET Journal
 
PDF
IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...
IRJET Journal
 
PDF
IRJET- A Design Implementation of Single Stage Amplifiers using HEMT Tec...
IRJET Journal
 
PDF
IRJET-A Design Of Modified SSTBC Encoder to Noise Free Mimo Communication in ...
IRJET Journal
 
PDF
IRJET- Simulation based design and analysis of combined effect of various ...
IRJET Journal
 
PDF
IRJET-Second Throat Diffuser System at Different Back Pressure for High Altit...
IRJET Journal
 
PDF
IRJET- Sensrless Luenberger Observer Based Sliding Mode Control of DC Motor
IRJET Journal
 
PDF
IRJET-A Survey On Opportunistic Piggyback Marking For IP Trace Back
IRJET Journal
 
PDF
IRJET-Accessing Information about Programs and Services through a Voice Site ...
IRJET Journal
 
PDF
IRJET-Power Flow & Voltage Stability Analysis using MATLAB
IRJET Journal
 
PDF
IRJET-Design And Development Of Sugar Cane Sprout Cutter Machine By Human Pow...
IRJET Journal
 
PDF
IRJET- Assessment of Environmental Impacts during Operational Phase of a T...
IRJET Journal
 
PDF
IRJET- Review of Vapour Absorption System and Vapour Compression System.
IRJET Journal
 
IRJET-A Review on Two Stroke Single Cylinder Compressed Air Engine
IRJET Journal
 
IRJET-A Survey on Stealthy Denial of Service Strategy in Cloud Computing
IRJET Journal
 
IRJET-Design of Pile Foundation for site in Sangli district of Maharashtra: C...
IRJET Journal
 
IRJET- Missing Value Evaluation in SQL Queries: A Survey
IRJET Journal
 
IRJET-A Blind Watermarking Algorithm
IRJET Journal
 
IRJET- Assessment of Environmental Impacts during Operational Phase of a T...
IRJET Journal
 
IRJET-Solar Power Generation with Capacitor Based Seven Level Inverter System
IRJET Journal
 
IRJET-Comparative Analysis of DCT and DWT based novel methods for Watermarking
IRJET Journal
 
IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...
IRJET Journal
 
IRJET- A Design Implementation of Single Stage Amplifiers using HEMT Tec...
IRJET Journal
 
IRJET-A Design Of Modified SSTBC Encoder to Noise Free Mimo Communication in ...
IRJET Journal
 
IRJET- Simulation based design and analysis of combined effect of various ...
IRJET Journal
 
IRJET-Second Throat Diffuser System at Different Back Pressure for High Altit...
IRJET Journal
 
IRJET- Sensrless Luenberger Observer Based Sliding Mode Control of DC Motor
IRJET Journal
 
IRJET-A Survey On Opportunistic Piggyback Marking For IP Trace Back
IRJET Journal
 
IRJET-Accessing Information about Programs and Services through a Voice Site ...
IRJET Journal
 
IRJET-Power Flow & Voltage Stability Analysis using MATLAB
IRJET Journal
 
IRJET-Design And Development Of Sugar Cane Sprout Cutter Machine By Human Pow...
IRJET Journal
 
IRJET- Assessment of Environmental Impacts during Operational Phase of a T...
IRJET Journal
 
IRJET- Review of Vapour Absorption System and Vapour Compression System.
IRJET Journal
 
Ad

Similar to IRJET-ASIC Implementation for SOBEL Accelerator (20)

PPT
09 accelerators
Murali M
 
PPTX
Sobel Edge Detection Using FPGA
ghanshyam zambare
 
PDF
An Efficient FPGA Implemenation of MRI Image Filtering and Tumour Characteriz...
VLSICS Design
 
PDF
AN EFFICIENT FPGA IMPLEMENTATION OF MRI IMAGE FILTERING AND TUMOUR CHARACTERI...
VLSICS Design
 
PDF
Hardware software co simulation of edge detection for image processing system...
eSAT Publishing House
 
PDF
The Computation Complexity Reduction of 2-D Gaussian Filter
IRJET Journal
 
PPTX
imagefiltervhdl.pptx
Akbarali206563
 
PDF
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
VLSICS Design
 
PDF
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
VLSICS Design
 
PDF
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
VLSICS Design
 
PDF
Sai Dheeraj_Resume
Sai Dheeraj Polagani
 
PDF
“Processing Raw Images Efficiently on the MAX78000 Neural Network Accelerator...
Edge AI and Vision Alliance
 
PDF
Hardware for Deep Learning AI ML CNN.pdf
AhmedSaeed115917
 
PDF
FPGA Implementation for Image Edge Detection using Xilinx System Generator
rahulmonikasharma
 
PDF
"Can We Have Both Safety and Performance in AI for Autonomous Vehicles?," a P...
Edge AI and Vision Alliance
 
PDF
C0161018
IOSR Journals
 
PDF
C0161018
IOSR Journals
 
PDF
Real Time Video Processing in FPGA
QuEST Global (erstwhile NeST Software)
 
PPTX
Edge Detection using 4 bit MAC on Basys3 FPGA
girilogu2
 
PPTX
ASIC Design Fundamentals.pptx
RameshK531901
 
09 accelerators
Murali M
 
Sobel Edge Detection Using FPGA
ghanshyam zambare
 
An Efficient FPGA Implemenation of MRI Image Filtering and Tumour Characteriz...
VLSICS Design
 
AN EFFICIENT FPGA IMPLEMENTATION OF MRI IMAGE FILTERING AND TUMOUR CHARACTERI...
VLSICS Design
 
Hardware software co simulation of edge detection for image processing system...
eSAT Publishing House
 
The Computation Complexity Reduction of 2-D Gaussian Filter
IRJET Journal
 
imagefiltervhdl.pptx
Akbarali206563
 
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
VLSICS Design
 
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
VLSICS Design
 
EFFICIENT ABSOLUTE DIFFERENCE CIRCUIT FOR SAD COMPUTATION ON FPGA
VLSICS Design
 
Sai Dheeraj_Resume
Sai Dheeraj Polagani
 
“Processing Raw Images Efficiently on the MAX78000 Neural Network Accelerator...
Edge AI and Vision Alliance
 
Hardware for Deep Learning AI ML CNN.pdf
AhmedSaeed115917
 
FPGA Implementation for Image Edge Detection using Xilinx System Generator
rahulmonikasharma
 
"Can We Have Both Safety and Performance in AI for Autonomous Vehicles?," a P...
Edge AI and Vision Alliance
 
C0161018
IOSR Journals
 
C0161018
IOSR Journals
 
Real Time Video Processing in FPGA
QuEST Global (erstwhile NeST Software)
 
Edge Detection using 4 bit MAC on Basys3 FPGA
girilogu2
 
ASIC Design Fundamentals.pptx
RameshK531901
 
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
IRJET Journal
 
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
IRJET Journal
 
PDF
Kiona – A Smart Society Automation Project
IRJET Journal
 
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
IRJET Journal
 
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
IRJET Journal
 
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
IRJET Journal
 
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
IRJET Journal
 
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
IRJET Journal
 
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
IRJET Journal
 
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
IRJET Journal
 
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
IRJET Journal
 
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
IRJET Journal
 
PDF
Breast Cancer Detection using Computer Vision
IRJET Journal
 
PDF
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
PDF
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
IRJET Journal
 
PDF
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
PDF
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
IRJET Journal
 
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
IRJET Journal
 
Kiona – A Smart Society Automation Project
IRJET Journal
 
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
IRJET Journal
 
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
IRJET Journal
 
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
IRJET Journal
 
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
IRJET Journal
 
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
IRJET Journal
 
BRAIN TUMOUR DETECTION AND CLASSIFICATION
IRJET Journal
 
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
IRJET Journal
 
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
IRJET Journal
 
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
IRJET Journal
 
Breast Cancer Detection using Computer Vision
IRJET Journal
 
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
IRJET Journal
 
Auto-Charging E-Vehicle with its battery Management.
IRJET Journal
 
Analysis of high energy charge particle in the Heliosphere
IRJET Journal
 
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
IRJET Journal
 

Recently uploaded (20)

PPTX
darshai cross section and river section analysis
muk7971
 
PPTX
Fundamentals of Quantitative Design and Analysis.pptx
aliali240367
 
PDF
Module - 4 Machine Learning -22ISE62.pdf
Dr. Shivashankar
 
PDF
Tesia Dobrydnia - An Avid Hiker And Backpacker
Tesia Dobrydnia
 
PDF
20ES1152 Programming for Problem Solving Lab Manual VRSEC.pdf
Ashutosh Satapathy
 
PDF
Submit Your Papers-International Journal on Cybernetics & Informatics ( IJCI)
IJCI JOURNAL
 
PPTX
Introduction to Internal Combustion Engines - Types, Working and Camparison.pptx
UtkarshPatil98
 
PPTX
Distribution reservoir and service storage pptx
dhanashree78
 
PPTX
UNIT 1 - INTRODUCTION TO AI and AI tools and basic concept
gokuld13012005
 
PDF
PROGRAMMING REQUESTS/RESPONSES WITH GREATFREE IN THE CLOUD ENVIRONMENT
samueljackson3773
 
PDF
Bayesian Learning - Naive Bayes Algorithm
Sharmila Chidaravalli
 
PPTX
Explore USA’s Best Structural And Non Structural Steel Detailing
Silicon Engineering Consultants LLC
 
PDF
3rd International Conference on Machine Learning and IoT (MLIoT 2025)
ClaraZara1
 
PDF
Digital water marking system project report
Kamal Acharya
 
PPTX
Unit_I Functional Units, Instruction Sets.pptx
logaprakash9
 
PDF
WD2(I)-RFQ-GW-1415_ Shifting and Filling of Sand in the Pond at the WD5 Area_...
ShahadathHossain23
 
PDF
Información de microsoft purview herramienta de microsoft
macarenabenitez6
 
PPTX
OCS353 DATA SCIENCE FUNDAMENTALS- Unit 1 Introduction to Data Science
A R SIVANESH M.E., (Ph.D)
 
PDF
13th International Conference of Security, Privacy and Trust Management (SPTM...
ijcisjournal
 
DOCX
Engineering Geology Field Report to Malekhu .docx
justprashant567
 
darshai cross section and river section analysis
muk7971
 
Fundamentals of Quantitative Design and Analysis.pptx
aliali240367
 
Module - 4 Machine Learning -22ISE62.pdf
Dr. Shivashankar
 
Tesia Dobrydnia - An Avid Hiker And Backpacker
Tesia Dobrydnia
 
20ES1152 Programming for Problem Solving Lab Manual VRSEC.pdf
Ashutosh Satapathy
 
Submit Your Papers-International Journal on Cybernetics & Informatics ( IJCI)
IJCI JOURNAL
 
Introduction to Internal Combustion Engines - Types, Working and Camparison.pptx
UtkarshPatil98
 
Distribution reservoir and service storage pptx
dhanashree78
 
UNIT 1 - INTRODUCTION TO AI and AI tools and basic concept
gokuld13012005
 
PROGRAMMING REQUESTS/RESPONSES WITH GREATFREE IN THE CLOUD ENVIRONMENT
samueljackson3773
 
Bayesian Learning - Naive Bayes Algorithm
Sharmila Chidaravalli
 
Explore USA’s Best Structural And Non Structural Steel Detailing
Silicon Engineering Consultants LLC
 
3rd International Conference on Machine Learning and IoT (MLIoT 2025)
ClaraZara1
 
Digital water marking system project report
Kamal Acharya
 
Unit_I Functional Units, Instruction Sets.pptx
logaprakash9
 
WD2(I)-RFQ-GW-1415_ Shifting and Filling of Sand in the Pond at the WD5 Area_...
ShahadathHossain23
 
Información de microsoft purview herramienta de microsoft
macarenabenitez6
 
OCS353 DATA SCIENCE FUNDAMENTALS- Unit 1 Introduction to Data Science
A R SIVANESH M.E., (Ph.D)
 
13th International Conference of Security, Privacy and Trust Management (SPTM...
ijcisjournal
 
Engineering Geology Field Report to Malekhu .docx
justprashant567
 

IRJET-ASIC Implementation for SOBEL Accelerator

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 239 ASIC Implementation for SOBEL Accelerator Prema C L1, Dr Siva Yellampalli2 1PG student, VTU Extn. Centre, UTL Technologies Ltd, Bengaluru 2Principal, VTU Extn. Centre, UTL Technologies Ltd, Bengaluru ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - Due to the Rapid progress in the field of VLSI technology the device are faster than before .This proposed accelerator circuit is implemented in such a way that we can increase the circuit speed . If implement the design using ASIC the single IC can be programmed for several application in different times and it increases the performance or speed of the device. such as video surveillance and monitoring system in security system. so using ASIC implementation on the design will get the fault free chip so it won’t cause harm to lives.Implement the accelerator to remaining edge detection algorithm to make the high volume production for asic chip to image processing application and make that chips as cost effective. Key Words Accelerator, BIST,ASIC design, Sobel operartor,ATPG,Fault coverage 1.INTRODUCTION In modern days, to keep speed with the fast progressing of technologies, we have to make everything faster, that’s why accelerator comes into the technology. Accelerators[1] which increases the device performance. Accelerators which used in edge detection which speeds up the edge detection process. Detection of edges in Video images[[2]can be done more rapidly than it is achievable with software which are embedded on a core processor. It is used to achieve faster detection of edge in video images. It refer to a method of detecting and locate the quick changes in the intensity ofthe frequency of the image. The abrupt changes in the intensity of the pixel can be represented by discontinuityintheimage. Because the improvement in the fabrication technology and thus the increase within the logic FPGAlogic block density, the consumption of FPGA is notconstricted to any longer than debug and prototyping of digital circuits which leads to increment in the logic blocks in the FPGA .Whenever the time to point is important for some applications ASIC(Application Specific IntegratedCircuit)[5] is the solution. The Application Specific Integrated Circuits (ASIC) design of SOBEL accelerator has two major phases: logical orfrontend design and physical or backend design.Thestepsinvolvedin frontend design are verification of results in pre-synthesis simulation, compile, specifying design constraints,synthesis of design with ASIC technology libraries, Automatic Test Pattern Generation (ATPG), insertion of Design For Test (DFT)) creation and verification of results in post synthesis simulation. The physical layout design involves floor plan, power plan, placement, routing, various timing analysis and verifications of completed design, and verification of results in post layout simulation. In this project the ASIC implementation of Sobel accelerator is explained in detail. This document is template. We ask that authors followsome simple guidelines. In essence,weask youtomakeyourpaper look exactly like this document. The easiest way to do this is simply to download the template, and replace(copy-paste) the content with your own material. Number the reference items consecutively in square brackets (e.g. [1]). However the authors name can be used along with the reference number in the running text. The order of reference in the running text should match with the list of references at the end of the paper. 2.Accelerators for edge detection in video images An accelerator for edgedetection[2]invideoimages which is compromise between what a real worldaccelerator ,might do.Edge detection is important part of analysing a scene in video images and has applications in many areas such as security monitoring and computer vision application[3]. It involves identifying places in image where there is abrupt change in intensity.Those places usually occur at the boundaries of objects. subsequent analysis of the edge can be recognizing what the objects are. Assume Monochrome images of 640x480 pixels[1,6],each of 8 bits ,stored row by row in memory with successive pixels,left to right in a row,at successive addresses.Pixels values are interpreted as unsigned integer ranging from 0(black) to 255(white) .will use a algorithm called sobel edge detector .It works by computing the derivatives of the intensity signal in each of the x and y
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 240 directions and looking for maxima and minima in the derivatives.These are the places where the intensity is changing most rapidly.The sobel method approximates the derivatives in each direction for each pixel by a process called convolution. This involves adding the pixels and its eight neighbours ,each multiplied by a coefficient. The coefficients are represented in a 3x3 convolution mask. The sobel convolutions masks,Gx and Gy ,for the derivatives in the x and y directions respectively. the derivatives images being computed by centering each of the convolution masks over successive pixels in the original image.after multiplying the coefficient in each mask by the intensity values of the underlying pixel and sum thenineproductstogethertoform two partial derivatives for the derivative image,Dx and Dy Ideally will compute the magnitude of the derivative image pixel as GX GY Fig 2.1 Sobel Convolution mask |D|= sqrt (Dx2 + Dy2) 3 PROPOSED Accelerator DESIGN for Implementation 3.1 Working principle Basically it is a pipeline structure, where with pixel data read from the original image entering into the registers at the top right, flowing through the 3x3 multiplier array on the left, then down through the adders to the Dx and Dy registers, then through the absolute value circuitsandadder to the |D| register, and finally into the register at the bottom left.The resulting derivative pixelsarethenwrittenfromthat register to memory. (While a right-to-left data flow is opposite to usual practice, in this case, it has the advantage of preserving the same arrangement of pixels as that in an image.) We will describe the operation of the pipeline assuming initially that it is full of data. We will then discuss how to deal with starting it up at the beginning of an image row and draining it at the end of the row. The pipeline generates the derivative pixels for a given row in groups of four. Theacceleratorreadsfourpixels from each of the preceding, current, and next rows in memory into the three 32-bit registers at the top right of the figure. Each register consist of four 8-bitpixel registers.Over the four subsequent clock cycles, pixels are shiftedouttothe left, one pixel at a time, into the multiplier array. Each cell in the array contains a pixel register and one or two circuits that multiply the stored pixel by a constant coefficientvalue. Since the coefficients are all +1, -1, +2, or -2, the circuits are not full-blown multipliers. Instead, multiplying by -1 is simply a negator, multiplying by +1 is a through connection with no circuitry, multiplying by -2 is a left shift of the result of a negator, and multiplying by +2 is simply a left shift. On each clock cycle, the array provides thepartial productsfora single derivative pixel, and the partial products are added and stored in the Dx and Dy registers. Also, on each clock cycle, the Dx and Dy values for the preceding pixel havetheir absolute values computed and added and stored in the |D| register. The resulting derivativepixel valuesareshiftedinto the result row register. When four result pixels are ready in the register, they are subsequently written to memory. In the steady state, during processing of a row, the accelerator needs to write the pixels to memory from the result register before it can shift new pixels into the multiplier array and the Dx, Dy and |D| registers. Otherwise, the result values would be overwritten. Having written four pixels, the accelerator can push four more pixels through the pipeline, thus emptying the read registers and filling the result register. It can then write those result pixels and read in three more groups of four pixels,and repeat the process.assuminga Wishbonebus connection with 32-bit-wide data signals and a 100MHz clock, as suggested earlier. Since the accelerator is one of -1 0 +1 -2 0 +2 -1 0 +1 +1 +2 +1 0 0 0 -1 -2 -1
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 241 several masters on the memory bus, it must request use of the bus for the writes and reads and wait until granted access by the bus arbiter. We assume that the arbiter gives the accelerator sufficiently high priority that it can use the memory bandwidth it needs. Fig 3.1 Sobel Accelerator Architecture 4. ASIC DESIGN OF SOBEL Accelerator As compared to FPGA which has limited number of logics, the ASIC supports integration of larger number of logic gates (higher in density). ASICs are designed to fit a certain application. ASIC implementation supports digital or mixed-signal circuit in the design. The ASIC design saves cost, fabrication time for mass production of the system. There are two methods of design flow in ASIC: frontend and backend design flow. The front end design involves inclusion of technology library files, design compilation, elaboration, creation of design constraints, synthesize, Design For Test (DFT) insertion and Logic Equivalence Checking (LEC). The backend physicallayoutdesigninvolvesfloorplanningof blocks, power planning,placement,layoutroutingandtiming verifications. The ASIC design of CCG has been carried out with the help Cadence tools. The ASIC design aspects are described in the following sections 4.1. ASIC FRONTEND DESIGN Compile and simulation of design The Verilog design files and test bench modules are compiled, elaborated, simulated for timing verification [10]. The simulation waveforms of pre-synthesize are as shown in fig-5. Creating design constraints for Synthesis Clocks are used to provide timing and synchronization information for digital design. The design constraints are specified for optimizing synthesize tool to meet timing requirements.Theconstraints [11] are identified in this design are master clock signal and global reset signal which are propagated all the parts of the design. These signals require larger fan-out and should have lowest skew in timing. Fig 4.1 Initialisation of sobel Fig 4.2 The timings are verified successfully as per design specification The derived clock used in clock driver section is also included in constraint list. All the design constraints are
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 242 specified with the help of RTL Compiler before synthesize the design. Synthesis of Design Synthesis [13] process maps the HDL design with standard cells specified in technology driven library files. This library file has timing, power and area related information. The equivalent physical library file has information about size, internal delays of cells and input output signals of standard cells. The physical library files are Library Exchange Format (LEF), Quantus RC (QRC) file or Capacitance table are required for backend ASIC design. These library files are provided by the chip manufacturer. In order to reduce the effort in backend design, the physical library files [5] are used as a part of synthesize process. This method is used for generating floor planning information as Design Exchange Format (DEF) file required for backend design. Based on SDC constraints, the RTL compiler generated optimized hardware in the form of gate level netlist based on standard cells to meet design constraint requirements. Generation of report files and backend import files After synthesize with technology library, the report files are generated for the mapped gate level netlist. The fig- shows report of power and area for the synthesized design. Fig-4 Synthesize Report from RTL Compiler Further, the synthesized reports based on PLE are generated to simplify the burden of design import to physical layout design. Insertion of Design For Test (DFT) The DFT adds additional hardware for the design which includes testability features to the verified design. The added hardware validates the product hardware from manufacturing defects. In this design the simple method of scan chain DFT with Full scan mode has been designed. The scan chain test patterns are used to access the internal nodes of the chip by shifting. The synthesize steps are repeated to include DFT along with design. The additional hardware pins as the result of DFT insertions are Scan In (SI), Scan Out (SO) and Scan Enable (SE). Automatic Test Pattern Generation (ATPG) are generated with test vectors in Verilog form for scan test and logic test mode in order to verify fault and global coverage of the design. The ATPG engine used for generating test patterns to test the structural integrity of the design. 6. ASIC BACKEND DESIGN During frontend design, the reports and timings are analyzed and verified before importing the design to layout. The following are the major steps involved during backend ASIC design. Fig 6.1 Floor planning and power planning Power and ground rings are added with vertical and horizontal metal layers placed around the core. The main aim of the power planning [15] is to provide equal amount of supply to all standard cells without any supply loss. The change in supply may adversely affect threshold voltage of standard cells, setup or hold timing violations of the design. In order to provide equal amount of power to the entire design, the power strips are created vertically and horizontally, as mesh inside the core. For ease of signal routing, top layer has been selected for power and ground nets.
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 243 Fig 6.2 Analysis of floor plan Timing analysis Timing analysis [15] is carried out to check setup or hold timing violations. The timing analysis of design requires timing library files for slow, fast and Signal Integrity (SI) files. The set of library files are created for min and maximum timing analysis. The design constraints created during synthesize process also included for layout timing analysis. The timing analysis is carried out in three different phases of ASIC layout design: Pre Clock Tree Synthesize (CTS) post CTS and post route. The additional delays are introduced by metal layers and coupling capacitances analyzed after post routing. Capacitance table library file has been included for post route analysis. Placing CTS Clock tree or clock buffer has higher driving capability and are usually placed in design based on larger fan-out requirements. The clock buffer uniformly distributes the clock signals with lowest skew in timing. The clock tree buffers are identified from the standard cells. The identified cells are placed inside the core. Layout Routing Layout routing [17] is classified as global and detailed routing. The global routing generates estimated delays according to statics by observing previous manufactured chips. The detailed routing generates actual wire delays that can be obtained by several timing optimizations. Clock tree or clock buffer has higher driving capability and are usually placed in design based on larger fan-out requirements. The clock buffer uniformly distributes the clock signals with lowest skew in timing. The clock tree buffers are identified from the standard cells. The identified cells are placed inside the core. Post route timing analysis The post route timing has been extracted for analysis. The values of Worst Negative Slack (WNS) and Total Negative Slack (TNS) are positive. Fig 6.3 Post route timing analysis Post route verification The following verifications arecarriedoutafterpostrouting: Verification on Geometry, Design Rule Checks (DRC), connectivity, Bus guide, metal via and process antenna. All the verifications are passed with zero violations. TABLE – I POST ROUTE VERIFICATIONS Verification on Violations Bus guide Zero Geometry Zero Connectivity Zero Process antenna Zero DRC Zero SUMMARY OF TOOLS USED The following tool sets from Cadence are used for completing this project. TABLE -2 Tools Used Cadence tool name Used for NCLAUNCH Presynthesis NCSIM Simulation RTL COMPILER Synthesis ENCOUNTER TEST DFT 7.Conclusion The ASIC Implementation for Sobel Accelerator completed successfully which compiles all the design requirements. Thetimingin PreCTS,PostCTSandPost-Route
  • 6. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 01 | Jan -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 244 analysis are verified successfully. The ASIC post layout verifications carried out on geometry, DRC, connectivity, metal density, process antenna. The GLN and SDF file are extracted from the layout forsimulation.Sobel Acceleratoris implemented using ASIC flow. Area, power, timingandgates reports analysed.In the ASIC flow have also Implemented SDC files,DFT insertion, Physical design and verified with timing violations. The Sobel accelerator implementation in ASIC has been completed with Taiwan Semiconductor Manufacturing Technology (TSMC)90nmtechnologylibrary files. The core die size is 0.11mm² and 6 metal layers are used for the layout design to complete. After all the verifications, the GDS file has been generated which can be supplied to chip manufacturer for fabrication. Future work ASIC implementation done for Sobel acceleratorfor video images in medical and defence application which depend on sobel edge detection algorithm but various algorithms are used for edge detection such as prewitt,canny,Robert algorithms so by developing accelerator for those algorithms and implement that to into ASIC flow to make their own chip in large volume for corresponding application. References: [1]. P.J. Ashenden, Digital Design An Embedded System Approach Using Verilog, Morgan Kaufmann, 2008. [2]. An FPGA based Hardware Accelerator for Real Time Video Segmentation System ICACSIS 2011 ISBN: 978- 979-1421-11-9 [3].Hardware Description of Multi-Directional Fast Sobel Edge Detection Processor by VHDL for Implementing on FPGA , Volume 47– No.25, June 2012 [4].FPGA based Image Edge Detection and Segmentation, Vol.9.No.2, 2011,p.187-192. [5]. Michael John Sebastian Smith, “Application Specific Integrated Circutis”, Pearsion Education Inc, 12th impression, 2013 [6]. I.Yasri, N.H.Hamid, V.V.Yap, “Implementationof an FPGA based Sobel Edge Detection Operator”,IGCES,2008. [7].Shukor, Lo HaiHiung, Patrick Sebastian3, 2007. ―Implementation of Real-time Simple Edge Detection on FPGA‖ pp. 1404-1405,IEEE. [8].Cadence NCLaunch User Guide, ProductVersion 14.1, June 2014 [9].Cadence RTL Compiler User Guide, Product Version 12.2, August [10].Setting Constraints and Performing Timing Analysis Using Encounter RTL Compiler, Product Version 12.2, August 2013 [11]. Design with RTL Compiler Physical, Product Version 12.2, May 2013 [12].Design For Test Encounter RTL Compiler, Product Version 12.2, August 2013 [13].A. Chandra, S. Chebiyam, and R. Kapur Synopsys, Inc., “A Case Study on Implementing Compressed DFT Architecture” , IEEE 23rd Asian Test Symposium, 2014, pp. 336 - 341 [14] Message Reference for Encounter RTL Compiler, Product Version 12.2, May 2013 [5] Cadence EDI User Guide, ProductVersion14.20, October 2014 [16] Cadence I/O Planner: Application Note, Product Version 16.2, November 2008 [17] Plato NanoRoute User‟s Guide, Version 2.5, Rev. D [19] SDF Timing Annotation, Product Version 14.2, January 2015