INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN ENGINEERING AND TECHNOLOGY (IJARET)

Similar documents
Design of 32 bit Parallel Prefix Adders

Implementation and Estimation of Delay, Power and Area for Parallel Prefix Adders

Design and Estimation of delay, power and area for Parallel prefix adders

Comparison of Parallel Prefix Adders Performance in an FPGA

A Novel Approach For Error Detection And Correction Using Prefix-Adders

Design of Low Power and High Speed Carry Select Adder Using Brent Kung Adder

Design of High Speed Vedic Multiplier Using Carry Select Adder with Brent Kung Adder

DEVISE AND INFERENCE OF DELAY, POWER AND AREA FOR ANALOGOUS PREFIX ADDERS

Comparative Analysis of Adders Parallel-Prefix Adder for Their Area, Delay and Power Consumption

Design of a High Speed Adder

Design of Carry Select Adder with Binary Excess Converter and Brent Kung Adder Using Verilog HDL

Implementation of 16-Bit Area Efficient Ling Carry Select Adder

Design of 16-Bit Adder Structures - Performance Comparison

Design of Carry Select Adder Using Brent Kung Adder and BEC Adder

REVIEW OF CARRY SELECT ADDER BY USING BRENT KUNG ADDER

16-BIT CARRY SELECT ADDER. Anushree Garg B.Tech Scholar, JVW, University, Rajasthan, India

FPGA Implementation of Efficient 16-Bit Parallel Prefix Kogge Stone Architecture for Convolution Applications Geetha.B 1 Ramachandra.A.

Design of Modified Low Power and High Speed Carry Select Adder Using Brent Kung Adder

Australian Journal of Basic and Applied Sciences. Performance Analysis of Different Types of Adder Using 3-Transistor XOR Gate

A COMPREHENSIVE SURVEY ON VARIOUS ADDERS AND ITS COMPACTION RESULT

Design of Low Power and High Speed Carry Select Adder Using Brent Kung Adder

Design of 64-bit hybrid carry select adder using CMOS 32nm Technology

DESIGN AND SIMULATION OF 4-BIT ADDERS USING LT-SPICE

Designing, simulation and layout of 6bit full adder in cadence software

Pareto Points in SRAM Design Using the Sleepy Stack Approach

utca mother board for FMC ADC daughter cards

Pareto Points in SRAM Design Using the Sleepy Stack Approach

FPGA-based Emotional Behavior Design for Pet Robot

Sleepy stack: a New Approach to Low Power VLSI Logic and Memory

Pareto Points in SRAM Design Using the Sleepy Stack Approach. Abstract

We recommend you cite the published version. The publisher s URL is

The City School. Learn Create Program

Nathan A. Thompson, Ph.D. Adjunct Faculty, University of Cincinnati Vice President, Assessment Systems Corporation

The ALife Zoo: cross-browser, platform-agnostic hosting of Artificial Life simulations

The Impact of Gigabit LTE Technologies on the User Experience

Simulation of the ASFA system in an ERTMS simulator

Effective Vaccine Management Initiative

An Esterel Virtual Machine (EVM) Aruchunan Vaseekaran

HCM 6: Highway Capacity Manual: A Guide for Multimodal Mobility Analysis

Applicability of Earn Value Management in Sri Lankan Construction Projects

MGL Avionics EFIS G2 and iefis. Guide to using the MGL RDAC CAN interface with the UL Power engines

Optimal Efficient Meta Heauristic Based Approch for Radial Distribution Network

Representation, Visualization and Querying of Sea Turtle Migrations Using the MLPQ Constraint Database System

Cat Swarm Optimization

Benjamin Wang. Lakshman One School of Engineering Science Simon Fraser University Burnaby, British Columbia V5A 1S6. RE: Letter of Transmittal

FCI LT LM UNDERGROUND

DIGITUS Network Cabinet Unique Series, 600, 800 mm width - 600, 800, 1000, 1200 mm depth

Improving AIBO with Artificial Intelligence Technique

THE EFIGENIA EJ-1B MOZART S/VTOL

Modeling and Control of Trawl Systems

Development of Design of Initial Cat Robot Model for the Use of Early Treatment of Children with Autism Spectrum Disorder (ASD)

Comparison of different methods to validate a dataset with producer-recorded health events

IHE Profile Proposal Anatomic Pathology Opinion Request (APOR) Paris June 1, 2012 E.Cordonnier, C.Daniel, F.Macary


Controllability of Complex Networks. Yang-Yu Liu, Jean-Jacques Slotine, Albert-Laszlo Barbasi Presented By Arindam Bhattacharya

LOW POWER PARALLEL PREFIX ADDER

Pet Selective Automated Food Dispenser

2. From where the latest Software Development Kit for the EVM can be downloaded?

2. From where the latest Software Development Kit for the EVM can be downloaded?

Turtle Ballet: Simulating Parallel Turtles in a Nonparallel LOGO Version. Erich Neuwirth

Half-Lives of Antibiotics

The EVM + AGILE Anthology

THE PIGEONHOLE PRINCIPLE AND ITS APPLICATIONS

Your web browser (Safari 7) is out of date. For more security, comfort and the best experience on this site: Update your browser Ignore

Training Cats and Dogs to Love Being Petted or Groomed*

Smart Automatic Cat Feeding Machine (SACFM) (sack-ff-mm)

Identity Management with Petname Systems. Md. Sadek Ferdous 28th May, 2009

Let s Play Poker: Effort and Software Security Risk Estimation in Software. Picture from

REPORT ON SCOTTISH EID TRIALS

ISL55185 EVM Getting Started

Recurrent neural network grammars. Slide credits: Chris Dyer, Adhiguna Kuncoro

Lecture 1: Turtle Graphics. the turtle and the crane and the swallow observe the time of their coming; Jeremiah 8:7

Antimicrobial Stewardship Northern Ireland

Workbook. Version 3. Created by G. Mullin and D. Carty

IQ Range. Electrical Data 3-Phase Power Supplies. Keeping the World Flowing

SEDAR31-DW30: Shrimp Fishery Bycatch Estimates for Gulf of Mexico Red Snapper, Brian Linton SEDAR-PW6-RD17. 1 May 2014

CONTINUING EDUCATION AND INCORPORATION OF THE ONE HEALTH CONCEPT

Effects of Cage Stocking Density on Feeding Behaviors of Group-Housed Laying Hens

Pixie-7P. Battery Connector Pixie-7P Fuse* Motor. 2.2 Attaching the Motor Leads. 1.0 Features of the Pixie-7P: Pixie-7P Batt Motor

[Boston March for Science 2017 photo Hendrik Strobelt]

Environmental awareness Automation Monitoring System for Training Dog Kennel

SYTLE FORMAL : The Online Dog Trainer In-Depth Review

GOOD GOVERNANCE AND OIE GUIDELINES FOR ANIMAL DISEASES CONTROL

Custom Software Solution

RECOMMENDATION ITU-R P ITU-R reference ionospheric characteristics *

Application of Fuzzy Logic in Automated Cow Status Monitoring

Creating an EHR-based Antimicrobial Stewardship Program Session #257, March 8, 2018 David Ratto M.D., Chief Medical Information Officer, Methodist

Practical Attacks against the MSP430 BSL

GAO Earned Value Management (EVM) Audit Findings

Improving RLP Performance by Differential Treatment of Frames

Lab 7. Evolution Lab. Name: General Introduction:

State Machines and Statecharts

OIE Digital Action Plan

Hours of manual cash counting reduced to 12 minutes. John G. Shedd Aquarium, USA

CONVENTION ON INTERNATIONAL TRADE IN ENDANGERED SPECIES OF WILD FAUNA AND FLORA

Optimizing use of quality antimicrobial medicines in humans

Biol 160: Lab 7. Modeling Evolution

A Biomimetic Haptic Sensor

Strengthening Epidemiology Capacity Using a One Health Framework in South Asia

Dynamic Programming for Linear Time Incremental Parsing

Transcription:

INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN ENGINEERING AND TECHNOLOGY (IJARET) International Journal of Advanced Research in Engineering and Technology (IJARET), ISSN 0976 ISSN 0976-6480 (Print) ISSN 0976-6499 (Online) Volume 3, Issue 2, July-December (2012), pp. 01-10 IAEME: www.iaeme.com/ijaret.html Journal Impact Factor (2012): 2.7078 (Calculated by GISI) www.jifactor.com IJARET I A E M E HIGH PERFORMANCE PARALLEL PREFIX ADDERS WITH FAST CARRY CHAIN LOGIC Anitha R 1, V Bagyaveereswaran 2 AP(Sr), SENSE, AP(Sr), SELECT VIT University, Vellore 632014, TamilNadu, India. 1 eranitharavi@gmail.com, 2 bagyaveereswaran@gmail.com, ABSTRACT Binary adders are the basic and vital element in the circuit designs. Prefix adders are the most efficient binary adders for ASIC implementation. But these advantages are not suitable for FPGA implementation because of CLBs and routing constraints on FPGA. This paper presents different types of parallel prefix adders and compares them with the Simple Adder. The adders are designed using Verilog HDL code and simulated and synthesized using Xilinx ISE13.2 software tool and Cadence RTL compiler. Among all the adders, Kogge-Stone adder provides better performance in ASIC implementation but it is not suitable for FPGA implementation. In order to make it suitable for FPGA implementation, Kogge-Stone adder is modified using fast carry logic technique. The modified adder provides better performance over the Simple adder for the higher order bit widths. Keywords- FPGA, Binary addition, Carry tree adders, Prefix computation, Prefix addition. I. INTRODUCTION Adders are the most important element in all the digital circuit design. Among the various types of adders, Carry Tree Adder which is also known as Parallel Prefix Adders provide high performance in terms of speed in ASIC implementation. The Carry Tree adder performs three stage operations such as Pre-computation stage, Prefixcomputation stage and Post-computation stage. The Pre-computation stage generates and propagates the carry signal, the Prefix computation stage generates the carry signal using prefix cells and the Post-computation stage generates the sum. This is shown in Fig.1 and the equation for these three stage addition operation is given in equation 1, 2 &3. Field Programmable Gate Array [10] offers low cost and less development time over ASIC implementation. In this paper, the most efficient adder i.e., the carry tree adder like Kogge-Stone, Brent-Kung, Ladner-Fischer and Han-Carlson adders are designed and implemented on FPGA. 1

International Journal of Advanced Research in Engineering and Technology (IJARET), ISSN 0976 The problems involved in FPGA implementation are investigated and the possible FPGA architecture which can make the Carry Tree Adder to provide high performance over the Simple adder is explored. The possible trade-offs like area, power, delay, interconnect count and fan-out involved in the adders are examined. Fig.1. Block Diagram of Prefix addition The three stages addition consists of the following computations: Pre-computation: G m:n =A n and B n, G 0 =c in ; P m:n =A n xor B n, P 0 =0;...(1) Prefix-computation: (G m, P m ) ο (G n, P n )= (G n:k + P n:k G k-1:n, P n:k P k-1:j ) (or) G m:n =G n:k + P n:k G k-1:n P m:n =P n:k P k-1:j...(2) Post-computation: S n =P n xor G n-1:0...(3) II. CARRY TREE ADDERS The various types of carry tree adders are shown in Fig.2. Each carry tree adder consists of three parts. They are: Upper part, Middle part, Lower part. Using these parts the carry tree adders computes N outputs from N inputs as shown in Fig.1. The Upper part generates and propagates the carry signal from the input to the prefix stage using the formula given in equation (1). The propagated and generated carry signals are combined using the associate operator ο. This operation is performed in the middle part using the formula given in equation (2). The Middle part consists of prefix cells such as black cells, grey cells and white buffers [1]. The arrangement of these prefix cells in different order results in various types of Carry Tree adders. where the carry signals need not to be propagated. Such operations are performed by grey cells. The grey cells generate the carry signal only.black cell generates and propagates the carry signal. There are some places The white buffers are used to reduce the loading effect for the further stages. The Lower part generates the overall sum using the formula given in equation (3). Depends on the arrangement of prefix cells, the carry tree adders involves in tradeoffs like area, power, delay, interconnect count, fan-out and logic depth [3 & 4]. Fig.2 (a) shows the Brent-Kung The dark black line in the figure indicates the critical path of the adder. The critical path for Han-Carlson and Kogge-Stone are less. So these two 2

International Journal of Advanced Research in Engineering and Technology (IJARET), ISSN 0976 adders are expected to be the fastest adder. The power utilized by all the Carry Tree Adder is more than the Simple Adder. (a)brent Kung (b) Kogge Stone minimum area and maximum logic depth. Due to the maximum logic depth, the delay of this adder is expected to be high. Fig.2 (b) shows the Kogge-Stone adder. adder. It is designed in such a way that it provides It provides maximum interconnect count and area but minimum logic depth and fan-out. Ladner-Fischer adder as shown in Fig.2 (c) provides minimum logic depth with improved area. Han-Carlson adder as shown in Fig. 2(d) provides minimum logic depth and minimum interconnect count. 3

(c) Ladner Fischer (d)han Carlson Fig.2. (a-f) Carry Tree adders Simple adder is designed using Verilog HDL + operator. The carry chain structure on FPGA makes Simple Adder to provide high performance. But this is not an efficient adder for VLSI implementation. In this paper, Carry Tree Adder is compared with Simple Adder for both ASIC and FPGA implementation. III. RELATED RESEARCH AND PROPOSED WORK The different types of carry tree adders are discussed in [4]. In [5], the authors implemented different types of adders like Simple Adder, Carry Look Ahead Adder, Carry Skip Adder, and Carry Select Adder on the Virtex2 FPGAs and found that the Simple Adder provides better performance. In [3], the authors discussed various parallel prefix networks design and implementation on a Xilinx Virtex5 FPGA. It is observed that the Simple Adder provides better performance over the prefix networks for the bit widths up to 256 bits. This is due to the advantage of the carry chain structure on the FPGA. All these works by different authors shows that the simple adder provides better performance on FPGA. The area, delay results for these works depend upon synthesis reports. In [2], the authors described several Carry Tree Adders implemented on a Xilinx Spartan3E FPGA. It is found that the Kogge Stone Carry Tree Adder provide better delay performance for the higher order bits. The results obtained for this paper is similar to those presented in [2]. 4

Carry Tree Adders are designed, coded, simulated and synthesized and then it is compared with the Simple Adder. The obtained area, power, delay results of various Carry Tree Adders are compared with each other and also with the Simple Adder. Among all the Carry Tree Adders, Kogge-Stone Adder and Han-Carlson Adder is expected to be the fastest adder in ASIC implementation but not in FPGA implementation. In this paper, Kogge-Stone Adder is taken, since it is having minimum fan-out and logic depth than Han-Carlson Adder, and modified using Fast Carry Logic technique in order to make it suitable for FPGA implementation [6, 7, 8 & 9]. The addition operation performed by Simple Adder, which is generated by synthesis tool, is shown in Fig.3 (a). From Fig.3 (a), it is clear that the Prefix-computation stage of the Simple Adder uses multiplexers. Similarly, the Prefix-computation stage of Carry Tree Adder is replaced with the Fast Carry logic technique which uses muxes as shown in Fig.3 (b). The Fast Carry Logic architecture for 4- bit addition is shown in Fig.3(c). Instead of using Black cells, Grey cells and White buffers to propagate and generate the carry signals, simple muxes are used. The blocks present in Fast Carry Logic technique also uses muxes. The input to the Fast Carry Logic is the propagated and generated carry signal of the Pre-computation stage. The Pre-computation and Postcomputation of the modified adder is similar to that of the normal carry tree adders. IV. RESULTS The delay, power and cell area results obtained by synthesizing the designed adders for 128bits using Cadence RTL compiler (90nm technology) is shown in Table 1, 2 & 3. The abbreviations used in the table are: KS for the Kogge Stone Adder, BK for the Brent Kung Adder, LF for the Ladner Fischer Adder and HC for Han Carlson Adder. Fig.3(a)SimpleAdder 5

Fig.3(b)Carry Tree Adder Fig.3(c) Fast Carry Logic for 4-bit Carry tree addition The delay is measured in terms of nanoseconds, power in terms of nanowatt. From the results it is found that the Carry Tree Adders provide best delay performance than the Simple adder. Among the Carry Tree Adders, Kogge-Stone Adder and Han-Carlson Adder provide best delay as it is expected but the area and power utilized by those adders are more. Comparatively, Brent-Kung Adder and Ladner-Fischer Adder utilizes less area and power. 6

Table 1: Delay Results of Carry Tree Adders compared with Simple Adder Table2: Power Results of Carry Tree Adders compared with Simple Adder Table3: Cell Area Results of Carry Tree Adders compared with Simple Adder Fig.4 shows the simulated delay results of the adders for the bit widths up to 128bits using Xilinx ISE13.2 software tool. From the Fig.4, it is found that the Simple Adder provide best delay performance over the Carry Tree Adder. The obtained delay result is entirely different from the result shown in Table 1. This is because of the presence of Fast Carry chain structure on Xilinx FPGA. Among the Carry Tree adders, Kogge-Stone Adder provides best delay as it is expected. 7

Fig.5(a-d) shows the delay results of Kogge-Stone Adder, Kogge-Stone Modified Adder and Simple Adder for the FPGA families like Spartan-3E, Virtex-4, Virtex-5 and Virtex-6 Lower power. Some of the 64-bit adder structure cannot be fitted in to all the devices under this family. Fig. 4 Simulated Delay Results of Carry Tree Adders compared with Simple Adder Depends on the adder structure, the device and package has been selected. From the Fig it is found that, for Spartan-3E FPGA, Kogge-Stone adder provide best performance after it reaches 256 bits whereas Modified adder provides best performance after it reaches 128bits, for Virtex-4 FPGA, Kogge-Stone adder provides best performance after itreaches 128bits whereas Modified adder provides best performance from 128bits, for Virtex-5 FPGA, Kogge-Stone adder provides best performance after it reaches 256bits whereas Modified adder provides best performance from 128bits, for Virtex-6 FPGA, it is able to reduce the delay of Carry Tree Adder but Simple Adder provide better delay performance. (fig. 5.a) 8

V. CONCLUSION This paper presents different types of Carry Tree Adders. Kogge Stone adder is the fastest carry tree adder in VLSI implementation but it provides different result for the FPGA implementation. (Fig.5.b) In order to make it suitable for FPGA implementation, the prefix computation stage is modified using Fast Carry Logic. (Fig.5.c) 9

(Fig.5.d) Fig. 5(a-d) Simulated Delay results The obtained delay results are compared with the Simple Adder for the various FPGA devices like Spartan3E, Virtex4, Virtex5, Lower power Virtex6. The Lower power Virtex6 FPGA provides best delay compared to that of all the FPGA devices. By using carry logic technique the Carry Tree Adders are able to provide better delay performance on FPGA over the Simple Adder for the higher order bit widths. REFERENCES 1. N. H. E. Weste and D. Harris, CMOS VLSI Design, 4 th edition, Pearson Addison- Wesley, 2011. 2. FPGAs David H. K. Hoe, Chris Martinez and Sri Jyothsna Vundavalli, Design and Characterization of Parallel Prefix Adders, IEEE 43rd Southeastern Symposium on system theory, March 2011. 3. K. Vitoroulis and A. J. Al-Khalili, Performance of Parallel Prefix Adders Implemented with FPGA technology, IEEE Northeast Workshop on Circuits and Systems, pp. 498-501, Aug. 2007. 4. D. Harris, A Taxonomy of Parallel Prefix Networks, in Proc. 37th Asilomar Conf. Signals Systems and Computers, pp. 2213 7, 2003. 5. M. Becvar and P. Stukjunger, Fixed-Point Arithmetic in FPGA, Acta Polytechnica, vol. 45, no. 2, pp. 67-72, 2005. 6. Spartan-3E Generation FPGA User Guide, Xilinx Inc., June 2011, UG331 (v1.8): http://www.xilinx.com/support/documentation/user_guides/ug331.pdf 7. Virtex-4 FPGA User Guide, Xilinx Inc., Dec 2008, UG070 (V2.6):http://www.xilinx.com/support/documentation/user_guides/ug070.pdf. 8. Virtex-5 FPGA User Guide, Xilinx Inc., May 2010, UG190 (V5.3): http://www.xilinx.com/support/documentation/userguides/ug190.pdf. 9. Virtex-6 FPGA User Guide, Xilinx Inc., Sep. 2009, UG364 (v1.1): http://www.xilinx.com/support/documentation/user guides/ug364.pdf. 10. www.xilinx.com 10