Continue to Site

Welcome to EDAboard.com

Welcome to our site! EDAboard.com is an international Electronics Discussion Forum focused on EDA software, circuits, schematics, books, theory, papers, asic, pld, 8051, DSP, Network, RF, Analog Design, PCB, Service Manuals... and a whole lot more! To participate you need to register. Registration is free. Click here to register now.

Division in xilinx ultrascale+ using DSP

Status
Not open for further replies.

beginner_EDA

Full Member level 4
Joined
Aug 14, 2013
Messages
191
Helped
0
Reputation
0
Reaction score
0
Trophy points
1,296
Activity points
3,854
Hi,
I have a design running at 125 MHz (8ns Pulse) and I do a large number division (125000000/115200) in system Verilog and it consumes 43 LUTs and 130 CARRY8 and have 34.087ns data path delay and that leading to -26.233ns setup timing violation. I would like to do it using dsp/ip core, if it helps to pass timing.
I saw a documentation here:
but I am not sure if it is the write one for this purpose.
 

Are both the dividend and the divisor constants? If so, why doing the division in logic and not at compile time?
If using logic, do you need to do the division in one clock cycle? If not, do it in a simple state machine (long division).
 

Why do you need to complete the division in a single clock - why cant it be pipelined?
I don't need it in a single clock and can be pipelined. However the requirement is not to use Xilinx IP(divider) but just DSP blocks. Can you point me to such design from Xilinx if you already know?
 

I don't need it in a single clock and can be pipelined. However the requirement is not to use Xilinx IP(divider) but just DSP blocks. Can you point me to such design from Xilinx if you already know?
You probably don't gain anything by using one or more DSP blocks for this. Do a simple state machine in "pure" VHDL/verilog instead.
 

Is there any reason for not using a Xilinx divider IP? Also any reason you cannot use a LUT?
 

Is there any reason for not using a Xilinx divider IP? Also any reason you cannot use a LUT?
IP -> complexity reason. LUT can be used. How to use LUTs for the division in pipelined format that passed timing?
--- Updated ---

You probably don't gain anything by using one or more DSP blocks for this. Do a simple state machine in "pure" VHDL/verilog instead.
can you give me some hints how to write Verilog for division. I saw some radix 2 algorithm but its seems to be not easy.
 

Non-restoring division algorithm is much simpler than a radix-2 division. All it uses is addition and subtraction. You probably don't even need to write it yourself, you can probably find code for it if you do a search.

Here is an easy to follow description.
https://www.geeksforgeeks.org/non-restoring-division-unsigned-integer/

It's not something you can pipeline so you have to clock it as many times as there are bits in one of the operands (don't recall which and too lazy to look it up ;)) to get one result.
 

@ads-ee Hi, is it allowed to post on this forum the instantiation code of only DSP block I see under ultrascale+ device "DSP48E2" ? I have some question about setting some parameters of it.

Furthermore If I do addition (A<=A+M) or subtraction(A<=A-M), how I know if synthesis tool/vivado utilize LUTs or DSP block (e.g. DSP48E2) for this? DO I have to set additionally something in synthesis tool/vivado to say use DSP block and not LUTs?
or I must have to instantiate DSP48E2 and configured accordingly, if I want to use DSP block in ultrasclae+ devices for this purpose.

I am asking because I am not sure even 32 bit addition and subtraction for the algorithm you mentioned can lead to timing violation which I have.
 
Last edited:

Furthermore If I do addition (A<=A+M) or subtraction(A<=A-M), how I know if synthesis tool/vivado utilize LUTs or DSP block (e.g. DSP48E2) for this? DO I have to set additionally something in synthesis tool/vivado to say use DSP block and not LUTs?
or I must have to instantiate DSP48E2 and configured accordingly, if I want to use DSP block in ultrasclae+ devices for this purpose.

I am asking because I am not sure even 32 bit addition and subtraction for the algorithm you mentioned can lead to timing violation which I have.
If you want to use the DSP block for addition/subtraction, either instantiate it or look in the documentation if there is a way to infer it.
If 32-bit LUT operations give you timing violations, you can create a strobe every second/third clock cycle for the operations, and set the corresponding timing constraints as "multi-cycle". You have a lot of options for fixing timing violations when the throughput isn't important.
 

@ads-ee Hi, is it allowed to post on this forum the instantiation code of only DSP block I see under ultrascale+ device "DSP48E2" ? I have some question about setting some parameters of it.
I don't see why it would be a problem. Go ahead and post it if you want help with the settings.
 

Status
Not open for further replies.

Part and Inventory Search

Welcome to EDABoard.com

Sponsor

Back
Top