UPC EETAC Bachelor's Degree in Telecommunications Systems and in Network Engineering EEL

Lab 3

Laboratory

Lab 4: propagation delay and speed: Adder_1bit, Adder_16bit (RC), Adder_16bit (CLA), Prototype

This is the highlighted Int_add_subt_8bit [P4] reference design to study propagation delay using timing analyser and VHDL gate-level simulations

Lab 5

[17/10]

This is the post lab assignment PLA4 to be submitted next Lab 5 session. Study in detail and execute in your computer this lab tutorial and the reference design before attempting to solve the PLA.

1.5.5.1. Gate-level simulation: propagation delay measurement

How to measure circuit's propagation time for a given target chip: post-synthesis model in VHDL (VHO file and its associated SDO/SDF delay file) and gate-level VHDL simulation.

1.5.5.2. Timing analyser spreadsheet tool

Measuring the worst-case scenario: longest propagation delay (tP).

1.5.5.3. Calculating circuit's maximum speed (fMAX).

In the lab we have some commercial CPLD and FPGA target chips where to synthesise our circuits. For instance:

  CPLD FPGA
Xilinx XC2C256-TQ144 - 7 Spartan-3E XC3S500E-FG320
Intel MAX II EPM2210F324C3 (option #1) Cyclone IV EP4CE115F29C7 (option #2)
Lattice ispMach4128V TQFP100 MachXO

NOTE: Quartus Prime does not generate delay files (sdo) for Intel MAX 10 devices, and thus we cannot practise gate-level simulations in ModelSim for this family.

 

1. Specifications Planning Developing Functional test Gate-level test Report Prototype

Let us continue Adder_1bit based on MoM from Lab3 measuring propagation time of signals in a given transition and maximum speed of computing.

Two new tools will be presented:

The objective is to find what target chip is faster when implementing the same project Adder_1bit:

Firstly, we will solve the project for chip option #1 and annotate results. Secondly, keeping the same project, we will change the target chip to option #2 and annotate results. We will discuss solutions.

 


Specifications 2. Planning          
    3. Developing        
      4. Functional test Gate-level test Report Prototype

The Adder_1bit specifications and also the three next sections #2 -#3 -#4 are already implemented in the previous Lab3. Our circuit is running correctly and we can visualise functional simulations. This, we can start a new Quartus Prime project copying the source VHDL files (Adder_bit.vhd, MUX_2.vhd, MUX_4.vhd) and the testbench files (Adder_1bit_tb.vhd, wave.do) from Lab3 to this new location:

C:\CSD\P4\Adder_1bit_MoM\(files)

Circuit plan as in Lab3

Fig. 1. The plan for inventing the Adder_1bit is the same from LAB3

Select the  option #1 MAX II target device and synthesise the circuit.

Files

Fig. 2. Plan C2 source files from Lab3.

Be sure that you check the RTL and technology view as you did in Lab3 Fig. 6 and Fig. 7

Check that you've got the same functional test results as in  Lab3  and Fig. 10  when using the same VHDL testbench fixture Adder_1bit_tb.vhd. Use the same wave.do for setting up the signals of interest in the wave diagram.

 


Specifications Planning Developing Functional test 5. Gate-level test Report Prototype

Gate-level VHDL simulation of the technology schematic

Let us use the new tools that will allow us to characterise better the real circuit implemented in the FPGA. This is our tutorial on gate-level simulation and the timing analyser tool in case you needed it even before running this tutorial.  

Set the following project parameters before re-synthesising your project and be able to generate the necessary VHO and SDO files for the target chip:

Settings

Properties

 netlist generation

Fig. 4. Parameters for letting Quartus Prime generate delay files.

And now re-synthesise your project to generate in Quartus Prime the VHDL translation of the technology circuit (Adder_1bit.vho) and its delay file (Adder_1bit_vhd.sdo).

generated files

Fig. 5. Indications for generating the VHDL technology circuit translation (vho) along with its delay file (sdo).

Now you are ready for starting a ModelSim gate-level simulation for this flat circuit.

project creation

Fig. 6. Create a new ModelSim project.

Add the same testbench that you also have copied from Lab3 to the new location:

testbench

Technology file

Fig. 7. Add the same testbench and  flat structure

Compile all and check the project's integrity. Therefore, hierarchical structures formed by multiple VHDL files are replaced by a single flat VHO file to be tested using the same testbench.

files compiled

Fig. 8. Check the project integrity.

Start a new simulation paying attention this time to both "Design" and "SDF" tabs:

New simulation

Fig. 9.

Attach the standard delay file to the region of interest:

DSF

Fig. 10. The region where to apply the SDO file is the instance i1 (the unit-under-test).

Run and check that the full wave is the same as it was in functional simulations.

Full span

Fig. 11. Full view of the simulation results.

Zoom at a given signal transition to measure propagation delays using two cursors.

results

Fig. 12. In this example transition, both outputs changes from "10" to "01" after 6.17 ns.

 


Timing anayser tool: longest propagation time and maximum speed of operation

Go back to Quartus Prime to find the largest propagation delay using the timing analyser spreadsheet tool.

Timing analyser

Fig. 13. Starting processing the timing analyser tool.

View results in the spreadsheet.

Tool timing analyser

Fig. 14. View results using the timing analyser tool.

 

Tpmax

Fig. 15. Spreadsheet from datasheet report.

And so, calculate the maximum frequency of operation for this specific target chip MAX II (L4.3):

max fequency of operation


And now, you can redo the project (at the same location), changing the target chip and compare how this Addder_1bit is performing for an  option #2 Cyclone IV target chip.

Cyclone IV

Fig. 16. Select a Cyclone IV device FPGA (Field Programmable Gate Array).

Check that the files of interest are correctly generated for this new target chip. Indeed here you can find several simulation models (fast, slow, etc.):

Simulation files for Cyclone IV

Fig. 17. For this chip several delay models are generated.

Recompile your ModelSim project and write down results:

Waveform results

Fig. 18. Cyclone IV measurements are slightly different at the same transition.

And, using the timing analyser, we can observe a longer propagation delay than in MAX II.

MAX FREQ

Fig. 19. Cyclone IV measurements are slightly different at the same transition.

And thus, the maximum frequency of operation for this Cyclone IV is slower that MAXII.

 


Specifications Planning Developing Functional test Gate-level test 6. Report Prototype

Follow this rubric for writing reports.

 



Lab3

Laboratory

Lab 4.Propagation delay and speed. Adder_1bit - Adder_16bit (RC) - Adder_16bit (CLA) - Prototype

Comparing circuit speed and resources in ripple-carry (RC) and carry-lookahead (CLA) architectures

Lab5

[17/10]

1.9.1.2. n-bit adders  [L3.2]

1.9.1.2.1. Ripple-carry adder: Adder_4bit, Adder_8bit [Lab 3], Adder_16bit [Lab 4],(option #1)

1.9.1.2.2. Carry-lookahead adder: Adder_4bit, Adder_16bit [Lab 4], (option #2)

In the project above, you have seen how is implemented the same design in two target chips. Now we have in mind solving the same project Adder_16bit using two different architectures synthesised for the same target chip. Thus, learning basic concepts on circuit optimisation. You will observe what is the difference between ripple carry and carry-lookahead adders, which one is faster and why?

1. Specifications Planning Developing Functional test Gate-level test Report Prototype

Option #1: Design an Adder_16bit using ripple-carry technique (RC). 

Symbol Waveforms

Fig. 1. Symbol and example waveforms.

Here we rely on the work done when designing the tutorial Adder_4bit ripple-carry (RC). Zero (Z) flag will not be implemented, it simply adds another level of gates and is not necessary for the purpose of comparing with the Adder_16bit CLA presented below as the third project in this lab session.

 


Specifications 2. Planning Developing Functional test Gate-level test Report Prototype

Fig. 2 shows the ripple-carry architecture. Remember that we already have designed Adder_8bit component in Lab3 using the same carry chain strategy.

Architecture ripple carry

Fig. 2. Adder_16bit architecture.

Project location:

C:\CSD\P4\Adder_16bit_RC\(files)

 


Specifications Planning 3. Developing Functional test Gate-level test Report Prototype

Let us pick up a MAX II EPM2210F324C3 target chip.

VHDL file translation of the architecture in Fig. 2: Adder_16bit.vhd.

Components files: Adder_8bit.vhd , Adder_4bit.vhd and Adder_1bit.vhd (plan A or plan B) can be found in Lab3.

files
RTl view

Fig. 3. RTL view and all the files requires in this design.

Technology view and project summary shows that only 32 logic elements are used for synthesising this project, saving much hardware with respect the CLA implementation below proposed in the next project.

Summary
Technology

Fig. 4.Technology view. Each Adder_1bit is implemented in two logic elements. To make it simple and comparable to classic technologies, we can imagine that the Adder_1bit is solved using 3 levels of gates.

 


Specifications Planning Developing 4. Functional test Gate-level test Report Prototype

This is the translation of the testbench fixture and some signal activity proposed in Fig. 1: Adder_16bit_tb.vhd. It can be used for both, functional and gate-level simulations.

Fixture testbench

Fig. 5. VHDL testbench fixture. The UUT is described as a hierarchical VHDL project when performing a functional simulation, and as a flat technology circuit when solving the gate-level simulation.

The ideal functional results in this design step #4 must be identical for both projects of the same entity: ripple-carry and carry-lookahead Adder_16bit.

Functional results

Fig. 6. Functional simulation results.

 


Specifications Planning Developing Functional test 5. Gate-level test Report Prototype

Running a gate-level simulation for this ripple-carry Adder_16bit, for instance, at the transition highlighted in Fig.1 where all bits have to change, we obtain an accumulated propagation delay of tP = 22.6 ns, practically doubling the one produced by the CLA adder in the project below.

Gate_level results

Fig. 7. Gate-level simulation results at a particular transition.

When running the timing analyser, the maximum delay occurs when driving A(0) and waiting results at Cout. This is tP= 22.7 ns, allowing a maximum frequency of operations of 22 Mops. (unsigned radix-2 16-bit millions operations per second).

Longest propagation time

Fig. 8. Timing analyser spreadsheet.

Furthermore, at this point the answer to the question: which is the minimum value for Min_Pulse for experimentation in the laboratory becomes straightforward: Min_Pulse > tP. Min_Pulse smaller that the propagation delay of the circuit will imply that the circuit is going to be switching continuously, never being able to reach stable output values. We can verify that using this set of vectors Adder_16bit_tb.vhd as represented in Fig. 9 (Min_Pulse = 25 ns) and Fig. 10  (Min_Pulse = 20 ns).

Min_Pulse at the limit

Fig. 9. Min_Pulse is 25 ns, practically on the limit. 

 

Min_Pulse < tp

Fig. 10. Min_Pulse is 20 ns < tP. Thus, the outputs never settles to a valid result.

It is time to compare results with the new carry-lookahead design proposed in the next project.

 


Specifications Planning Developing Functional test Gate-level test 6. Report Prototype

Follow this rubric for writing reports.

 




Lab3

Laboratory

Lab 4.Propagation delay and speed. Adder_1bit - Adder_16bit (RC) - Adder_16bit (CLA) - Prototype

Comparing circuit speed and resources in ripple-carry (RC) and carry-lookahead (CLA) architectures

Lab5

[17/10]

1.9.1.2. n-bit adders

1.9.1.2.1. Ripple-carry adder: Adder_4bit, Adder_8bit [Lab 3], Adder_16bit [Lab 4], (option #1)

1.9.1.2.2. Carry-lookahead adder: Adder_4bit, Adder_16bit [Lab 4],(option #2)

 

 Let us complete this lab tutorial considering the enhanced architecture based on carry-lookahead adders. You will observe what is the difference between ripple carry and carry-lookahead adders, which one is faster and why?

 

1. Specifications Planning Developing Functional test Gate-level test Report Prototype

Option #2: Design an Adder_16bit using the carry-lookahead (CLA) technique. The additional signals in this symbol P (propagator) and G (generator) are outputs required for chaining larger adders using the same CLA architecture.

The truth table and timing diagram are the same stated in Adder_4bit CLA considering now radix-2 numbers ranging up to 65535 = "1111111111111111".

Symbol Waveforms

Fig. 1. Symbol and waveforms.

Here we rely on the work done when designing the tutorial Adder_4bit CLA. For instance, read in Wikipedia how an Adder_16bit works and how it is possible to calculate all carries beforehand. This reference also explains how to chain carry generators: Ercegovac, M., Lang, T., Moreno, J. H., "Introduction to Digital Systems", John Wiley & Sons, 1999). It includes slides: Chapter 10 is on arithmetic circuits.

 


Specifications 2. Planning Developing Functional test Gate-level test Report Prototype

Fig. 2 shows the picture from Wikipedia that is adapted in CSD as Fig 3.

Adder_16bit using carry generator

Fig. 2. Adder_16bit architecture proposed in Wikipedia to be written in CSD as VHDL file.

The Adder_16bit has the same structure than the circuit designed in Adder_4bit, as shown in Fig. 3.

Arquitecture

Fig. 3. Planning the Adder_16bit. How many gate-levels are expected in this circuit?

Project location:

C:\CSD\P4\Adder_16bit_CLA\(files)

 


Specifications Planning 3. Developing Functional test Gate-level test Report Prototype

We will pick up the same Intel MAX II EPM2210F324C3 target chip. Thus, the same chip will implement the same entity but with two alternative architectures.

VHDL file translation of the architecture in Fig. 3: Adder_16bit.vhd.

Components Carry_generator.vhd and Adder_4bit.vhd can be found in tutorial Adder_4bit.

Start a VHDL synthesis project and observe the schematics. This time the architecture is far more complex, compromising up to 71 logic elements; more chip logic resources with the aim of obtaining a faster circuit.

Schematic RTL

Fig. 4. RTL view.

The architecture uses up to five Carry_generator circuits organised in two levels (one carry_generator in each Adder_4bit module, and another one in the top entity). However, the largest number of logic gates levels is only six.

Technology implementation

Fig. 5. Technology view.

Chip planner tool allows us to map exactly where all the resources in use are located.

Chip planner and logic elements in use

Fig. 6. MAX II chip occupation from chip planner tool.

 


Specifications Planning Developing 4. Functional test Gate-level test Report Prototype

Now, it is time to test the synthesised circuit above. Thus, we can use the same VHDL fixture represented in the Fig. 5 of the previous project Adder_16bit RC.

Example of testbench Adder_16bit_tb.vhd from which to copy the stimulus signals. We are expecting the same ideal functional results represented in Fig. 7 when applying the same input vectors.

Functional results

Fig. 7. Functional results.

 


Specifications Planning Developing Functional test 5. Gate-level test Report Prototype

Now, when running a gate-level simulation at the same transition, a shorter propagation delay is obtained. Only 12.6 ns (saving 10 ns from the same design above based on ripple-carry).

Gate_level results for the CLA adder

Fig. 8. Gate-level simulation.

And, finally, measuring the longest propagation delay using the timer analyser tool, we can deduce the maximum circuit's speed. The longest signal path is established between input A(6) and output S(11), tP =  14.3 ns. This circuit is capable of performing 35 Mops. Thus, an improved architecture using more hardware (logic elements) generates a faster circuit.

The drawback is that the dynamic power consumption will be higher than the same circuit based on the RC technique. At this CSD introductory level, and from the perspective of classic technologies, we can say that the static power consumption will be larger because there are more gates connected to the power supply, and the dynamic power consumption will also be larger because there are many more logic gates switching.

Timing analyser

Fig. 9. Timing analyser spreadsheet showing CLA adder results.

 


Specifications Planning Developing Functional test Gate-level test 6. Report Prototype

Follow this rubric for writing reports.

 


Conclusions

These three projects have been used for learning new tools: gate-level simulation in ModelSim and Quartus Prime timing analyser, and for demonstrating how digital technologies, gate switching (operating speed) and logic resources (logic elements) are linked. We can imagine that when more resources are used the circuit is more power demanding. You can balance speed and power consumption choosing alternative designs.

 





Lab3

Laboratory

Lab 4: propagation delay and speed. Adder_1bit - Adder_16bit RC - Adder_16bit CLA - Prototype

[P4] ALU_9bit prototype using the DE10-Lite board

Lab5

[17/10]

7. Designing an FPGA prototype

The next design flow steps: Choose an FPGA prototyping board, design an extension PCB board to place more inputs and outputs, assign pins, configure the chip, run and perform measurements.

Prototype specifications Planning Development Test & measurements

To practise with ALU we play with the prototype sketched in Fig. 1, an ALU_9bit capable of performing eight logic and arithmetic operations in a DE10-Lite board populated with an Intel MAX10 FPGA chip.

Symbol and operations

Fig. 1. ALU_9bit symbol and projected operations.

The DE10-Lite board switches can be used for inputting operand A. The six 7-segment displays can be used for representing results. In this way, we can populate a PCB connected through the 40-pin general purpose input-output connector GPIO for capturing the operand B, representing the full binary result in a LED array and lighting the flag indicators. In Chapter II, we can run the same application replacing the switches by a 16-key matrix keypad FSM as designed in P6 to input operands and operations sequentially as usual in a calculator.

Prototype using DE10-Lite

Fig. 2. DE10-Lite and PCB prototype.

 


Prototype specifications Planning Development Test & measurements

Plan C2 is required here to get an idea on the components involved and how they will be connected. To review the theory and how components work, we can get inspiration studying the circuits proposed in D1.18. Fig, 3 shows hierarchical schematic.

Logic operations are solve by gate blocks. The Adder_9bit used for both, adding and subtracting , is proposed in the final Annex A1. The integer multiplier is designed in this tutorial Int_Mult_9bit. Several multiplexers select the operands and additional logic circuits generate and enable status flags.

 

Internal plan C2 architecture

Fig. 3a. Proposed ALU internal design.

BCD Display logic and flag enabling circuits

Fig. 3b. Additional circuits for using the 7-segment displays and for enabling the flags.

The Chip9 Bin_BCD_16bit converter for representing results in BCD will be connected to the signal Y(15..0) when performing multiplications (OP = 7), or to the signal L(8..0) when OP = 4, 5, 6. The display will be kept blank for logic bitwise operations (OP = 0, 1, 2, 3).

Thus, the full list of FPGA input and output connections is represented in Fig. 4.

Inputs and otputs assignments

Fig. 4. FPGA connections to be used in the pin assignment tool. Board LED and other unused segments will be blanked.

 


Prototype specifications Planning Development Test & measurements

FPGA circuit

We can translate into VHDL the top schematic ALU_9bit.vhd in Fig. 3a and Fig. 3b, synthesise and test the FPGA using Quartus Prime and ModelSim respectively.

These are the multiplexers used to select operands and operations: Nonuple_MUX_4.vhd, Nonuple_MUX_2.vhd, MUX_2.vhd , Quad_MUX_2.vhd, Sexdecuple_MUX_2.vhd. All of them are implemented in a single file adapting the typical plan B Dual_MUX_4.

Annex A1 shows how to design a carry-lookahead (CLA) version of the  Adder_9bit.

Int_Mult_9bit tutorial shows how to implement a 9-bit multiplier for integer numbers as required in Chip4.

Detail on the Int_Mult-9bit

Fig. 5. Detail on the Int_Mult_9bit RTL.

Bin_BCD_16bit shows how to design the converter from radix-2 to BCD for a size of 16-bit. You will find its top entity and its component block DM74185.

Chip9 on implementing a two's complement Two_C_9bit can be found in Int_Mult_9bit tutorial. 

We can continue adding the code converters to represent the result in the six 7-segment displays. The idea is that the sign is represented as a hyphen '-' when the result is negative in the display number 5  (HEX5_L(6)). This is the decoder hexadecimal to 7-segments Dec_Hex_7seg and its block Hex_7seg_decoder that can be either plan A or plan B

All the VHDL files required in this hierarchical modular prototype: ALU_9bit_top.zip.

We observe how the project summary shows the use of a dedicated multiplier (one of the 288 available). This is given you an initial idea of the size and complexity of the circuits that can be engineered to fit in this target chip.

Project summary and propagation delay

Fig. 6. Project summary and propagation delay.

The circuit uses 483 logic elements and 1 embedded multiplier. Switching A(0) and waiting for the HEX2_L(5) output function takes the longest signal path, a worst case scenario propagation delay of tP = 35.24 ns. If we attach A(0) to a signal generator the maximum signal frequency is fmax < 14.18 MHz).

 


Printed circuit board (PCB)

As indicated above, in this experiment we can use the board's switches to input A(8..0). The other operant B(8..0), the operation selector OP(2..0) and the 5 status flags (N, OV, C, Z, EQ) will be placed in an auxiliary PCB connected through the 40-pin GPIO as shown in Fig.4. 

We have to capture the schematic in KiCad schematic as shown in Fig. 7. LED bars and resistors packs can be used to save board space.

Image KiCad schematic captured

Fig. 7. Schematic captured in KiCad.

The PCB component placement is shown in Fig 8.

Image silkscreen view

Fig. 8. PCB layout silkscreen that shows component placement and references.

Finally, a 3D view shows the real dimensions and how the PCB will look like once soldered.

Image 3D view

Fig. 9. PCB 3D representation.

Thus, once the hardware circuit for the PCB is conceived, we organise a KiCad project CSD_LAB4_PCB_v2.zip to be able to manufacture the board. Each component contains the symbol, the footprint, and a link to the datasheet to prepare the bill of materials (BOM) spreadsheet.

NOTE: Current CSD and DEE KiCad symbols, footprints and 3D tuned components are available in these three libraries: symbols.zip, footprints.zip and 3dmodels.zip, to be unzipped and placed in the corresponding 3rdparty user directory. Basically, in this introductory level the idea behind tunnig components is to enlarge their pads for easy soldering.

Fig. 10 shows the process of placing and soldering components on the board once manufactured. 

Board components placing
Final manufactured board

Fig. 10. The idea of placing and soldering components on the PCB. The final manufactured board with the silkscreen layer sesialised.

 


Prototype specifications Planning Development Test & measurements

It is convenient to perform a functional simulation to check that the circuit operates correctly before configuring the target chip with the programmer application. This is a testbench ALU_9bit_tb.vhd adapted from P4. This is the wave.do setup as shown in Fig. 11.

Example operations

Fig. 11. ModelSim simulations. Be aware that for arithmetic operations 4, 5, and 6, the result of interest is Q(8..0) or R(8..0).

The top schematic ALU_9bit_top.vhd will be used to leave unconnected R16 and R17 outputs and also to disable (ground) all the DE10-Lite LED.

Top design specified for the DE10-Lite board

Fig. 12. Top design ready for pin assignment and for prototyping. This top design allows us to add to the ALU other chip components for future experiments, such as FSM, CLK generator, data registers, etc.

The pin assignment file considering Fig. 4 and the De10-Lite user manual to be imported to the project is ALU_9bit_prj.csv.

This is the final FPGA ALU_9bit.sof configuration file to be used by the Quartus Prime programmer. In case we like to write the configuration flash memory (CFM) of the FPGA and make the circuit permanent (not erased when unplugging the board), the programmer object file (.pof) is required: ALU_9bit.pof

The final board prototype performing an operation is pictured in Fig. 13 connected to the DE10-Lite. 

Prototype running

Fig. 13. Prototype connecting the inputs and outputs board to the DE10-Lite training board ready for running the truth table. An example of signed multiplication is visualised.

We can perform laboratory measurements using the compact VB8012 instrument.

Propagation delay measurement

Let us use the adder that has available the C (carry out) output connected to a test pin (TP_C1). Prepare two convenient values for A and B operands, for instance A = 255 = "011111111" and B = 256 / 257  = "100000000" /  "100000001" that switch all the result outputs when B(0) toggles from '0' (R = 511; C = '0' , R(8..0) = "111111111") to '1' (R = 512; C = '1' , R(8..0) = "000000000"). In this way we can visualise the adder longest propagation path from B(0) to C in the logic analyser and oscilloscope.

Instrumentation setup

Fig. 14. Driving B(0) with a square wave from the function generator. Other output signals can be reassigned in the same way to measure for example the multiplier propagation delay.

The only hardware modification required is to reassign B(0) to the function generator square wave (using the push-button and  socket RTN_L input  (W5 pin). Let us use the test point header (TP_C1) in the output the C flag to visualise the signal. In this way we can measure the propagation delay using cursors and the two oscilloscope channels and also two channels of the logic analyser. Compare and discuss your lab results with measurements from Quartus Prime timing analyser.

instrumentation

Fig. 15. Instrument and circuit setup for measuring the longest propagation path. The circuit can be powered using the computer's USB port or the VB8012 power supply adjusted at +5 V.

Example printed waveforms.

Waveforms at 100 kHz

Fig. 16. Function generator switching B(0) at 100 kHz.

Overshoot, ringing and noise is appreciated at the oscilloscope signals at higher frequencies. Thus, the digital acquisition at 1 GS/s (1 ns resolution) captures this noire as extra digital pulses. It is clear that poor lengthy flat cables and standard probes do not allow much higher frequencies.

Waveforms at 1 MHz

Fig. 17. Function generator switching at 1 MHz.

 

Low to high edge

Fig. 18. Measuring the propagation delay tPLH = 15 ns from B(0) to Cout zooming at the low-to-high edge.

 

Power consumption

We can measure the current required by all the chips and circuits the DE10-Lite FPGA board, and thus calculate the static power at 1 Hz when the result is 512 (PS =  625 mW) and when is 511 (PS =  715 mW). The difference is in the number of LED lighting. Thus <PS> = 670 mW. Measuring the total power consumption at 1 MHz (PS + PD  = 755 mW), we can deduce the dynamic power of the FPGA chip at such frequency of operation PD = 85 mW. 

Pwer consumption dynamic

Power consumption

Fig. 19. Power consumption.

Optional (out of the CSD scope). At this level, many more questions may be asked regarding results and waveforms from these experiments. For example:

- What is the effect of the long flat cable? What kind of circuit are required to drive digital signals over long wires?

- How to drive the same B(0) repeating the same experiments using an internal CLK signal that probably will generate clearer digital waves?

Fig. 20 shows the experient when using an internal CLK_Generator (from lecture L8.2). This is the full experiment in case you like to try it in the lab: ALU_9bit_top_CLK.zip.

Example experiment to drive B(0) internally

Fig. 20. Let us measure the circuit performance when using an internal CLK signal instead of the external VB8012 function generator. Let us see whether we can obtain clearer waveforms. B(0) used as output is available at the same pin W5 to be monitored at the oscilloscope CH1.

 

And, after this initial question, many more advanced ones appear. For instance:

- How to design the PCB and the instrument probe setup so that signal bouncing, noise and ringing can be reduced or even eliminated to maintain signal integrity?

 


NOTE : A prototyping PCB board like this one expanding the DE10-Lite resources with many more input and outputs, once manufactured, can be used for other laboratory projects such counters, shift registers, etc. We just need to rename the wires in the KiCad schematic and reassign FPGA pins.

 


Annexes

A1. Example of planning the Adder_9bit required in this project using the CLA option from the Adder_16bit studied in this tutorial laboratory. 

Timing diagram

Fig. 1. Adder_9bit symbol and truth table.

We can invent it as the other similar circuits, using plan C2.

Adder_9bit

Fig. 2. Proposed Adder_9bit.

This is the VHDL translation of the Fig. 2 schematic: Adder_9bit.vhd , Adder_4bit (it includes the  Adder_4bit and the Carry_Generator),  Adder_1bit (for example plan A)