ECE 5760: Laboratory 3
Mandelbrot set visualization.
In this assignment you will implement a system to compute and draw the Mandelbrot
set, and to zoom in on pieces of the set. A matlab program shows
how the computation is done. Note that the numbers are all complex. The image
below is from the program output. The x,y axis units on the image are pixels, not natural
units. The actual x axis range is
[-2,1] and the y axis
[-1,1]. The rectangle is the largest rectangle I could fit into the set.
Former TA, Shiva Rajagopal, found a video for Mandelbrot Set made by Cornell students! https://www.youtube.com/watch?v=ES-yKOYaXq0
Relevant lectures from 2017 might be lectures 19 to 21.
But remember that lab 4 in 2017 is lab 3 today.
- There may be a HPS to control computation and set parameters.
Baseline perfromance that you must beat is my best single-processor HPS code.
Read the University computer page, Mandelbrot example.
- You will use custom bus-master hardware (or direct write to display memory) to compute
the image. The computation of a 640x480 image is compute intensive, so you
are going to want to implement multiple computational units to speed up the
process. Spend some time considering load balancing since some regions of the
complex plane are much faster to compute than others. Initial testing suggests that you can calculate and display the mandebrot set approximately 30 times faster on FPGA hardware, than on the HPS. The computation must output directly to VGA and not pass results through the HPS.
- If you write directly to Qsys VGA display memory you use a scheme like that entitled:
GPU with FAST display from SRAM on the bus-master page.
For maximum write speed you can give each iterator it's own M10k block memory.
If you do this, you will need to use a custom VGA interface which multiplexes through the iterator memories.
- For the custom hardware, I suggest using a minimum of 4.23 fixed point notation for
interating the quadratic
complex number calculation. The number of bits carried will limit the zoom depth. Every bit gives one level of zoom, but you need 10 bits to represent the screen width, so 23 bits gives only 13 zoom levels. The top bit will be the usual 2's comp sign bit.
The 4.23 notation means that a single DSP unit can do one multiply. This format give the numbers a dynamic range of +/-7. A matlab
program to check the effect
of limiting the range shows that the computation works without overflow.
Examples from class:
- 2018 -- video1, video2, video3 ; 20 solvers, 100 MHz.
Effective arithmetic rate is about 10 billion operations/sec.
Thanks: Xitang Zhao and Vidya Ramesh
- Skyler Schneider 1, 2, 3, 4
- Venkatesh Santhanagopalan and Rick Wong 1, 2, 3
- Jinda Cui and Jiawei Yang, Full set.;
Satellite double spiral: x_mid=-0.743 643 900 055; y_mid=0.131 825 890 901; zoom level=3000;
neural network: range mode: x=(-0.37432239 -0.373655726 ); y=(0.65938433 0.66005100); zoom mode: x_mid=-0.37398906; y_mid=0.65971767; zoom level=3000;
devil's eye: range mode: x=(-1.87 -1.85); y=(-0.01 0.01); zoom mode: x_mid=-1.86; y_mid=0; zoom level=100;
alien's egg: range mode: x=(-1.863 -1.85); y=(-0.00153846 0.00153846); zoom mode: x_mid=-1.8615; y_mid=0; zoom level=650;
lava: range mode: x=(-1.38 -1.36);y=(-0.026 -0.006); zoom mode: x_mid=-1.37; y_mid=-0.016; zoom level=100;
- Sandeep Gangundi and Sion Wang 600x_zoom
- Mohammad Saifee Dohadwala
||complex cood x
||complex cood y
|| x = [-1.339301 , -1.336371]
||y = [-0.060742 , 0.058789]
|| x = [0.385683 , 0.388613]
|| y = [-0.268013 , -0.266060]
|| x = [0.379813 , 0.381278]
||y = [-0.275228 , -0.274251]
|| x = [-0.670565, -0.669833]
||y = [-0.458300, -0.457812]
Weekly check points
- Week 1:
Demonstrate a simulation of one complex iterator pipeline. The number of iterations required should match the equivalent matlab point in the complex plane for at least two points. Use synthesizable Verilog running in simulation in ModelSim.
- Week 2:
Demo more than one complex iterator writing to the VGA didplay to show full Mandelbrot Set on the VGA.
You could connect the iterators to the Qsys VGA as a bus-master
or you could write directly to VGA display memory.
-- Demo interactive zooming on the VGA.
- Week 3: demo full system below
- You may use HPS for control, but you do not have to.
- You do need the best HPS time for comparision to FPGA solution time.
- The corners of the region being plotted should be displayed either on the VGA screen, or on a serial console.
- You should be able
to choose the maximum number of iterations using serial input the HPS, or toggle switches,
or however you want to do it.
- Upon pressing KEY3, the system should compute and display the full Mandelbrot
as shown above.
- The computation must be a bus-master output directly to the VGA bus-slave, and not pass results through the HPS. The VGA graphics interface should run the VGA at 640x480 resolution with at least 8-bit color per pixel. Colors may aid in
debugging and certainly look cool. You may want to map color to the number of iterations
to divergence, or to the log of that number. A reasonable approximation of
the log is to just use the position of the largest non-zero bit in the count
A modified matlab
program and image show the effect
of log-compressing the numerical range.
The zoomed image (see below) also
shows different detail.
- There will be a zoom interface using push buttons, or serial communication
to a terminal or a mouse interface. It is sufficient to pick a center coordinate and zoom in/out by a factor of two.
When you trigger the zoom calculation, the specified
corners should be reassigned to the corners of the display and 640x480 new points in those intervals should be iterated
- Part of your grade
will depend upon how fast you can render the full 640x480 set and two other regions picked on demo day.
elapsed time to draw the screen (in milliseconds) should be shown on the VGA screen.
-- if you use a custom VGA driver you may display the time on the HPS console.
- You will need to show how your solution time for the full set depends on the number of solver pipelines used.
Be prepared to demo your design to your TA in lab.
Your written lab report should include the sections mentioned in the policy page, and :
- A table of times to compute various regions of the Mandelbrot set close
to the areas specified in the list.
x=[-2 1], y=[-1 1]
x=[-1.45, -1.3], y=[-0.07, 0.07]
- A collection of photographs of your favorite regions, and their coordinates.
The camera should be good enough to resolve individual pixels.
- A table of your solution time for the full set versus the number of solver pipelines used
and for FPGA solution versus HPS solution time.
- A detailed dsecription of your Qsys design.
- A heavily commented listing of your Verilog design (for FPGA) and GCC code (for HPS).
Copyright Cornell University March 15, 2019