Advanced

Design Issues in VLSI Implementation of Image Processing Hardware Accelerators - Methodology and Implementation

Jiang, Hongtu LU (2007)
Abstract
With the increasing capacity in today's hardware system design enabled by technology scaling, image processing algorithms with substantially more complexity can be implemented in a single chip with real-time performance. Combined with the demand for low power consumption or larger resolution seen in many applications such as mobile devices and HDTV, new design methodologies and hardware architectures are constantly called for to bridge the gap between designers productivity and what the technology could offer.



This thesis tries to address several issues commonly encountered in the implementations of real-time image processing system designs. Two implementations are presented to focus on different design issues in... (More)
With the increasing capacity in today's hardware system design enabled by technology scaling, image processing algorithms with substantially more complexity can be implemented in a single chip with real-time performance. Combined with the demand for low power consumption or larger resolution seen in many applications such as mobile devices and HDTV, new design methodologies and hardware architectures are constantly called for to bridge the gap between designers productivity and what the technology could offer.



This thesis tries to address several issues commonly encountered in the implementations of real-time image processing system designs. Two implementations are presented to focus on different design issues in hardware design for image processing systems.



In the first part, a real-time video surveillance system is presented by combining five papers. The segmentation unit is part of a real-time automated video surveillance system developed at the department, aiming for tracking people in an indoor environment. Alternative segmentation algorithms are elaborated, and various modifications to the selected segmentation algorithm is made aiming for potential hardware efficiency. In order to bridge the memory bandwidth issue which is identified as the bottleneck of the segmentation unit, combined memory bandwidth reduction schemes with pixel locality and wordlength reduction are utilized, resulting in an over 70% memory bandwidth reduction. Together with morphology, labeling and tracking unit developed by two other Ph.D. students, the whole surveillance system is prototyped on an Xilinx VirtexII pro VP30 FPGA, with a real-time performance at a 25 fps with a resolution of 320 × 240.



For the second part, two papers are extended to discuss issues of a controller design implementation a control intensive algorithm. To avoid tedious and error prone procedure of hand coding FSMs in VHDL, a controller synthesis tool is modified to automate a controller design flow from C-like control algorithm specification to controller implementation in VHDL. To address issues of memory bandwidth as well as power consumptions, a three levels of memory hierarchical architecture is implemented, resulting in off-chip memory bandwidth reduction from N2 per clock cycle to only 1 per pixel operation. Furthermore, potential power consumption reduction of over 2.5 times can be obtained with the architecture. Together with a controller synthesized from the developed tool, a real-time image convolution system is implemented on an Xilinx VirtexE FPGA platform. (Less)
Please use this url to cite or link to this publication:
author
supervisor
opponent
  • Ph.D. Dokladal, Petr, Centre de Morphologie Mathématique, Ecole des Mines de Paris, France
organization
publishing date
type
Thesis
publication status
published
subject
keywords
Signal processing, Bildbehandling, image processing, Imaging, Teknik, Technological sciences, Controller Synthesis, Circuit design, FPGA, Image convolution, Image processing, Video segmentation, Signalbehandling, Electronics, Elektronik
pages
139 pages
publisher
Department of Electroscience, Lund University
defense location
Room E:1406 Electrical Engineering Building Faculty of Engineering, Lund University Ole Römers väg 3 Lund
defense date
2007-02-23 10:15
ISSN
1402-8662
language
English
LU publication?
yes
id
aed7f2d3-f03c-47fb-8f62-8ee8d890842c (old id 26970)
date added to LUP
2007-06-05 13:45:02
date last changed
2016-09-19 08:44:53
@phdthesis{aed7f2d3-f03c-47fb-8f62-8ee8d890842c,
  abstract     = {With the increasing capacity in today's hardware system design enabled by technology scaling, image processing algorithms with substantially more complexity can be implemented in a single chip with real-time performance. Combined with the demand for low power consumption or larger resolution seen in many applications such as mobile devices and HDTV, new design methodologies and hardware architectures are constantly called for to bridge the gap between designers productivity and what the technology could offer.<br/><br>
<br/><br>
This thesis tries to address several issues commonly encountered in the implementations of real-time image processing system designs. Two implementations are presented to focus on different design issues in hardware design for image processing systems.<br/><br>
<br/><br>
In the first part, a real-time video surveillance system is presented by combining five papers. The segmentation unit is part of a real-time automated video surveillance system developed at the department, aiming for tracking people in an indoor environment. Alternative segmentation algorithms are elaborated, and various modifications to the selected segmentation algorithm is made aiming for potential hardware efficiency. In order to bridge the memory bandwidth issue which is identified as the bottleneck of the segmentation unit, combined memory bandwidth reduction schemes with pixel locality and wordlength reduction are utilized, resulting in an over 70% memory bandwidth reduction. Together with morphology, labeling and tracking unit developed by two other Ph.D. students, the whole surveillance system is prototyped on an Xilinx VirtexII pro VP30 FPGA, with a real-time performance at a 25 fps with a resolution of 320 × 240.<br/><br>
<br/><br>
For the second part, two papers are extended to discuss issues of a controller design implementation a control intensive algorithm. To avoid tedious and error prone procedure of hand coding FSMs in VHDL, a controller synthesis tool is modified to automate a controller design flow from C-like control algorithm specification to controller implementation in VHDL. To address issues of memory bandwidth as well as power consumptions, a three levels of memory hierarchical architecture is implemented, resulting in off-chip memory bandwidth reduction from N2 per clock cycle to only 1 per pixel operation. Furthermore, potential power consumption reduction of over 2.5 times can be obtained with the architecture. Together with a controller synthesized from the developed tool, a real-time image convolution system is implemented on an Xilinx VirtexE FPGA platform.},
  author       = {Jiang, Hongtu},
  issn         = {1402-8662},
  keyword      = {Signal processing,Bildbehandling,image processing,Imaging,Teknik,Technological sciences,Controller Synthesis,Circuit design,FPGA,Image convolution,Image processing,Video segmentation,Signalbehandling,Electronics,Elektronik},
  language     = {eng},
  pages        = {139},
  publisher    = {Department of Electroscience, Lund University},
  school       = {Lund University},
  title        = {Design Issues in VLSI Implementation of Image Processing Hardware Accelerators - Methodology and Implementation},
  year         = {2007},
}