Design of an Application-specific VLIW Vector Processor for ORB Feature Extraction

Ferreira, Lucas; Malkowsky, Steffen; Persson, Patrik; Karlsson, Sven, et al. (2023). Design of an Application-specific VLIW Vector Processor for ORB Feature Extraction. Journal of Signal Processing Systems, 95, (7), 863 - 875
Download:
DOI:
| Published | English
Authors:
Ferreira, Lucas ; Malkowsky, Steffen ; Persson, Patrik ; Karlsson, Sven , et al.
Department:
LTH Profile Area: AI and Digitalization
Integrated Electronic Systems
LTH Profile Area: Nanoscience and Semiconductor Technology
ELLIIT: the Linköping-Lund initiative on IT and mobile communication
Mathematics (Faculty of Engineering)
LTH Profile Area: Engineering Health
eSSENCE: The e-Science Collaboration
Mathematical Imaging Group
Research Group:
Integrated Electronic Systems
Mathematical Imaging Group
Abstract:

In computer-vision feature extraction algorithms, compressing the image into a sparse set of trackable keypoints, empowers navigation-critical systems such as Simultaneous Localization And Mapping (SLAM) in autonomous robots, and also other applications such as augmented reality and 3D reconstruction. Most of those applications are performed in battery-powered gadgets featuring in common a very stringent power-budget. Near-to-sensor computing of feature extraction algorithms allows for several design optimizations. First, the overall on-chip memory requirements can be lessened, and second, the internal data movement can be minimized. This work explores the usage of an Application Specific Instruction Set Processor (ASIP) dedicated to perform feature extraction in a real-time and energy-efficient manner. The ASIP features a Very Long Instruction Word (VLIW) architecture comprising one RV32I RISC-V and three vector slots. The on-chip memory sub-system implements parallel multi-bank memories with near-memory data shuffling to enable single-cycle multi-pattern vector access. Oriented FAST and Rotated BRIEF (ORB) are thoroughly explored to validate the proposed architecture, achieving a throughput of 140 Frames-Per-Second (FPS) for VGA images for one scale, while reducing the number of memory accesses by 2 orders of magnitude as compared to other embedded general-purpose architectures.

Keywords:
ASIP ; Feature extraction ; ORB ; Vision-based SLAM
ISSN:
1939-8018
LUP-ID:
1d39bd4e-ab21-47a3-be19-fd60a5609995 | Link: https://lup.lub.lu.se/record/1d39bd4e-ab21-47a3-be19-fd60a5609995 | Statistics

Cite this