Doubly-Block Circulant Kernel Matrix Exploitation in Convolutional Accelerators

Ferreira, Lucas; Malkowsky, Steffen; Persson, Patrik; Astrom, Karl, et al. (2023). Doubly-Block Circulant Kernel Matrix Exploitation in Convolutional Accelerators 2023 IEEE 66th International Midwest Symposium on Circuits and Systems, MWSCAS 2023, 236 - 240. 2023 IEEE 66th International Midwest Symposium on Circuits and Systems, MWSCAS 2023. Tempe, United States: IEEE - Institute of Electrical and Electronics Engineers Inc.
Download:
DOI:
Conference Proceeding/Paper | Published | English
Authors:
Ferreira, Lucas ; Malkowsky, Steffen ; Persson, Patrik ; Astrom, Karl , et al.
Department:
Department of Electrical and Information Technology
Integrated Electronic Systems
LTH Profile Area: AI and Digitalization
ELLIIT: the Linköping-Lund initiative on IT and mobile communication
LTH Profile Area: Nanoscience and Semiconductor Technology
Mathematics (Faculty of Engineering)
LU Profile Area: Nature-based future solutions
LU Profile Area: Light and Materials
LU Profile Area: Proactive Ageing
LU Profile Area: Natural and Artificial Cognition
LTH Profile Area: Engineering Health
Mathematical Imaging Group
Research Group:
Integrated Electronic Systems
Mathematical Imaging Group
Abstract:

In this paper, we present a novel algorithmic and hardware co-design approach specifically tailored for efficient 2D convolution implementations, a crucial operation in convolutional neural networks (CNNs). Our method addresses the limitations of existing software-based solutions and hardware-based architectures, delivering significant improvements in asymptotic behavior for generic convolution cases. By leveraging the distinctive geometry of doubly block circulant unrolled kernel matrices, our approach eliminates the need for input and weight buffers, optimizes output memory usage, and minimizes redundant memory accesses. A comprehensive comparative analysis with state-of-the-art techniques showcases the key advantages and superior performance of our proposed method, achieving substantial reductions in memory requirements and high throughput.

Keywords:
2D Convolution ; Doubly-Blocked Circulant Matrix ; Systolic Array ; Unrolled Kernel Matrix
ISBN:
9798350302103
LUP-ID:
75f5ede4-3210-4e97-8eeb-b97f5ffdc838 | Link: https://lup.lub.lu.se/record/75f5ede4-3210-4e97-8eeb-b97f5ffdc838 | Statistics

Cite this