多分支卷积神经网络的FPGA设计与优化-AET-电子技术应用

多分支卷积神经网络的FPGA设计与优化

2021年电子技术应用第7期

谢思璞，魏榕山

福州大学物理与信息工程学院，福建福州350108

摘要： 针对拓宽神经网络的结构会导致计算量增大，计算性能降低，需要针对并行的网络进行更有效的优化以及调度。通过分析FPGA平台上实现卷积神经网络的计算吞吐量和所需的带宽，在计算资源和访存带宽的限制下，采用了屋顶模型进行了设计空间的探索，提出了在不同支的卷积神经网络中使用不同的循环展开因子，从而实现同一卷积层中不同支神经网络的并行计算，保证计算资源和内存资源的合理分配。实验结果表明，所提出的设计与先前研究相比获得了1.31×的性能提升。

关键词： 多分支卷积神经网络 FPGA 屋顶模型并行计算

中图分类号： TN409
文献标识码： A
DOI：10.16157/j.issn.0258-7998.211279
中文引用格式： 谢思璞，魏榕山. 多分支卷积神经网络的FPGA设计与优化[J].电子技术应用，2021，47(7)：97-101.
英文引用格式： Xie Sipu，Wei Rongshan. FPGA design and optimization of multi-branch CNN[J]. Application of Electronic Technique，2021，47(7)：97-101.

FPGA design and optimization of multi-branch CNN

Xie Sipu，Wei Rongshan

School of Physics and Information Engineering，Fuzhou University，Fuzhou 350108，China

Abstract： Broadening the structure of the neural network will lead to the increase of the amount of computation and the decrease of the computational performance, it is necessary to optimize and schedule the parallel network more effectively. By analyzing the throughput and bandwidth of convolutional neural network on FPGA platform, the roof model is used to explore the design space under the limitation of computing resources and memory access bandwidth. It is proposed to use different cycle expansion factors in different branches of convolutional neural network, so as to realize the parallel computing of different branches of neural network in the same convolution layer and ensure the computing efficiency reasonable allocation of resources and memory resources. The experimental results show that the performance of the proposed design is improved by 1.31×compared with the previous research.

Key words : multi-branch convolutional neural network；FPGA；roofline model；parallel computing

0 引言

近年来，神经网络受到了广泛热议，成为了学术界和工业界的热门议题，Google、Microsoft和Facebook等科技公司都建立了相关的研究小组，以探索CNN的新架构^[1-3]。通过对CNN架构上的创新改善CNN性能，利用空间和通道信息，结构的深度和宽度以及多路径信息处理等方法引起了广泛的讨论。

在众多新型CNN架构中，基于宽度扩展的多支并行的CNN得到了国内外学术届的重视。KAWAGUCHI K等人提出网络的宽度是影响网络精度与准确度的一个重要指标^[4]。通过在层中并行使用多个处理单元，可以得到比感知器更为复杂的映射。GoogLeNet中的Inception模块是一种典型的多支网络架构，并使用了不同尺寸的卷积核^[5]。2017年，DEL COCO M等人^[6]利用多分支结构引入了并行的多尺度分析，减小了神经网络的深度，克服了过拟合问题。拓宽网络宽度的多支并行卷积神经网络在图像分割以及识别等任务中，提高了网络在不同尺度上的特征提取能力,受到了国内外研究机构的重视^[7-9]。

本文详细内容请下载：http://www.chinaaet.com/resource/share/2000003664。

作者信息：

谢思璞，魏榕山

(福州大学物理与信息工程学院，福建福州350108)

原创声明：此内容为AET网站原创，未经授权禁止转载。

相关内容