数据流
计算机科学
现场可编程门阵列
卷积神经网络
钥匙(锁)
人工智能
计算机体系结构
加速度
计算机工程
资源(消歧)
机器学习
能量(信号处理)
人工神经网络
高效能源利用
深层神经网络
深度学习
功率(物理)
计算复杂性理论
嵌入式系统
能源消耗
逻辑门
模棱两可
理论计算机科学
作者
Junye Jiang,Zhou, Yaan,Yuanhao Gong,Haoxuan Yuan,Shuanglong Liu
标识
DOI:10.48550/arxiv.2505.13461
摘要
Convolutional Neural Networks (CNNs) are fundamental to deep learning, driving applications across various domains. However, their growing complexity has significantly increased computational demands, necessitating efficient hardware accelerators. Field-Programmable Gate Arrays (FPGAs) have emerged as a leading solution, offering reconfigurability, parallelism, and energy efficiency. This paper provides a comprehensive review of FPGA-based hardware accelerators specifically designed for CNNs. It presents and summarizes the performance evaluation framework grounded in existing studies and explores key optimization strategies, such as parallel computing, dataflow optimization, and hardware-software co-design. It also compares various FPGA architectures in terms of latency, throughput, compute efficiency, power consumption, and resource utilization. Finally, the paper highlights future challenges and opportunities, emphasizing the potential for continued innovation in this field.
科研通智能强力驱动
Strongly Powered by AbleSci AI