库达
加速
计算机科学
并行计算
粒子群优化
数据并行性
并行算法
算法
维数之咒
图形处理单元
平行性(语法)
人工智能
作者
Yanhong Zhuo,Tao Zhang,Feng Du,Ruilin Liu
标识
DOI:10.1016/j.asoc.2023.110499
摘要
Parallel computing is the main way to improve the computational efficiency of metaheuristic algorithms for solving high-dimensional, nonlinear optimization problems. Previous studies have typically only implemented local parallelism for the particle swarm optimization (PSO) algorithm. In this study, we proposed a new parallel particle swarm optimization algorithm (GPU-PSO) based on the Graphics Processing Units (GPU) and Compute Unified Device Architecture (CUDA), which uses a combination of coarse-grained parallelism and fine-grained parallelism to achieve global parallelism. In addition, we designed a data structure based on CUDA features and utilized a merged memory access mode to further improve data-parallel processing and data access efficiency. Experimental results show that the algorithm effectively reduces the solution time of PSO for solving high-dimensional, large-scale optimization problems. The speedup ratio increases with the dimensionality of the objective function, where the speedup ratio is up to 2000 times for the high-dimensional Ackley function.
科研通智能强力驱动
Strongly Powered by AbleSci AI