可扩展性
计算机科学
加速
格子Boltzmann方法
超级计算机
计算科学
并行计算
Petascale计算
区域分解方法
工作量
计算流体力学
内存带宽
分布式计算
计算机工程
航空航天工程
有限元法
物理
工程类
操作系统
热力学
数据库
量子力学
作者
Lei Zhao,Xuesen Chu,Xiaojing Lv,Hongsong Meng,Shupeng Shi,Wenji Han,Jia Xu,Haohuan Fu,Guangwen Yang
标识
DOI:10.1109/ipdps.2019.00065
摘要
The Lattice Boltzmann Method (LBM) is a relatively new class of Computational Fluid Dynamics methods. In this paper, we report our work on SunwayLB, which enables LBM based solutions aiming for industrial applications. We propose several techniques to boost the simulation speed and improve the scalability of SunwayLB, including a customized multi-level domain decomposition and data sharing scheme, a carefully orchestrated strategy to fuse kernels with different performance constraints for a more balanced workload, and optimization strategies for assembly code, which bring up to 137x speedup. Based on these optimization schemes, we manage to perform the largest direct numerical simulation which involves up to 5.6 trillion lattice cells, achieving 11,245 billion cell updates per second (GLUPS), 77% memory bandwidth utilization and a sustained performance of 4.7 PFlops. We also demonstrate a series of computational experiments for extreme-large scale fluid flow, as examples of real-world applications, to check the validity and performance of our work. The results show that SunwayLB is competent for a practical solution for industrial applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI