SIMD公司
计算机科学
矢量化(数学)
并行计算
编译程序
交错
计算
方案(数学)
程序设计语言
操作系统
数学分析
数学
作者
Dorit Nuzman,Ira Rosen,Ayal Zaks
出处
期刊:Sigplan Notices
[Association for Computing Machinery]
日期:2006-06-11
卷期号:41 (6): 132-143
被引量:143
标识
DOI:10.1145/1133255.1133997
摘要
Most implementations of the Single Instruction Multiple Data (SIMD) model available today require that data elements be packed in vector registers. Operations on disjoint vector elements are not supported directly and require explicit data reorganization manipulations. Computations on non-contiguous and especially interleaved data appear in important applications, which can greatly benefit from SIMD instructions once the data is reorganized properly. Vectorizing such computations efficiently is therefore an ambitious challenge for both programmers and vectorizing compilers. We demonstrate an automatic compilation scheme that supports effective vectorization in the presence of interleaved data with constant strides that are powers of 2, facilitating data reorganization. We demonstrate how our vectorization scheme applies to dominant SIMD architectures, and present experimental results on a wide range of key kernels, showing speedups in execution time up to 3.7 for interleaving levels (stride) as high as 8.
科研通智能强力驱动
Strongly Powered by AbleSci AI