Getting auto-vectorization with gcc?

Original post 2011-06-22 08:48:43 8 2 c++/ gcc

In the context of evaluating negative log-likelihoods, I have to perform a number of operations that could benefit from vectorization:

0) for (i = 1...n) { a[i] = 0; } // but I think std::fill( a.begin(), a.end(), 0 ) is already optimal for this

1) for (i = 1...n) { a[i] += b * c[i]; }

2) sum = 0; for (i = 1...n) { sum += a[i] * log( b[i] / c ); }

Do you know if there is any hope of getting gcc 4.3.4 to auto-vectorize these loops, and how should I write them to help it (e.g. indices vs. iterators, or breaking (2) up into simpler loops)? So far I am using doubles; I still have to check whether I can move to floats, at least for (1).

2 answers

http://gcc.gnu.org/projects/tree-ssa/vectorization.html

Use the required options: -O3 -msse2 (-O3 turns on -ftree-vectorize, and -msse2 makes the SSE2 vector instructions available).

For more options, read the documentation above.
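As an illustration of those options, here is a minimal sketch of loop (1) from the question, written with plain indices over raw pointers. The function name `scaled_add` is hypothetical, and `__restrict__` is a GCC extension used here on the assumption that `a` and `c` never refer to the same storage. Whether a given GCC version actually vectorizes it has to be checked on that version.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Loop (1) from the question: a[i] += b * c[i].
// scaled_add is a hypothetical helper name, not part of the original post.
// Assumption: a and c are distinct vectors, so the __restrict__ promise
// (no overlap between the two arrays) really holds.
void scaled_add(std::vector<double>& a, double b, const std::vector<double>& c)
{
    assert(a.size() == c.size());
    const std::size_t n = a.size();

    // Raw pointers plus a simple counted loop keep the loop shape easy
    // for the vectorizer to analyze.
    double* __restrict__ pa = a.data();
    const double* __restrict__ pc = c.data();

    for (std::size_t i = 0; i < n; ++i)
        pa[i] += b * pc[i];
}
```

To see what the compiler decided, GCC 4.x releases accept -ftree-vectorizer-verbose=2 alongside -O3 -msse2; GCC 4.9 and later use -fopt-info-vec for the same purpose.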

For auto-vectorization of floating-point reductions like (2), you need to enable -funsafe-math-optimizations, because vectorizing the sum changes the order of the additions and therefore the rounding.

On i386-like targets you also need to add -mfpmath=sse.
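One possible way to structure loop (2) is sketched below, as an assumption rather than a verified recipe: the log is computed in a first pass, so the second pass is a pure multiply-add reduction. The name `weighted_log_sum` and the temporary buffer `t` are hypothetical. Whether the pass that calls `std::log` vectorizes depends on the compiler and on a vectorized math library being available, so only the second pass is the intended candidate here.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Loop (2) from the question: sum += a[i] * log(b[i] / c), split in two.
// weighted_log_sum is a hypothetical helper name, not from the original post.
double weighted_log_sum(const std::vector<double>& a,
                        const std::vector<double>& b,
                        double c)
{
    assert(a.size() == b.size());
    const std::size_t n = a.size();

    // Pass 1: element-wise log. The call to std::log may keep this loop
    // scalar; that is an assumption about typical toolchains.
    std::vector<double> t(n);
    for (std::size_t i = 0; i < n; ++i)
        t[i] = std::log(b[i] / c);

    // Pass 2: plain reduction. Vectorizing it reorders the additions,
    // which is why GCC wants -funsafe-math-optimizations for this loop.
    double sum = 0.0;
    for (std::size_t i = 0; i < n; ++i)
        sum += a[i] * t[i];

    return sum;
}
```

The extra buffer costs memory traffic, so it is worth measuring against the single-loop version before keeping the split.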


solution1
2 ACCEPTED 2011-06-22 08:54:04

solution2
0 2012-12-11 17:07:21
