为什么这个矩阵乘法算法比另一个更快？

Question

int mmult_omp(double *c,
           double *a, int aRows, int aCols,
           double *b, int bRows, int bCols, int numThreads)
{
  for (i = 0; i < aRows; i++) {
for (j = 0; j < bCols; j++) {
  c[i*bCols + j] = 0;
}
for (k = 0; k < aCols; k++) {
  for (j = 0; j < bCols; j++) {
                c[i*bCols + j] += a[i*aCols + k] * b[k*bCols + j];
  }
}

} }

for (i = 0; i < aRows; i++) {
    for (j = 0; j < bCols; j++) {
    c[i*bCols + j] = 0;
    for (k = 0; k < aCols; k++) {
    c[i*bCols + j] += a[i*aCols + k] *  b[k*bCols + j];
  }
}

} }

Why is the first algorithm faster than the second?为什么第一个算法比第二个算法快？ I've used C's time library and the first algorithm is objectively faster than the second.我使用了 C 的时间库，第一个算法客观上比第二个算法快。 Why is that?这是为什么？

Answer 1

This code is very hard to understand.这段代码很难理解。 I had to copy it and reformat it to see what loops were what.我不得不复制它并重新格式化它以查看循环是什么。 I'm not really sure why one is faster but here's a great resource to see why.我不确定为什么一个更快，但这里有一个很好的资源来了解原因。

Here are links to inspect the assembly output:以下是检查程序集输出的链接：

link for #1 #1 的链接
link for #2 #2 的链接

为什么这个矩阵乘法算法比另一个更快？

问题描述

1 个解决方案

解决方案1
-1 2019-03-16 22:33:21

为什么这个矩阵乘法算法比另一个更快？

问题描述

1 个解决方案

解决方案1 -1 2019-03-16 22:33:21

解决方案1
-1 2019-03-16 22:33:21