数组中不同的浮点值会影响性能 10 倍 - 为什么？

Question

please check out my code and the quesion below - thanks请查看我的代码和下面的问题 - 谢谢

Code:代码：

#include <iostream>
#include <chrono>

using namespace std;

int bufferWriteIndex = 0;
float curSample = 0;

float damping[5] = { 1, 1, 1, 1, 1 };

float modeDampingTermsExp[5] = { 0.447604, 0.0497871, 0.00247875, 0.00012341, 1.37263e-05 };
float modeDampingTermsExp2[5] = { -0.803847, -3, -6, -9, -11.1962 };


int main(int argc, char** argv) {

    float subt = 0;
    int subWriteIndex = 0;
    auto now = std::chrono::high_resolution_clock::now();


    while (true) {

        curSample = 0;

        for (int i = 0; i < 5; i++) {

            //Slow version
            damping[i] = damping[i] * modeDampingTermsExp2[i];

            //Fast version
            //damping[i] = damping[i] * modeDampingTermsExp[i];
            float cosT = 2 * damping[i];

            for (int m = 0; m < 5; m++) {
                curSample += cosT;

            }
        }

        //t += tIncr;
        bufferWriteIndex++;


        //measure calculations per second
        auto elapsed = std::chrono::high_resolution_clock::now() - now;
        if ((elapsed / std::chrono::milliseconds(1)) > 1000) {
            now = std::chrono::high_resolution_clock::now();
            int idx = bufferWriteIndex;
            cout << idx - subWriteIndex << endl;
            subWriteIndex = idx;
        }

    }
}

As you can see im measuring the number of calculations or increments of bufferWriteIndex per second.正如您所看到的，我正在测量每秒的计算次数或bufferWriteIndex的增量。

Question:问题：

Why is performance faster when using modeDampingTermsExp - Program output:为什么使用modeDampingTermsExp时性能更快 - 程序 output：

vs using modeDampingTermsExp2 ?与使用modeDampingTermsExp2 ？

It's about 10x faster.它大约快 10 倍。 It seems like the numbers in those 2 arrays have an impact on calculation time.似乎那些 2 arrays 中的数字对计算时间有影响。 Why?为什么？

I am using Visual Studio 2019 with the following flags: /O2 /Oi /Ot /fp:fast我正在使用带有以下标志的 Visual Studio 2019：/O2 /Oi /Ot /fp:fast

Answer 1

This is because you are hitting denormal numbers (also see this question ).这是因为您遇到了非正规数字（另请参阅此问题）。

You can get rid of denormals like so:您可以像这样摆脱非规范化：

#include <cmath>

// [...]

for (int i = 0; i < 5; i++) {
    damping[i] = damping[i] * modeDampingTermsExp2[i];
    if (std::fpclassify(damping[i]) == FP_SUBNORMAL) {
        damping[i] = 0; // Treat denormals as 0.
    }

    float cosT = 2 * damping[i];

    for (int m = 0; m < 5; m++) {
        curSample += cosT;
    }
}

数组中不同的浮点值会影响性能 10 倍 - 为什么？

问题描述

1 个解决方案

解决方案1
1 已采纳 2020-05-16 15:40:04

数组中不同的浮点值会影响性能 10 倍 - 为什么？

问题描述

1 个解决方案

解决方案1 1 已采纳 2020-05-16 15:40:04

解决方案1
1 已采纳 2020-05-16 15:40:04