计算以字节为单位设置的位数

Question

我很感兴趣，这是通过这种方式计算字节中设置的位数的最佳方法

template< unsigned char byte > class BITS_SET
{
public:
    enum {
     B0 = (byte & 0x01) ? 1:0,
     B1 = (byte & 0x02) ? 1:0,
     B2 = (byte & 0x04) ? 1:0,
     B3 = (byte & 0x08) ? 1:0,
     B4 = (byte & 0x10) ? 1:0,
     B5 = (byte & 0x20) ? 1:0,
     B6 = (byte & 0x40) ? 1:0,
     B7 = (byte & 0x80) ? 1:0
    };
public:
 enum{RESULT = B0+B1+B2+B3+B4+B5+B6+B7};
};

也许在运行时知道字节值时它是最佳的？ 是否建议在代码中使用它？

Answer 1

对于一个字节的数据，同时考虑速度和内存消耗的最佳方式：

uint8_t count_ones (uint8_t byte)
{
  static const uint8_t NIBBLE_LOOKUP [16] =
  {
    0, 1, 1, 2, 1, 2, 2, 3, 
    1, 2, 2, 3, 2, 3, 3, 4
  };


  return NIBBLE_LOOKUP[byte & 0x0F] + NIBBLE_LOOKUP[byte >> 4];
}

在大多数系统上，从 for 循环调用这个函数应该会产生一个非常有效的程序。 它非常通用。

Answer 2

对于 8 位值，只需使用 256 个元素的查找表。

对于较大尺寸的输入，它稍微不那么琐碎。 Sean Eron Anderson 在他的Bit Twiddling Hacks 页面上为此提供了几个不同的功能，所有功能都具有不同的性能特征。 没有一个是最快的版本，因为它取决于您的处理器的性质（管道深度、分支预测器、缓存大小等）和您使用的数据。

Answer 3

为什么不直接使用标准库？ 这样，最佳方式应由实现确定，并且可能比您实际编写的任何符合标准的代码都要好。 例如，如果您使用的是 x86，这将编译为一条指令，但前提是您的目标是支持它的 CPU 。

#include <bitset>
#include <iostream>

int main() {
  unsigned char bitfield = 17;
  std::cout << std::bitset<8>(bitfield).count() <<
    std::endl;
}

Answer 4

对于单字节值，最快的方法是将答案存储在用该值索引的 256 字节数组中。 例如， bits_set[] = {0, 1, 1, 2, ...

Answer 5

“进行 bitcount 的最快方法”的通常答案是“在数组中查找字节”。 这种方式适用于字节，但您需要为它支付实际的内存访问费用。 如果你只是偶尔这样做一次，它可能是最快的，但如果你只是偶尔这样做，那么你不需要最快的。

如果你经常这样做，你最好将字节分批成字或双字，然后对这些进行快速的 bitcount 操作。 这些往往是纯算术，因为您实际上无法在数组中查找 32 位值以获取其位数。 相反，您通过以巧妙的方式移动和屏蔽来组合值。

这样做的巧妙技巧的一个重要来源是Bit Hacks 。

这是那里发布的用于计算 C 中 32 位字中的位数的方案：

 unsigned int v; // count bits set in this (32-bit value)
 unsigned int c; // store the total here

 v = v - ((v >> 1) & 0x55555555);                    // reuse input as temporary
 v = (v & 0x33333333) + ((v >> 2) & 0x33333333);     // temp
 c = ((v + (v >> 4) & 0xF0F0F0F) * 0x1010101) >> 24; // count

Answer 6

#include <iostream>
#include <climits> // for CHAR_BIT (most likely to be 8)
#include <cstring> // for memset
#include <new> 

static const int DUMMY = -1;

// first approch : activate the O(8) function in first get try... after that its O(1);
class bitsInByteflyLUT
{
    typedef unsigned char byte;

    public:
        bitsInByteflyLUT();     //CTOR - throws std::bad_alloc
        ~bitsInByteflyLUT();    //DTOR


        int Get_bitsInByte(byte _byte);     


    private:
        // CLASS DATA
        int*    flyLUT;

        // PRIVATE FUNCTIONS
        int bitsInByte(byte _byte);
        // O(8) for finding how many bits are ON in a byte.
        // answer can be between 0 to CHAR_BIT.

        bitsInByteflyLUT(const bitsInByteflyLUT & _class); // COPY CTOR - forbidden
        const bitsInByteflyLUT & operator= (const bitsInByteflyLUT& _class);
        // ASSIGN OPERATOR - forbidden

};

bitsInByteflyLUT::bitsInByteflyLUT()
{
    size_t nIndexes = 1 << CHAR_BIT;
    try
    {
        flyLUT =  new int[nIndexes];
    }
    catch (std::bad_alloc& ba)
    {
        throw;
    }
    memset(flyLUT, DUMMY, sizeof(int)*nIndexes);
}


bitsInByteflyLUT::~bitsInByteflyLUT()
{
    delete[] flyLUT;
}


int bitsInByteflyLUT::Get_bitsInByte(byte _byte)
{
    if (flyLUT[_byte] == DUMMY) // if its first time we try to get answer for this char.
    {
        flyLUT[_byte] = bitsInByte(_byte); // O(8)
    }
    return flyLUT[_byte]; // O(1) 
}

int bitsInByteflyLUT::bitsInByte(byte _byte)
{   
    byte nBits = CHAR_BIT;
    byte counter = 0;
    byte mask = 1;
    while(nBits--)
    {
        if(mask & _byte)
        {
            ++counter;
        }
        mask <<= 1;
    }
    return counter;
}





int main ()
{
    using std::cout;
    using std::endl;

    bitsInByteflyLUT flut;

    for (unsigned int i = 0; i < (1 << CHAR_BIT); i += 1)
    {   
        cout << i << " " << flut.Get_bitsInByte(i) << endl;
    }

    return 0;
}

Answer 7

使用 C++17，您可以使用 constexpr lambda 预先计算查找表。 更容易推理它的正确性，而不是一个现成的复制粘贴表。

#include <array>
#include <cstdint>

static constexpr auto bitsPerByteTable = [] {
  std::array<uint8_t, 256> table{};
  for (decltype(table)::size_type i = 0; i < table.size(); i++) {
    table.at(i) = table.at(i / 2) + (i & 1);
  }
  return table;
}();

Answer 8

C++20 std::popcount文件<bit>引入std::popcount

std::popcount(0b1101u)将返回 3

有关更多详细信息，请参阅https://en.cppreference.com/w/cpp/numeric/popcount 。

Answer 9

为什么不左移并屏蔽其余部分？

int countBits(unsigned char byte){
    int count = 0;
    for(int i = 0; i < 8; i++)
        count += (byte >> i) & 0x01; // Shift bit[i] to the first position, and mask off the remaining bits.
    return count;
}

通过简单地计算被计数的值中有多少位，然后在计数器循环中使用该值，这可以很容易地适应处理任何大小的整数。 这一切都非常简单。

int countBits(unsigned long long int a){
    int count = 0;
    for(int i = 0; i < sizeof(a)*8; i++)
        count += (a >> i) & 0x01;
    return count;
}

Answer 10

在 gcc 中，您可以使用 __builtin_popcount(unsigned) 函数。
它应该有效地使用目标硬件平台的最佳解决方案。
使用 -march=core-avx2（与我的 cpu 兼容的最高级别）使用 popcntl x86_64 汇编指令，在硬件中执行。
使用默认的 x86_64 指令集调用 popcntl 函数来实现最佳的 C（聪明的黑客）算法。
还有 __builtin_popcountl 和 __builtin_popcountll 用于 unsigned long 和 unsigned long long。

Answer 11

int count(int a){ return a == 0 ? 0 : 1 + count(a&(a-1)); }

Answer 12

#include <ctime>
#include <iostream>
using namespace std;

int count1s(unsigned char byte) {
  if (byte == 0) {
    return 0;
  }

  if (byte & 0x01) {
    return 1 + count1s(byte >> 1);
  }
  return count1s(byte >> 1);
}

int count1s2(unsigned char byte) {
  static const int ones[256] = {
      0, 1, 1, 2, 1, 2, 2, 3, 1, 2, 2, 3, 2, 3, 3, 4, 1, 2, 2, 3, 2, 3, 3, 4,
      2, 3, 3, 4, 3, 4, 4, 5, 1, 2, 2, 3, 2, 3, 3, 4, 2, 3, 3, 4, 3, 4, 4, 5,
      2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6, 1, 2, 2, 3, 2, 3, 3, 4,
      2, 3, 3, 4, 3, 4, 4, 5, 2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6,
      2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6, 3, 4, 4, 5, 4, 5, 5, 6,
      4, 5, 5, 6, 5, 6, 6, 7, 1, 2, 2, 3, 2, 3, 3, 4, 2, 3, 3, 4, 3, 4, 4, 5,
      2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6, 2, 3, 3, 4, 3, 4, 4, 5,
      3, 4, 4, 5, 4, 5, 5, 6, 3, 4, 4, 5, 4, 5, 5, 6, 4, 5, 5, 6, 5, 6, 6, 7,
      2, 3, 3, 4, 3, 4, 4, 5, 3, 4, 4, 5, 4, 5, 5, 6, 3, 4, 4, 5, 4, 5, 5, 6,
      4, 5, 5, 6, 5, 6, 6, 7, 3, 4, 4, 5, 4, 5, 5, 6, 4, 5, 5, 6, 5, 6, 6, 7,
      4, 5, 5, 6, 5, 6, 6, 7, 5, 6, 6, 7, 6, 7, 7, 8};

  return ones[(int)byte];
}

int main() {
  time_t start = clock();
  int c = count1s(205);
  time_t end = clock();
  cout << "count1: " << c << " time: " << double(end - start) << endl;
  start = clock();
  c = count1s2(205);
  end = clock();
  cout << "count2: " << c << " time: " << double(end - start) << endl;
  return 0;
}

计算以字节为单位设置的位数

问题描述

12 个解决方案

解决方案1
25 2014-09-12 12:40:46

解决方案2
19 已采纳 2012-03-30 20:27:54

解决方案3
11 2015-01-20 12:02:59

解决方案4
4 2012-03-30 20:25:05

解决方案5
2 2012-03-30 20:34:49

解决方案6
1 2017-05-19 13:53:49

解决方案7
1 2020-07-05 11:57:54

解决方案8
1 2021-04-15 21:16:54

解决方案9
1 2012-03-30 21:31:24

解决方案10
0 2020-10-26 11:31:06

解决方案11
0 2012-03-30 20:30:21

解决方案12
-2 2016-07-07 19:41:47

计算以字节为单位设置的位数

问题描述

12 个解决方案

解决方案1 25 2014-09-12 12:40:46

解决方案2 19 已采纳 2012-03-30 20:27:54

解决方案3 11 2015-01-20 12:02:59

解决方案4 4 2012-03-30 20:25:05

解决方案5 2 2012-03-30 20:34:49

解决方案6 1 2017-05-19 13:53:49

解决方案7 1 2020-07-05 11:57:54

解决方案8 1 2021-04-15 21:16:54

解决方案9 1 2012-03-30 21:31:24

解决方案10 0 2020-10-26 11:31:06

解决方案11 0 2012-03-30 20:30:21

解决方案12 -2 2016-07-07 19:41:47

解决方案1
25 2014-09-12 12:40:46

解决方案2
19 已采纳 2012-03-30 20:27:54

解决方案3
11 2015-01-20 12:02:59

解决方案4
4 2012-03-30 20:25:05

解决方案5
2 2012-03-30 20:34:49

解决方案6
1 2017-05-19 13:53:49

解决方案7
1 2020-07-05 11:57:54

解决方案8
1 2021-04-15 21:16:54

解决方案9
1 2012-03-30 21:31:24

解决方案10
0 2020-10-26 11:31:06

解决方案11
0 2012-03-30 20:30:21

解决方案12
-2 2016-07-07 19:41:47