Understanding the code in the strlen implementation

Question

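I have two questions about the implementation of strlen in glibc (declared in string.h).

The implementation uses a magic number with "holes". I cannot understand how this works. Can someone help me understand this snippet: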

size_t
strlen (const char *str)
{
  const char *char_ptr;
  const unsigned long int *longword_ptr;
  unsigned long int longword, himagic, lomagic;

  /* Handle the first few characters by reading one character at a time.
     Do this until CHAR_PTR is aligned on a longword boundary.  */
  for (char_ptr = str;
       ((unsigned long int) char_ptr & (sizeof (longword) - 1)) != 0;
       ++char_ptr)
    if (*char_ptr == '\0')
      return char_ptr - str;

  /* All these elucidatory comments refer to 4-byte longwords,
     but the theory applies equally well to 8-byte longwords.  */
  longword_ptr = (unsigned long int *) char_ptr;

  /* Bits 31, 24, 16, and 8 of this number are zero.  Call these bits
     the "holes."  Note that there is a hole just to the left of
     each byte, with an extra at the end:

     bits:  01111110 11111110 11111110 11111111
     bytes: AAAAAAAA BBBBBBBB CCCCCCCC DDDDDDDD

     The 1-bits make sure that carries propagate to the next 0-bit.
     The 0-bits provide holes for carries to fall into.  */
  himagic = 0x80808080L;
  lomagic = 0x01010101L;
  if (sizeof (longword) > 4)
    {
      /* 64-bit version of the magic.  */
      /* Do the shift in two steps to avoid a warning if long has 32 bits.  */
      himagic = ((himagic << 16) << 16) | himagic;
      lomagic = ((lomagic << 16) << 16) | lomagic;
    }
  if (sizeof (longword) > 8)
    abort ();

  /* Instead of the traditional loop which tests each character,
     we will test a longword at a time.  The tricky part is testing
     if *any of the four* bytes in the longword in question are zero.  */
  for (;;)
    {
      longword = *longword_ptr++;

      if (((longword - lomagic) & ~longword & himagic) != 0)
        {
          /* Which of the bytes was the zero?  If none of them were, it was
             a misfire; continue the search.  */
          const char *cp = (const char *) (longword_ptr - 1);

          if (cp[0] == 0)
            return cp - str;
          if (cp[1] == 0)
            return cp - str + 1;
          if (cp[2] == 0)
            return cp - str + 2;
          if (cp[3] == 0)
            return cp - str + 3;
          if (sizeof (longword) > 4)
            {
              if (cp[4] == 0)
                return cp - str + 4;
              if (cp[5] == 0)
                return cp - str + 5;
              if (cp[6] == 0)
                return cp - str + 6;
              if (cp[7] == 0)
                return cp - str + 7;
            }
        }
    }
}

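What is the magic number used for?

Why not simply increment the pointer until the NUL character and return the count? Is this approach faster? If so, why?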

Answer 1

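The magic numbers let the code look at 4 bytes (32 bits) or even 8 bytes (64 bits) at once and check whether any one of them is zero (the end of the string), instead of checking each byte individually.

Here is one example of a check for a null byte: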

unsigned int v; // 32-bit word to check if any 8-bit byte in it is 0
bool hasZeroByte = ~((((v & 0x7F7F7F7F) + 0x7F7F7F7F) | v) | 0x7F7F7F7F);
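
As a sketch of how the expression above can be tried out, it is wrapped here in a function. The name `has_zero_byte` and the use of `uint32_t` are assumptions made for illustration; the expression itself is unchanged.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical wrapper around the expression above.
   Returns true when at least one of the four bytes of v is 0x00. */
static bool has_zero_byte(uint32_t v)
{
    /* (v & 0x7F..) + 0x7F.. sets bit 7 of every byte whose low 7 bits
       are non-zero, and cannot carry into the neighbouring byte.
       OR-ing in v also sets bit 7 of bytes that had it set already.
       OR-ing in 0x7F.. fills the low bits, so each non-zero byte is now
       0xFF and each zero byte is 0x7F.
       The final ~ leaves 0x80 in zero bytes and 0x00 everywhere else. */
    uint32_t flags = ~((((v & 0x7F7F7F7Fu) + 0x7F7F7F7Fu) | v) | 0x7F7F7F7Fu);
    return flags != 0;
}
```

For instance, `has_zero_byte(0x41004344u)` is true because the second byte from the top is zero, while `has_zero_byte(0x80808080u)` is false.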

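For more, see Bit Twiddling Hacks.

That page describes the method used here (32-bit example) as follows:

There is yet a faster method: use hasless(v, 1), which is defined further down that page; it works in 4 operations and requires no subsequent verification. It simplifies to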

#define haszero(v) (((v) - 0x01010101UL) & ~(v) & 0x80808080UL)

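The subexpression (v - 0x01010101UL) evaluates to a high bit set in any byte whenever the corresponding byte in v is zero or greater than 0x80. The subexpression ~v & 0x80808080UL evaluates to high bits set in those bytes where the byte of v does not have its high bit set (so the byte was less than 0x80). Finally, by ANDing these two subexpressions, the result is the high bits set where the bytes in v were zero, since the high bits set due to a value greater than 0x80 in the first subexpression are masked off by the second.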
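
A minimal sketch for experimenting with the macro. The function name `zero_byte_flags` is hypothetical, and it returns the raw flag word rather than a boolean. One detail worth seeing in practice: the subtraction borrows across bytes, so a 0x01 byte sitting directly above a zero byte gets flagged as well. The result is therefore reliable as a yes/no answer to "is there a zero byte?", but locating the terminator still calls for a byte-by-byte look, as the code in the question does.

```c
#include <stdint.h>

/* Hypothetical helper: same expression as haszero(v), returning the
   flag word. A non-zero result means v contains at least one 0x00 byte. */
static uint32_t zero_byte_flags(uint32_t v)
{
    return (v - 0x01010101u) & ~v & 0x80808080u;
}

/* Worked values:
   0x41424344 -> 0x00000000   no zero byte
   0x41420044 -> 0x00008000   zero byte flagged
   0x81FF8090 -> 0x00000000   bytes with the high bit set are masked off
   0x41410100 -> 0x00008080   lowest byte is zero; the 0x01 above it is
                              flagged too, because of the borrow */
```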

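Looking at one byte at a time costs at least as many CPU cycles as looking at a full, register-wide integer value. In this algorithm, full integers are checked to see whether they contain a zero byte. If they do not, only a few instructions have been spent and the loop can move on to the next full integer. If there is a zero byte inside, a further check finds its exact position.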
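
The two phases described above (a fast whole-word test, then a byte-wise search) can be sketched as follows. This is not glibc's code: `find_zero_wordwise` is a hypothetical helper that takes an explicit buffer length and loads words with `memcpy`, so it stays within standard C instead of relying on an aligned pointer cast.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: index of the first zero byte in buf[0..len),
   or len if there is none. */
static size_t find_zero_wordwise(const unsigned char *buf, size_t len)
{
    const uint64_t lomagic = 0x0101010101010101ull;
    const uint64_t himagic = 0x8080808080808080ull;
    size_t i = 0;

    /* Fast phase: test eight bytes per iteration. */
    while (len - i >= sizeof(uint64_t)) {
        uint64_t word;
        memcpy(&word, buf + i, sizeof word);   /* alignment-safe load */
        if (((word - lomagic) & ~word & himagic) != 0)
            break;                             /* this word holds a zero byte */
        i += sizeof word;
    }

    /* Slow phase: find the exact position; also covers the short tail. */
    while (i < len && buf[i] != 0)
        ++i;
    return i;
}
```

Called as `find_zero_wordwise((const unsigned char *)s, sizeof s)` on a char array `s` holding a terminated string, it should agree with `strlen(s)`.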

Answer 1
14, accepted, 2016-01-06 21:22:55
