python有soundex函数吗？

Question

Is there a soundex function for python and if not how would you go about making a soundex code?是否有用于 python 的 soundex 函数，如果没有，您将如何制作 soundex 代码？

Soundex
Code    Letters 
1   B, F, P, V  
2   C, G, J, K, Q, S, X, Z  
3   D, T    
4   L   
5   M, N    
6   R   
SKIP   A, E, H, I, O, U, W, Y, H, W, and Y

For example:例如：

Jackson = J250杰克逊 = J250

Washington = W252华盛顿 = W252

Clement = C455克莱门特 = C455

Ashcraft = A261 Ashcraft = A261

Wu = W000吴 = W000

Answer 1

Yes , you can use Fuzzy which is a python library implementing some phonetic algorithms.是的，您可以使用Fuzzy ，它是一个实现一些语音算法的 Python 库。

sudo pip install fuzzy

>>> import fuzzy
>>> soundex = fuzzy.Soundex(4)
>>> soundex("Jackson")
'J250'
>>> soundex("Washington")
'W252'
>>> soundex("Clement")
'C453'
>>> soundex("Ashcraft")
'A261'
>>> soundex("Wu")
'W000'

Answer 2

You can use jellyfish你可以用海蜇

sudo pip install jellyfish

print "Soundex\t\t=", jellyfish.soundex("Ala ma kaca")
>Soundex                = A452
#...
>Metaphone              = AL M KK
>NYSIIS                 = AL
>Match rating codex     = ALMKC

Answer 3

Use the below soundex() function directly without installing any package!直接使用下面的soundex()函数，无需安装任何包！

Snippet taken from package Jellyfish > _jellyfish.py摘自Jellyfish > _jellyfish.py包的片段

Examples例子

print(soundex('kent')) # K530
print(soundex('Paul')) # P400
print(soundex('amnesty')) # A523

Code代码

import unicodedata
def soundex(s):

    if not s:
        return ""

    s = unicodedata.normalize("NFKD", s)
    s = s.upper()

    replacements = (
        ("BFPV", "1"),
        ("CGJKQSXZ", "2"),
        ("DT", "3"),
        ("L", "4"),
        ("MN", "5"),
        ("R", "6"),
    )
    result = [s[0]]
    count = 1

    # find would-be replacment for first character
    for lset, sub in replacements:
        if s[0] in lset:
            last = sub
            break
    else:
        last = None

    for letter in s[1:]:
        for lset, sub in replacements:
            if letter in lset:
                if sub != last:
                    result.append(sub)
                    count += 1
                last = sub
                break
        else:
            if letter != "H" and letter != "W":
                # leave last alone if middle letter is H or W
                last = None
        if count == 4:
            break

    result += "0" * (4 - count)
    return "".join(result)

python有soundex函数吗？

问题描述

3 个解决方案

解决方案1
4 2017-01-01 12:11:38

解决方案2
3 2017-08-04 14:02:28

解决方案3
1 2021-04-21 14:27:56

python有soundex函数吗？

问题描述

3 个解决方案

解决方案1 4 2017-01-01 12:11:38

解决方案2 3 2017-08-04 14:02:28

解决方案3 1 2021-04-21 14:27:56

解决方案1
4 2017-01-01 12:11:38

解决方案2
3 2017-08-04 14:02:28

解决方案3
1 2021-04-21 14:27:56