Python 3：如何檢查字符串是否可以是有效變量？

Question

我有一個字符串，想檢查它是否可以用作有效變量而不會出現語法錯誤。 例如

def variableName(string):
    #if string is valid variable name:
        #return True
    #else:
        #return False

input >>> variableName("validVariable")
output >>> True
input >>> variableName("992variable")
output >>> False

我不想使用.isidentifier（）。 我想發揮自己的作用。

Answer 1

以下答案僅適用於“舊式” Python-2.7標識符；

"validVariable".isidentifier()
#True
"992variable".isidentifier()
#False

由於您在我發布答案后更改了問題，因此請考慮編寫正則表達式：

re.match(r"[_a-z]\w*$", yourstring,flags=re.I)

Answer 2

在Python 3中，有效的標識符可以包含ASCII范圍之外的字符，因為您不想使用str.isidentifier ，可以在Python中編寫自己的版本。

它的規范可以在這里找到： https : //www.python.org/dev/peps/pep-3131/#specification-of-language-changes

實現方式：

import keyword
import re
import unicodedata


def is_other_id_start(char):
    """
    Item belongs to Other_ID_Start in
    http://unicode.org/Public/UNIDATA/PropList.txt
    """
    return bool(re.match(r'[\u1885-\u1886\u2118\u212E\u309B-\u309C]', char))


def is_other_id_continue(char):
    """
    Item belongs to Other_ID_Continue in
    http://unicode.org/Public/UNIDATA/PropList.txt
    """
    return bool(re.match(r'[\u00B7\u0387\u1369-\u1371\u19DA]', char))


def is_xid_start(char):

    # ID_Start is defined as all characters having one of
    # the general categories uppercase letters(Lu), lowercase
    # letters(Ll), titlecase letters(Lt), modifier letters(Lm),
    # other letters(Lo), letter numbers(Nl), the underscore, and
    # characters carrying the Other_ID_Start property. XID_Start
    # then closes this set under normalization, by removing all
    # characters whose NFKC normalization is not of the form
    # ID_Start ID_Continue * anymore.

    category = unicodedata.category(char)
    return (
        category in {'Lu', 'Ll', 'Lt', 'Lm', 'Lo', 'Nl'} or
        is_other_id_start(char)
    )


def is_xid_continue(char):
    # ID_Continue is defined as all characters in ID_Start, plus
    # nonspacing marks (Mn), spacing combining marks (Mc), decimal
    # number (Nd), connector punctuations (Pc), and characters
    # carryig the Other_ID_Continue property. Again, XID_Continue
    # closes this set under NFKC-normalization; it also adds U+00B7
    # to support Catalan.

    category = unicodedata.category(char)
    return (
        is_xid_start(char) or
        category in {'Mn', 'Mc', 'Nd', 'Pc'} or
        is_other_id_continue(char)
    )


def is_valid_identifier(name):
    # All identifiers are converted into the normal form NFKC
    # while parsing; comparison of identifiers is based on NFKC.
    name = unicodedata.normalize(
        'NFKC', name
    )

    # check if it's a keyword
    if keyword.iskeyword(name):
        return False

    # The identifier syntax is <XID_Start> <XID_Continue>*.
    if not (is_xid_start(name[0]) or name[0] == '_'):
        return False

    return all(is_xid_continue(char) for char in name[1:])

if __name__ == '__main__':
    # From goo.gl/pvpYg6
    assert is_valid_identifier("a") is True
    assert is_valid_identifier("Z") is True
    assert is_valid_identifier("_") is True
    assert is_valid_identifier("b0") is True
    assert is_valid_identifier("bc") is True
    assert is_valid_identifier("b_") is True
    assert is_valid_identifier("µ") is True
    assert is_valid_identifier("𝔘𝔫𝔦𝔠𝔬𝔡𝔢") is True

    assert is_valid_identifier(" ") is False
    assert is_valid_identifier("[") is False
    assert is_valid_identifier("©") is False
    assert is_valid_identifier("0") is False

您可以在此處和此處分別檢查CPython和Pypy的實現。

Answer 3

您可以使用正則表達式。

例如：

isValidIdentifier = re.match("[A-Za-z_](0-9A-Za-z_)*",identifier)

請注意，他只檢查字母數字字符。 實際標准支持其他字符。 看到這里： https : //www.python.org/dev/peps/pep-3131/

您可能還需要排除諸如def，True，False等保留字...請參見此處： https ： //www.programiz.com/python-programming/keywords-identifier

Python 3：如何檢查字符串是否可以是有效變量？

問題描述

3 個解決方案

解決方案1
6 2018-03-17 02:00:34

解決方案2
3 2018-03-17 03:30:39

實現方式：

解決方案3
0 2018-03-17 02:26:10

Python 3：如何檢查字符串是否可以是有效變量？

問題描述

3 個解決方案

解決方案1 6 2018-03-17 02:00:34

解決方案2 3 2018-03-17 03:30:39

實現方式：

解決方案3 0 2018-03-17 02:26:10

解決方案1
6 2018-03-17 02:00:34

解決方案2
3 2018-03-17 03:30:39

解決方案3
0 2018-03-17 02:26:10