在Python3.3中从字符串中删除除字母和空格以外的所有内容

Question

I have this example string: happy t00 go 129.129 and I want to keep only the spaces and letters. 我有这个示例字符串： happy t00 go 129.129 ，我只想保留空格和字母。 All I have been able to come up with so far that is pretty efficient is: 到目前为止，我所能想到的是非常有效的：

print(re.sub("\d", "", 'happy t00 go 129.129'.replace('.', '')))

but it is only specific to my example string. 但这仅适用于我的示例字符串。 How can remove all characters other than letters and spaces? 如何删除字母和空格以外的所有字符？

Answer 1

whitelist = set('abcdefghijklmnopqrstuvwxyz ABCDEFGHIJKLMNOPQRSTUVWXYZ')
myStr = "happy t00 go 129.129$%^&*("
answer = ''.join(filter(whitelist.__contains__, myStr))

Output: 输出：

>>> answer
'happy t go '

Answer 2

使用一组补码：

re.sub(r'[^a-zA-Z ]+', '', 'happy t00 go 129.129')

Answer 3

Slight variation on inspectorG4dget's method - import from string & generator comprehension: inspectorG4dget方法的细微变化-从string和生成器理解中导入：

from string import ascii_letters

allowed = set(ascii_letters + ' ')
myStr = 'happy t00 go 129.129'
answer = ''.join(l for l in myStr if l in allowed)
answer
# >>> 'happy t go '

Performance comparison: 性能比较：

(I made myStr a bit longer and pre-compiled the regex to make things a bit more interesting) （我使myStr更长，并预编译了正则表达式，使事情变得更加有趣）

import re
from string import ascii_letters, digits
myStr = 'happy t00 go 129.129'*20
allowed = set(ascii_letters + ' ')

# Generator
%timeit answer = ''.join(l for l in myStr if l in allowed)

# filter/__contains__
%timeit answer = ''.join(filter(allowed.__contains__, myStr))

# Regex
pat = re.compile(r'[^a-zA-Z ]+')
%timeit answer = re.sub(pat, '', myStr)

53 µs ± 6.43 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each) 每个循环53 µs±6.43 µs（平均±标准偏差，共运行7次，每个循环10000个）
43.3 µs ± 7.48 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each) 每个回路43.3 µs±7.48 µs（平均±标准偏差，共运行7次，每个回路10000个）
26 µs ± 509 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each) 每个循环26 µs±509 ns（平均±标准偏差，共运行7次，每个循环10000个）

在Python3.3中从字符串中删除除字母和空格以外的所有内容

问题描述

3 个解决方案

解决方案1
15 已采纳 2014-02-04 22:15:18

解决方案2
10 2014-02-04 22:15:28

解决方案3
3 2018-03-26 07:46:08

Performance comparison: 性能比较：

在Python3.3中从字符串中删除除字母和空格以外的所有内容

问题描述

3 个解决方案

解决方案1 15 已采纳 2014-02-04 22:15:18

解决方案2 10 2014-02-04 22:15:28

解决方案3 3 2018-03-26 07:46:08

Performance comparison: 性能比较：

解决方案1
15 已采纳 2014-02-04 22:15:18

解决方案2
10 2014-02-04 22:15:28

解决方案3
3 2018-03-26 07:46:08