For example I have a sentence
"He is so .... cool!"
Then I remove all the punctuation and make it in a list.
["He", "is", "so", "", "cool"]
How do I remove or ignore the empty string?
You can use filter
, with None
as the key function, which filters out all elements which are False
ish (including empty strings)
>>> lst = ["He", "is", "so", "", "cool"]
>>> filter(None, lst)
['He', 'is', 'so', 'cool']
Note however, that filter
returns a list in Python 2, but a generator in Python 3. You will need to convert it into a list in Python 3, or use the list comprehension solution.
False
ish values include:
False
None
0
''
[]
()
# and all other empty containers
You can filter it like this
orig = ["He", "is", "so", "", "cool"]
result = [x for x in orig if x]
Or you can use filter
. In python 3 filter
returns a generator, thus list()
turns it into a list. This works also in python 2.7
result = list(filter(None, orig))
You can use a list comprehension:
cleaned = [x for x in your_list if x]
Although I would use regex to extract the words:
>>> import re
>>> sentence = 'This is some cool sentence with, spaces'
>>> re.findall(r'(\w+)', sentence)
['This', 'is', 'some', 'cool', 'sentence', 'with', 'spaces']
I'll give you the answer to the question you should have asked -- how to avoid the empty string altogether. I assume you do something like this to get your list:
>>> "He is so .... cool!".replace(".", "").split(" ")
['He', 'is', 'so', '', 'cool!']
The point is that you use .split(" ")
to split on space characters. However, if you leave out the argument to split
, this happens:
>>> "He is so .... cool!".replace(".", "").split()
['He', 'is', 'so', 'cool!']
Quoth the docs:
If sep is not specified or is None, a different splitting algorithm is applied: runs of consecutive whitespace are regarded as a single separator, and the result will contain no empty strings at the start or end if the string has leading or trailing whitespace.
So you really don't need to bother with the other answers (except Blender's, which is a totally different approach), because split can do the job for you!
>>> from string import punctuation
>>> text = "He is so .... cool!"
>>> [w.strip(punctuation) for w in text.split() if w.strip(punctuation)]
['He', 'is', 'so', 'cool']
You can filter out empty strings very easily using a list comprehension:
x = ["He", "is", "so", "", "cool"]
x = [str for str in x if str]
>>> ['He', 'is', 'so', 'cool']
Python 3 从filter
返回一个iterator
filter
,所以应该包含在对 list() 的调用中
str_list = list(filter(None, str_list)) # fastest
lst = ["He", "is", "so", "", "cool"]
lst = list(filter(str.strip, lst))
You can do this with a filter
.
a = ["He", "is", "so", "", "cool"]
filter(lambda s: len(s) > 0, a)
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.