如何提取字符串之前的单词？

Question

I have several strings like this: 我有几个这样的字符串：

mylist = ['pearsapple','grapevinesapple','sinkandapple'...]

I want to parse the parts before apple and then append to a new list: 我想在苹果之前解析这些部分，然后追加到新列表中：

new = ['pears','grapevines','sinkand']

Is there a way other than finding starting points of 'apple' in each string and then appending before the starting point? 除了在每个字符串中找到“苹果”的起点，然后在起点之前追加之外，还有其他方法吗？

Answer 1

By using slicing in combination with the index method of strings. 通过结合使用切片和字符串的index方法。

>>> [x[:x.index('apple')] for x in mylist]
['pears', 'grapevines', 'sinkand']

You could also use a regular expression 您也可以使用正则表达式

>>> import re
>>> [re.match('(.*?)apple', x).group(1) for x in mylist]
['pears', 'grapevines', 'sinkand']

I don't see why though. 我不明白为什么。

Answer 2

我希望apple这个词将是固定的（固定长度的词），然后我们可以使用：

second_list = [item[:-5] for item in mylist]

Answer 3

If some elements in the list don't contain 'apple' at the end of the string, this regex leaves the string untouched: 如果列表中的某些元素在字符串的末尾不包含'apple' ，则此正则表达式将使字符串保持不变：

>>> import re
>>> mylist = ['pearsapple','grapevinesapple','sinkandapple', 'test', 'grappled']
>>> [re.sub('apple$', '', word) for word in mylist]
['pears', 'grapevines', 'sinkand', 'test', 'grappled']

Answer 4

By also using string split and list comprehension 通过还使用字符串拆分和列表理解

new = [x.split('apple')[0] for x in mylist]
['pears', 'grapevines', 'sinkand']

Answer 5

One way to do it would be to iterate through every string in the list and then use the split() string function. 一种方法是遍历列表中的每个字符串，然后使用split()字符串函数。

for word in mylist:
    word = word.split("apple")[0]

如何提取字符串之前的单词？

问题描述

5 个解决方案

解决方案1
2 2017-05-23 13:21:14

解决方案2
2 2017-05-23 13:25:48

解决方案3
2 2017-05-23 13:28:56

解决方案4
1 已采纳 2017-05-23 13:24:02

解决方案5
0 2017-05-23 13:25:46

如何提取字符串之前的单词？

问题描述

5 个解决方案

解决方案1 2 2017-05-23 13:21:14

解决方案2 2 2017-05-23 13:25:48

解决方案3 2 2017-05-23 13:28:56

解决方案4 1 已采纳 2017-05-23 13:24:02

解决方案5 0 2017-05-23 13:25:46

解决方案1
2 2017-05-23 13:21:14

解决方案2
2 2017-05-23 13:25:48

解决方案3
2 2017-05-23 13:28:56

解决方案4
1 已采纳 2017-05-23 13:24:02

解决方案5
0 2017-05-23 13:25:46