pandas 删除特定单词之前的所有单词并获取该特定单词之后的前 n 个单词

Question

I have a dataframe like this:我有一个像这样的 dataframe：

df=pd.DataFrame({'caption':'hello this pack is for you: Jake Peralta. Thanks'})
df

caption
hello this pack is for you: Jake Peralta. Thanks
...
...
...

I'm trying to get the recipient's first and last name here.我正在尝试在这里获取收件人的名字和姓氏。 The format of the caption column is always the same.标题栏的格式始终相同。 So delete everything before for you: and get the first 2(this number may change) words after for you:因此，为您删除之前的所有内容：并为您获取后面的前 2 个（此数字可能会更改）单词：

Answer 1

Takes care of leading spaces in name:处理名称中的前导空格：

>>> df.caption.str.split(".").str[0].str.split(":").str[1].str.strip()

1    Jake Peralta
Name: caption, dtype: object

Answer 2

here is one way:这是一种方法：

df.caption.apply(lambda st: st[st.find(":")+2:st.find(".")])

output: output：

0     Jake Peralta
Name: caption, dtype: object

Answer 3

May be you can try like this也许你可以这样尝试

df['caption'].str.split("for you: ").str[1].str.split('.').str[0]

output: output：

0    Jake Peralta
1      first last

pandas 删除特定单词之前的所有单词并获取该特定单词之后的前 n 个单词

问题描述

3 个解决方案

解决方案1
1 2022-09-08 15:28:02

解决方案2
0 已采纳 2022-09-08 15:00:52

解决方案3
0 2022-09-08 15:12:54

pandas 删除特定单词之前的所有单词并获取该特定单词之后的前 n 个单词

问题描述

3 个解决方案

解决方案1 1 2022-09-08 15:28:02

解决方案2 0 已采纳 2022-09-08 15:00:52

解决方案3 0 2022-09-08 15:12:54

解决方案1
1 2022-09-08 15:28:02

解决方案2
0 已采纳 2022-09-08 15:00:52

解决方案3
0 2022-09-08 15:12:54