[英]Extracting last occurrence of a certain string from each row in a pandas series
from each row i want to extract the last occurrence of the word "user" + the number that follow right after it from a pandas series.从每一行中,我想从 pandas 系列中提取单词“user”的最后一次出现 + 紧随其后的数字。 everything else can be discarded.
其他一切都可以丢弃。 how would you perform this action?
您将如何执行此操作? thanks!!!
谢谢!!!
here's an example of the series:这是该系列的一个示例:
0 1 - Unassigned, 2 - User 397335
1 1 - Unassigned, 2 - User 525767, 3 - Unassigned
2 1 - Unassigned
3 1 - Unassigned
4 1 - Unassigned
...
163678 1 - Unassigned
163679 1 - Unassigned, 2 - User 347991, 3 - Unassigned
163680 1 - Unassigned
163681 1 - Unassigned
163682 1 - Unassigned, 2 - User 663455, 3 - Unassigned
Use str.findall
:使用
str.findall
:
>>> df['A'].str.findall(r'User \d+').str[-1]
0 User 397335
1 User 525767
2 NaN
3 NaN
4 NaN
163678 NaN
163679 User 347991
163680 NaN
163681 NaN
163682 User 663455
Name: A, dtype: object
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.