How to search a dataframe and extract a string into a new column if it matches a particular symbol?
I have a df that has non-readable alphanumeric values for column names.
I want to extract entire email addresses from any column (cell) that has the @ symbol in it. How would I search the entire dataframe for the @ symbol, regardless of column?
Input df:
a.1  b.2      c.1
aa   a@a.com  12
bb   b@b.com  29
cc
dd   d@d.com
Example df I want after extraction:
a.1  b.2      c.1  email
aa   a@a.com  12   a@a.com
bb   b@b.com  29   b@b.com
cc
dd   d@d.com       d@d.com
Using str.contains on a single known column:
# na=False treats empty cells as non-matches, so the mask stays strictly boolean
df['email'] = df.loc[df['b.2'].str.contains('@', na=False), 'b.2']
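A minimal, self-contained sketch of this single-column approach on the sample data from the question. How the frame is built, and the use of None for the empty cells, are assumptions about how the data was loaded. The na=False argument is an addition, so that missing cells count as "no match" instead of producing a mask that contains NaN.

```python
import pandas as pd

# Sample frame rebuilt from the question; None marks the empty cells (assumption).
df = pd.DataFrame({
    "a.1": ["aa", "bb", "cc", "dd"],
    "b.2": ["a@a.com", "b@b.com", None, "d@d.com"],
    "c.1": [12, 29, None, None],
})

# Boolean mask: True where the cell in column b.2 contains "@".
mask = df["b.2"].str.contains("@", na=False)

# Copy matching cells into the new column; non-matching rows stay NaN.
df["email"] = df.loc[mask, "b.2"]
print(df)
```

This only works when the column holding the addresses is known in advance; it does not search the other columns.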
To search every column regardless of its name, it seems like you need:
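# Cast to str so numeric columns can be searched, keep only matching cells, then take the last match in each row
df['email'] = df[df.astype(str).apply(lambda x: x.str.contains('@', na=False))].ffill(axis=1).iloc[:, -1]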
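As a hedged alternative to the one-liner above, a plain row-wise scan may be easier to read and also copes with numeric columns and with cells that hold extra text around the address. This is a different technique from the masking approach, not a restatement of it. The helper name first_email and the deliberately loose pattern EMAIL_RE are hypothetical and are not a full email validator; the sample frame is rebuilt from the question with None for the empty cells (assumption).

```python
import re

import pandas as pd

# Sample frame rebuilt from the question; None marks the empty cells (assumption).
df = pd.DataFrame({
    "a.1": ["aa", "bb", "cc", "dd"],
    "b.2": ["a@a.com", "b@b.com", None, "d@d.com"],
    "c.1": [12, 29, None, None],
})

# Hypothetical loose pattern: non-space, non-"@" characters on both sides of one "@".
EMAIL_RE = re.compile(r"[^\s@]+@[^\s@]+")


def first_email(row):
    """Return the first email-like substring found in any cell of the row, else None."""
    for value in row:
        # Skip numbers and missing values; only strings can hold an address.
        if isinstance(value, str):
            match = EMAIL_RE.search(value)
            if match:
                return match.group(0)
    return None


# axis=1 passes each row to the helper, so the column names never matter.
df["email"] = df.apply(first_email, axis=1)
print(df)
```

A row-wise apply is slower than vectorised string methods on large frames, so this sketch trades speed for clarity.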