[英]How to take part of string value from column in DataFrame in Python Pandas?
[英]Python 2.7: How to identify unique string from string in pandas dataframe and print designated value in a specified column based on the result?
dataframe = pd.DataFrame([['Age is 83,sex is man'],
['sex is woman,age is 74']],
columns=['info'])
mw = dataframe['info'].str.extract(r'sex is (woman|man)', expand=False)
pd.concat([dataframe, pd.get_dummies(mw).astype(int)], axis=1)
dataframe['man'] = dataframe['info'].str.match(r'^.*sex is man.*$', re.I).astype(int)
dataframe['woman'] = dataframe['info'].str.match(r'^.*sex is woman.*$', re.I).astype(int)
dataframe
你可能也覺得這很有意思。
這有效
import string
df['woman'] = df['info'].map(lambda x: x.translate(None, string.punctuation)).map(lambda x: 1 if 'woman' in x.lower().split() else 0)
df['man'] = df['info'].map(lambda x: x.translate(None, string.punctuation)).map(lambda x: 1 if 'man' in x.lower().split() else 0)
df
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.