[英]filter pandas.DataFrame with python list in cell
I have pasdas.DataFrame like this:我有这样的 pasdas.DataFrame:
import pandas as pd
data = {'name' : ['Alice', 'Bob', 'Eve'],
'age' : ['20', '35', '40'],
'stuff' : [['computer', 'phone', 'bike'], ['bike', 'skateboard', 'phone'],
['computer', 'phone', 'skateboard']]}
frame = pd.DataFrame(data)
How can I select rows where age > 30 and stuff contains 'computer'?如何选择年龄 > 30 且内容包含“计算机”的行?
I've tried solve this with DataFrame.loc :我试过用 DataFrame.loc 解决这个问题:
filteredFrame = frame.loc[(frame.age > 30)&('computer' in frame.stuff)]
But it doesn't work但它不起作用
First convert column year
to numbers, if necessary:如有必要,首先将列year
转换为数字:
frame.age = frame.age.astype(int)
Use Series.map
or Series.apply
:使用Series.map
或Series.apply
:
filteredFrame = frame.loc[(frame.age > 30)&(frame.stuff.map(lambda x: 'computer' in x))]
filteredFrame = frame.loc[(frame.age > 30)&(frame.stuff.apply(lambda x: 'computer' in x))]
Or list comprehension:或列表理解:
filteredFrame = frame.loc[(frame.age > 30)&(['computer' in x for x in frame.stuff])]
print (filteredFrame)
name age stuff
2 Eve 40 [computer, phone, skateboard]
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.