![](/img/trans.png)
[英]Select rows from Pandas dataframe where a specific column contains numbers
[英]Pandas dataframe - Select rows where one column's values contains a string and another column's values starts with specific strings
我希望选择state
包含单词Traded的行,而trading _book
不以字母'E','L','N'开头
Test_Data = [('originating_system_id', ['RBCL', 'RBCL', 'RBCL','RBCL']),
('rbc_security_type1', ['CORP', 'CORP','CORP','CORP']),
('state', ['Traded', 'Traded Away','Traded','Traded Away']),
('trading_book', ['LCAAAAA','NUBBBBB','EDFGSFG','PDFEFGR'])
]
dfTest_Data = pd.DataFrame.from_items(Test_Data)
display(dfTest_Data)
originating_system_id rbc_security_type1 state trading_book
RBCL CORP Traded LCAAAAA
RBCL CORP Traded Away NUBBBBB
RBCL CORP Traded EDFGSFG
RBCL CORP Traded Away PDFEFGR
期望的输出:
originating_system_id rbc_security_type1 state trading_book
RBCL CORP Traded Away PDFEFGR
我虽然会这样做:
prefixes = ['E','L','N']
df_Traded_Away_User = dfTest_Data[
dfTest_Data[~dfTest_Data['trading_book'].str.startswith(tuple(prefixes))] &
(dfTest_Data['state'].str.contains('Traded'))
][['originating_system_id','rbc_security_type1','state','trading_book']]
display(df_Traded_Away_User)
但我得到了:
ValueError: Must pass DataFrame with boolean values only
我建议分别创建每个布尔掩码以获得更好的可读代码,然后用&
链接它们:
prefixes = ['E','L','N']
m1 = ~dfTest_Data['trading_book'].str.startswith(tuple(prefixes))
m2 = dfTest_Data['state'].str.contains('Traded')
cols = ['originating_system_id','rbc_security_type1','state','trading_book']
df_Traded_Away_User = dfTest_Data.loc[m1 & m2, cols]
print (df_Traded_Away_User)
originating_system_id rbc_security_type1 state trading_book
3 RBCL CORP Traded Away PDFEFGR
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.