根据从另一个数据框构建的条件列表，选择一个熊猫数据框的子集

Question

Suppose we have the following DataFrame 假设我们有以下DataFrame

>>> import pandas as pd

>>> df_org = pd.DataFrame({'A' : [1,2,3,4,5,6], 
                           'B' : [1,1,1,1,2,2],
                           'C' : [1,2,3,4,1,2]})
   A  B  C
0  1  1  1
1  2  1  2
2  3  1  3
3  4  1  4
4  5  2  1
5  6  2  2

And this another one, df_criteria , that has some of the columns of df_org and from which we will build our criteria. 而这一个又一个， df_criteria ，有一些列的df_org并从其中我们将建立我们的标准。 For instance: 例如：

>>> df_criteria = pd.DataFrame({'B' : [1,2], 
                                'C' : [1,1]}) 

   B  C
0  1  1
1  2  1

I'd like to be able to fetch the value of A in the df_org DataFrame for which the corresponding values of the B and C match the ones listed in the df_criteria DataFrame. 我希望能够在df_org帧中获取A的值， df_org B和C的对应值与df_criteria帧中列出的df_criteria匹配。 In this examples, I would like to have a subset of df_org that contains its rows '0' and '4', like so: 在此示例中，我想要一个df_org的子集，其中包含其行“ 0”和“ 4”，如下所示：

   A  B  C
0  1  1  1
4  5  2  1

Being a newbie in pandas, the way I've implemented this is using the for -loop mindset: by iterating over the rows of df_criteria and querying df_org for each row. 作为熊猫的新手，我实现此目标的方法是使用for -loop思维方式：通过遍历df_criteria的行并为每行查询df_org 。 However, this is very slow and I have the impression that there must be a more pythonic (and faster) way that does not make use of for -loops. 但是，这非常慢，我的印象是必须有一种不使用for -loops的更pythonic（且更快）的方式。 I've also explored the use of DataFrame.lookup , however it is not useful in my case because the indices in df_criteria and df_org do not necessarily match. 我还探讨了DataFrame.lookup ，但是在我的情况下它没有用，因为df_criteria和df_org的索引不一定匹配。

Any suggestion would be very much appreciated. 任何建议将不胜感激。 Many thanks! 非常感谢！

Answer 1

A simple inner merge would work: 一个简单的内部合并将起作用：

In [285]:

df_org.merge(df_criteria, on=['B','C'])
Out[285]:
   A  B  C
0  1  1  1
1  5  2  1

根据从另一个数据框构建的条件列表，选择一个熊猫数据框的子集

问题描述

1 个解决方案

解决方案1
7 已采纳 2014-09-19 15:22:44

根据从另一个数据框构建的条件列表，选择一个熊猫数据框的子集

问题描述

1 个解决方案

解决方案1 7 已采纳 2014-09-19 15:22:44

解决方案1
7 已采纳 2014-09-19 15:22:44