Pandas DataFrame行明智比较

Question

I have a pandas DataFrame like following. 我有一个像下面这样的pandas DataFrame 。

       id  label_x label_y
0        1    F    R
1        2    F    F
2        3    F    F
3        4    F    F
4        5    F    F

Now I want to count occurrences of label_x and label_y are equal and not equal. 现在我想计算label_x和label_y的出现次数是否相等而不相等。 In this case there is only one occurrence of not equal and 4 occurrences of equal. 在这种情况下，只有一次出现不相等且出现次数相等。

df = pd.DataFrame({'id' : ["1","2","3","4","5"],
                'label_x'  : ["F","F","F","F","F"], 'label_y' : ["R","F","F","F","F"]})

Answer 1

(df.label_x == df.label_y).value_counts()

Many ways to to that, including the above... 许多方法，包括上述......

In [43]: (df.label_x == df.label_y).value_counts()
Out[43]:
True     4
False    1
dtype: int64

Answer 2

I came up with this solution. 我想出了这个解决方案。 Is that the best one? 这是最好的吗？

def compare(x):
    if x[1] == x[2]:
        return 'yes'
    else:
        return 'no'

df['result'] =  df.apply(compare, axis=1)

df2 = pd.DataFrame({'count' : df.groupby( ["result"] ).size()}).reset_index()

Pandas DataFrame行明智比较

问题描述

2 个解决方案

解决方案1
2 已采纳 2014-11-18 07:21:05

解决方案2
1 2014-11-18 07:22:43

Pandas DataFrame行明智比较

问题描述

2 个解决方案

解决方案1 2 已采纳 2014-11-18 07:21:05

解决方案2 1 2014-11-18 07:22:43

解决方案1
2 已采纳 2014-11-18 07:21:05

解决方案2
1 2014-11-18 07:22:43