遍历 Panda dataframe 中的多个列并找到计数唯一值

Question

I am working with a dataset which looks like below:我正在使用如下所示的数据集：

I have imported this dataset to my code using the panda library.我已使用熊猫库将此数据集导入到我的代码中。 My goal is to find unique entries of the programming languages from columns 2, 3, 4. I wish the output to be:我的目标是从第 2、3、4 列中找到编程语言的唯一条目。我希望 output 是：

    Python 4
    Perl 3
    C++ 3
....

Any leads would be helpful任何线索都会有所帮助

Answer 1

Use DataFrame.filter with DataFrame.stack and Series.value_counts :将DataFrame.filter与DataFrame.stack和Series.value_counts一起使用：

s = df.filter(like='Language').stack().value_counts()

Answer 2

This is an alternative way这是另一种方法

df['lang1'].value_counts() + df['lang2'].value_counts() + df['lang3'].value_counts()

or要么

cols = ['lang1', 'lang2', 'lang2']
sum([df[col].value_counts() for col in cols])

遍历 Panda dataframe 中的多个列并找到计数唯一值

问题描述

2 个解决方案

解决方案1
0 已采纳 2020-10-28 13:28:26

解决方案2
0 2020-10-28 13:40:47

遍历 Panda dataframe 中的多个列并找到计数唯一值

问题描述

2 个解决方案

解决方案1 0 已采纳 2020-10-28 13:28:26

解决方案2 0 2020-10-28 13:40:47

解决方案1
0 已采纳 2020-10-28 13:28:26

解决方案2
0 2020-10-28 13:40:47