计算每列的不同值，返回数据框，并对值进行排序

Question

This is my code: 这是我的代码：

for column in list(data.columns):
    pd.DataFrame(columns=column, data=data.columns.nunique()).sort_values()

This doesn't work...... 这行不通......

Either to define a function or use a for loop, how can I make it simple to check the unique value number of each column? 定义函数还是使用for循环，如何使检查每一列的唯一值编号变得简单？

Answer 1

There are many issues with your code, but the two main ones: 您的代码有很多问题，但有两个主要问题：

You don't store, print, or append to a list the pd.DataFrame objects you create. 您不存储，打印或将创建的pd.DataFrame对象添加到列表中。
You are applying nunique to column labels, not to the data within your dataframe. 您将nunique应用于列标签，而不是数据nunique的数据。

Here you can use nunique directly with your dataframe, then sort_values : 在这里，您可以直接在数据nunique使用nunique ，然后再使用sort_values ：

np.random.seed(0)
df = pd.DataFrame(np.random.randint(0, 10, (5, 5)))

res = df.nunique()\
        .sort_values(ascending=False)\
        .rename('Count')\
        .to_frame()

print(res)

   Count
1      5
3      4
2      4
0      4
4      3

计算每列的不同值，返回数据框，并对值进行排序

问题描述

1 个解决方案

解决方案1
2 已采纳 2018-11-17 22:36:55

计算每列的不同值，返回数据框，并对值进行排序

问题描述

1 个解决方案

解决方案1 2 已采纳 2018-11-17 22:36:55

解决方案1
2 已采纳 2018-11-17 22:36:55