简体   繁体   English

Pandas DataFrame:如何将二进制列转换为一个分类列?

[英]Pandas DataFrame: How to convert binary columns into one categorical column?

Given a pandas DataFrame, how does one convert several binary columns (where 1 denotes the value exists, 0 denotes it doesn't) into a single categorical column? 给定一个熊猫DataFrame,如何将几个二进制列(其中1表示该值存在,0表示不存在)转换为单个分类列?

Another way to think of this is how to perform the "reverse pd.get_dummies() "? 另一种思考方式是如何执行“ reverse pd.get_dummies() ”?

Here is an example of converting a categorical column into several binary columns: 这是将分类列转换为几个二进制列的示例:

import pandas as pd
s = pd.Series(list('ABCDAB'))
df = pd.get_dummies(s)
df
   A  B  C  D
0  1  0  0  0
1  0  1  0  0
2  0  0  1  0
3  0  0  0  1
4  1  0  0  0
5  0  1  0  0

What I would like to accomplish is given a dataframe 我想完成的是给出一个数据框

df1
   A  B  C  D
0  1  0  0  0
1  0  1  0  0
2  0  0  1  0
3  0  0  0  1
4  1  0  0  0
5  0  1  0  0

could do I convert it into 我可以把它转换成

df1
   A  B  C  D   category
0  1  0  0  0   A
1  0  1  0  0   B
2  0  0  1  0   C
3  0  0  0  1   D
4  1  0  0  0   A
5  0  1  0  0   B

One way would be to use idxmax to find the 1s: 一种方法是使用idxmax查找1:

In [32]: df["category"] = df.idxmax(axis=1)

In [33]: df
Out[33]: 
   A  B  C  D category
0  1  0  0  0        A
1  0  1  0  0        B
2  0  0  1  0        C
3  0  0  0  1        D
4  1  0  0  0        A
5  0  1  0  0        B

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM