用熊猫将多行转置为列

Question

我在 jupyter notebook 中用 pandas 读取了这个 excel 表。 我想将表格的上行侧融合为列。 该表如下所示：

                              ori code   cgk       cgk    clg       clg
                              ori city   jakarta   NaN    cilegon   NaN
                              ori prop   jakarta   NaN    banten    NaN
                              ori area   jawa      NaN    jawa      NaN

code    city       district   island    type a     days    type b    days
001     jakarta    jakarta    jawa      12000       2      13000      3
002     surabaya   surabaya   jawa      13000       3      14000      4

我意识到 df.melt 应该用来转置上面的行，但是type & days列，还有 4 行和上面的 NaN 值让我对如何正确地做到这一点感到困惑。

我需要的愿望清洁数据框如下：

code   city        district    island    type       price_type   days    ori_code   ori_city     ori_prop   ori_area
001    jakarta     jakarta     jawa      type a     12000         2         cgk     jakarta    jakarta      jawa
001    jakarta     jakarta     jawa      type b     13000         3         clg     cilegon    banten       jawa 
002    surabaya    surabaya    jawa      type a     13000         3         cgk     jakarta    jakarta      jawa
002    surabaya    surabaya    jawa      type b     14000         4         clg     cilegon    banten       jawa

ori_code, ori_city, ori_prop, ori_area将成为列名。

到目前为止，我所做的是设置固定索引名称，即代码、城市、地区和岛屿。

df = df.set_index(['code','city','district','island'])

谁能帮我解决这个问题？ 任何帮助将不胜感激。 先感谢您。

Answer 1

为此，您可以像这样使用 pandas melt 函数：

import pandas as pd

# Set the index for the DataFrame
df = df.set_index(['code', 'city', 'district', 'island'])

# Use pd.melt to reshape the data
df = pd.melt(df, id_vars=['code', 'city', 'district', 'island'], var_name='type', value_name='price_type')

# Split the 'type' column into two columns: 'type' and 'days'
df[['type', 'days']] = df['type'].str.split(' ', expand=True)

# Drop the 'ori code', 'ori city', 'ori prop', and 'ori area' columns
df = df.drop(columns=['ori code', 'ori city', 'ori prop', 'ori area'])

# Reorder the columns
df = df[['code', 'city', 'district', 'island', 'type', 'price_type', 'days', 'ori_code', 'ori_city', 'ori_prop', 'ori_area']]

# Display the resulting DataFrame
print(df)

用熊猫将多行转置为列

问题描述

1 个解决方案

解决方案1
1 2022-12-21 04:22:19

用熊猫将多行转置为列

问题描述

1 个解决方案

解决方案1 1 2022-12-21 04:22:19

解决方案1
1 2022-12-21 04:22:19