用拆分列替換數據框列

Question

拆分后如何用列替換數據框列？ 我知道如何拆分列，但不知道如何用拆分值列替換它。

輸入：

import pandas as pd

df = pd.DataFrame({'id': [101, 102],
                   'full_name': ['John Brown', 'Bob Smith'],
                   'birth_year': [1960, 1970]})
df_new = df['full_name'].str.split(" ", expand=True)
print(df)
print(df_new)

輸出：

    id   full_name  birth_year
0  101  John Brown        1960
1  102   Bob Smith        1970
      0      1
0  John  Brown
1   Bob  Smith

預期輸出：

    id first_name last_name  birth_year
0  101       John     Brown        1960
1  102        Bob     Smith        1970

Answer 1

df.join(df.full_name.str.split('\s', expand = True) \
                                    .set_axis(['first_name', 'last_name'], axis = 1)) \
                                                [['id', 'first_name', 'last_name', 'birth_year']]

輸出：

    id   full_name  birth_year
0  101  John Brown        1960
1  102   Bob Smith        1970

Answer 2

策略是獲取您希望替換的列的位置，創建新列，並根據您希望替換的列的位置連接新舊數據框：

#get the position of the column to be replaced
col_position = df.columns.get_loc('full_name')

#create new dataframe that holds the new columns
insert_df = (df
            .pop('full_name')
            .str.split(expand=True)
            .set_axis(['first_name','last_name'],axis='columns')
            )

df_by_positions = (#this is the dataframe before col_position
                   [df.iloc[:,:col_position],
                   #this is the dataframe we are inserting
                   insert_df,
                  #this is the dataframe after col_position
                  df.iloc[:,col_position:]
                  ]
                  )

pd.concat(df_by_positions,axis=1)

     id first_name  last_name   birth_year
0   101   John       Brown       1960
1   102   Bob        Smith       1970

用拆分列替換數據框列

問題描述

2 個解決方案

解決方案1
1 2020-03-30 19:41:40

解決方案2
1 已采納 2020-03-30 22:33:07

用拆分列替換數據框列

問題描述

2 個解決方案

解決方案1 1 2020-03-30 19:41:40

解決方案2 1 已采納 2020-03-30 22:33:07

解決方案1
1 2020-03-30 19:41:40

解決方案2
1 已采納 2020-03-30 22:33:07