使用 pandas 连接向数据框添加一列

Question

I have "train_df" data frame which:我有“train_df”数据框，其中：

print(train_df.shape)

returns (997, 600).返回 (997, 600)。

now I want to concatenate a column to this data frame which:现在我想将一列连接到此数据框，其中：

print(len(local_df["target"]))

returns 997.返回 997。

so it seems that everything is ok with the dimensions.所以看起来尺寸一切正常。

but the problem is that:但问题是：

final_df = pd.concat([train_df, local_df["target"]], axis=1)
print(final_df.shape)

returns (1000, 601).返回 (1000, 601)。 while it should be (997, 601).而它应该是 (997, 601)。

Do you know what is the problem?你知道问题出在哪里吗？

Answer 1

I think problem is with different index values, so solution is create same by reset_index with parameter drop=True :我认为问题出在不同的索引值上，所以解决方案是通过reset_index和参数drop=True创建相同的：

final_df = pd.concat([train_df.reset_index(drop=True), 
                     local_df["target"].reset_index(drop=True)], axis=1)
print(final_df.shape)

Or set index of local_df by train_df.index :或者通过train_df.index设置local_df的train_df.index ：

final_df = pd.concat([train_df, 
                     local_df["target"].set_index(train_df.index)], axis=1)
print(final_df.shape)

Answer 2

You can assign a numpy array as a new column.您可以assign numpy 数组assign为新列。

final_df = train_df.assign(target=local_df["target"].values)

For pandas >= 0.24,对于 >= 0.24 的熊猫，

final_df = train_df.assign(target=local_df["target"].to_numpy())

Answer 3

How about join?:加盟怎么样？：

import pandas as pd
df=pd.DataFrame({'a':[1,2,3],'b':[4,5,6]})
df2=pd.DataFrame({'c':[232,543,562]})
print(df.reset_index(drop=True).join(df2.reset_index(drop=True), how='left'))

Output:输出：

   a  b    c
0  1  4  232
1  2  5  543
2  3  6  562

Answer 4

Not sure if this is most efficient不确定这是否最有效

Adding a new column y to a dataframe df from another dataframe df2 which has this column y从另一个具有此列y的 dataframe df2向 dataframe df添加一个新列y

 df = df.assign(y=df2["y"].reset_index(drop=True))

使用 pandas 连接向数据框添加一列

问题描述

4 个解决方案

解决方案1
2 2018-08-03 07:46:38

解决方案2
2 已采纳 2018-08-03 07:47:57

解决方案3
0 2018-08-03 07:51:45

解决方案4
0 2022-04-28 12:04:43

使用 pandas 连接向数据框添加一列

问题描述

4 个解决方案

解决方案1 2 2018-08-03 07:46:38

解决方案2 2 已采纳 2018-08-03 07:47:57

解决方案3 0 2018-08-03 07:51:45

解决方案4 0 2022-04-28 12:04:43

解决方案1
2 2018-08-03 07:46:38

解决方案2
2 已采纳 2018-08-03 07:47:57

解决方案3
0 2018-08-03 07:51:45

解决方案4
0 2022-04-28 12:04:43