使用来自另一个数据帧中匹配索引的值设置数据帧列

Question

I would like to set values in col2 of DF1 using the value held at the matching index of col2 in DF2 :我想使用保存在DF2中col2匹配索引处的值在DF1 col2中设置值：

DF1 : DF1 ：

         col1    col2
index
    0       a
    1       b
    2       c
    3       d
    4       e
    5       f

DF2 : DF2 ：

         col1    col2
index
    2       a      x
    3       d      y
    5       f      z

DF3 : DF3 ：

         col1    col2
index
    0       a     NaN
    1       b     NaN
    2       c       x
    3       d       y
    4       e     NaN
    5       f       z

If I just try and set DF1['col2'] = DF2['col2'] then col2 comes out as all NaN values in DF3 - I take it this is because the indices are different.如果我只是尝试设置DF1['col2'] = DF2['col2']然后col2作为DF3所有NaN值出现 - 我认为这是因为索引不同。 However when I try and use map() to do something like:但是，当我尝试使用map()执行以下操作时：

DF1.index.to_series().map(DF2['col2'])

then I still get the same NaN column, but I thought it would map the values over where the index matches...然后我仍然得到相同的NaN列，但我认为它会将值映射到索引匹配的位置...

What am I not getting?我没有得到什么？

Answer 1

You need join or assign :您需要join或assign ：

df = df1.join(df2['col2'])
print (df)
      col1 col2
index          
0        a  NaN
1        b  NaN
2        c    x
3        d    y
4        e  NaN
5        f    z

Or:或者：

df1 = df1.assign(col2=df2['col2']) 
#same like
#df1['col2'] = df2['col2']
print (df1)

      col1 col2
index          
0        a  NaN
1        b  NaN
2        c    x
3        d    y
4        e  NaN
5        f    z

If no match and all values are NaN s check if indices have same dtype in both df :如果不匹配且所有值都是NaN请检查索引在两个df是否具有相同的数据类型：

print (df1.index.dtype)
print (df2.index.dtype)

If not, then use astype:如果没有，则使用 astype：

df1.index = df1.index.astype(int)
df2.index = df2.index.astype(int)

Bad solution (check index 2):错误的解决方案（检查索引 2）：

df = df2.combine_first(df1)
print (df)
      col1 col2
index          
0        a  NaN
1        b  NaN
2        a    x
3        d    y
4        e  NaN
5        f    z

Answer 2

You can simply concat as you are combining based on index您可以在根据索引进行组合时简单地进行连接

df = pd.concat([df1['col1'], df2['col2']],axis = 1)

        col1    col2
index       
0       a   NaN
1       b   NaN
2       c   x
3       d   y
4       e   NaN
5       f   z

使用来自另一个数据帧中匹配索引的值设置数据帧列

问题描述

2 个解决方案

解决方案1
8 已采纳 2017-10-23 18:30:07

解决方案2
1 2017-10-23 18:33:31

使用来自另一个数据帧中匹配索引的值设置数据帧列

问题描述

2 个解决方案

解决方案1 8 已采纳 2017-10-23 18:30:07

解决方案2 1 2017-10-23 18:33:31

解决方案1
8 已采纳 2017-10-23 18:30:07

解决方案2
1 2017-10-23 18:33:31