通过替换不同列中的nans来合并数据框内的行

Question

I have a df: 我有一个df：

df = pd.DataFrame([[1, np.nan, "filled", 3], [1, "filled", np.nan, 3], [1, "filled", np.nan, 4]], columns = ["a", "b", "c", "d"])
    a   b   c   d
0   1   NaN filled  3
1   1   filled  NaN 3
2   1   filled  NaN 4

And my end result should be: 我的最终结果应该是：

df = pd.DataFrame([[1, "filled", "filled", 3], [1, "filled", np.nan, 4]], columns = ["a", "b", "c", "d"])
    a   b   c   d
0   1   filled  filled  3
1   1   filled  NaN 4

So I want to merge the rows that are identical in all respects other than the column b and c. 所以我想合并除了列b和c以外的所有方面相同的行。 The issue is that not always there will be a another row identical except for columns b and c. 问题是除了列b和c之外，并不总是会有另一行相同。

Can't think how to use df.groupby(["a", "d"]).apply() to get what I want. 想不出怎么用df.groupby(["a", "d"]).apply()得到我想要的东西。

Answer 1

You can check with groupby + first , it will select the first not NaN value as output 您可以检查groupby + first ，它会选择先不要NaN值作为输出

df.groupby(['a','d'],as_index=False).first()
Out[897]: 
   a  d       b       c
0  1  3  filled  filled
1  1  4  filled     NaN

通过替换不同列中的nans来合并数据框内的行

问题描述

1 个解决方案

解决方案1
3 已采纳 2018-12-04 16:06:18

通过替换不同列中的nans来合并数据框内的行

问题描述

1 个解决方案

解决方案1 3 已采纳 2018-12-04 16:06:18

解决方案1
3 已采纳 2018-12-04 16:06:18