Fill consecutive NaN in a dataframe column

Question

I have a dataframe with a column C. I want to fill each run of consecutive blanks with the same number, because later I need to group those rows.

For example:

A B C
1 2 NaN
1 2 NaN
1 2 3
1 2 NaN
1 2 NaN

I want each run of consecutive NaN values filled with one shared number. I tried using shift() to compare, but didn't get the desired output.

Answer 1

You can use fillna with a new Series built by applying cumsum to a boolean mask:

df['C'] = df['C'].fillna(df['C'].notnull().cumsum() + 1)

print (df)
   A  B    C
0  1  2  1.0
1  1  2  1.0
2  1  2  3.0
3  1  2  2.0
4  1  2  2.0

Detail:

print (df['C'].notnull().cumsum())
0    0
1    0
2    1
3    1
4    1
Name: C, dtype: int32
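
To reproduce the result above end to end, here is a minimal self-contained sketch. The frame construction and the helper name fill_nan_runs are assumptions made for illustration; only the fillna/cumsum expression comes from the answer. Keep in mind that the fill values are just counters of the non-null values seen so far, so they may coincide with real values already present in the column.

```python
import numpy as np
import pandas as pd


def fill_nan_runs(s):
    """Fill each NaN with 1 + the count of non-null values seen before it.

    Hypothetical helper name; the body is the expression from the answer.
    """
    return s.fillna(s.notnull().cumsum() + 1)


# Sample frame rebuilt from the question (an assumption about its exact dtypes).
df = pd.DataFrame({
    "A": [1, 1, 1, 1, 1],
    "B": [2, 2, 2, 2, 2],
    "C": [np.nan, np.nan, 3, np.nan, np.nan],
})

filled = fill_nan_runs(df["C"])
print(filled.tolist())  # [1.0, 1.0, 3.0, 2.0, 2.0]

# Each run of former NaN rows now shares one key, so it can be grouped.
sizes = df.assign(C=filled).groupby("C").size()
print(sizes.to_dict())  # {1.0: 2, 2.0: 2, 3.0: 1}
```

If a counter value could clash with real data in C, one option is to keep the counter in a separate column and group on that instead of overwriting C.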

Answer 2

The function fillna is your solution:

dataframe['yourColumn'] = dataframe['yourColumn'].fillna(1)

Moreover, you can substitute any value you like for the NaN values. For instance, you could use the mean:

dataframe['yourColumn'] = dataframe['yourColumn'].fillna(dataframe['yourColumn'].mean())
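
As a rough illustration of the two fills described here, the sketch below applies them to a column shaped like the one in the question. The sample data is an assumption. fillna returns a new Series, so the result is simply assigned back. Note that a single constant (or the mean) gives every NaN the same value, so separate runs of blanks are not distinguished from each other.

```python
import numpy as np
import pandas as pd

# Sample column modelled on the question's column C (assumed data).
dataframe = pd.DataFrame({"yourColumn": [np.nan, np.nan, 3, np.nan, np.nan]})

# Constant fill: every NaN becomes 1.
constant_filled = dataframe["yourColumn"].fillna(1)
print(constant_filled.tolist())  # [1.0, 1.0, 3.0, 1.0, 1.0]

# Mean fill: mean() skips NaN by default, so the mean here is 3.0.
mean_filled = dataframe["yourColumn"].fillna(dataframe["yourColumn"].mean())
print(mean_filled.tolist())  # [3.0, 3.0, 3.0, 3.0, 3.0]
```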
