Python: how to fill up the mean value referencing from another dataframe column

Question

I have a housing dataframe:

where there are missing values in the Price column. I wish to fill the missing values by the mean price in the respective suburb.

This is my code for filling up the mean price by the same column:

all_housing_df['Price'].fillna(all_housing_df['Price'].mean())

How to fill in the mean price by the respective suburb?

Answer 1

You can use transform to fill missing values with the full list after grouping by Suburb

all_housing_df["Price"].fillna(all_housing_df.groupby("Suburb")["Price"].transform("mean"))

Answer 2

You can group by Suburb , get the mean Price and save this as a dictionary to conditionally replace null values.

# Create dictionary for NaN values
nan_dict = all_housing_df.groupby('Suburb')['Price'].mean().to_dict()

# Replace NaN with dictionary
all_housing_df['Price'].fillna(all_housing_df['Suburb'].map(nan_dict))

Python: how to fill up the mean value referencing from another dataframe column

Question

2 answers

solution1
1 2021-04-11 06:07:44

solution2
1 2021-04-11 06:13:27

Python: how to fill up the mean value referencing from another dataframe column

Question

2 answers

solution1 1 2021-04-11 06:07:44

solution2 1 2021-04-11 06:13:27

solution1
1 2021-04-11 06:07:44

solution2
1 2021-04-11 06:13:27