Remove specific words string (both prefix and postfix) from a pandas dataframe column

Question

I want to keep the latest rows with the same ID and also the rows that match certain column values.

Sample Input:

ID                  Address
1                   PALLABI- F #1st Floor, SEC #10, Pallabi, MIRPUR
2                   H#22(2nd floor),Extended Rupnagar Area, Pallabi Mirpur, Dhaka.
3                   Uttar khan-F #3rd floor, Kuripara, Dhaka
4                   F-1,H-43,Chalabon,D.khan, Uttarkhan

PREFIX

ID 1 has a word PALLABI- i want to remove that part. Similar goes for ID 3 where Uttar khan- should be removed. After removal Uttar khan or PALLABI part should add to the postfix of the string only if the prefix doesn't contain any of these words.

POSTFIX

Another part is to remove Dhaka at the end of the string.

Output:

ID                  Address
1                   F #1st Floor, SEC #10, Pallabi, MIRPUR
2                   H#22(2nd floor),Extended Rupnagar Area, Pallabi Mirpur
3                   F #3rd floor, Kuripara, Uttar khan
4                   F-1,H-43,Chalabon,D.khan, Uttarkhan

Thanks in advance.

Answer 1

You can apply a function to the column;

def my_function(string): 
    # if your word is the first in the string
    if 'your_word' == string.split()[0]:
        # replace it with your requirement (it can be nothing (''))
        string = string.replace('what/you/need/to/replace', 'replacement') 

    return string

df['column'] = df['column'].apply(my_function)

Remove specific words string (both prefix and postfix) from a pandas dataframe column

Question

1 answers

solution1
0 2022-01-06 05:42:33

Remove specific words string (both prefix and postfix) from a pandas dataframe column

Question

1 answers

solution1 0 2022-01-06 05:42:33

solution1
0 2022-01-06 05:42:33