[英]Best way to remove specific words from column in pandas dataframe?
[英]Remove specific words string (both prefix and postfix) from a pandas dataframe column
我想保留具有相同 ID 的最新行以及与某些列值匹配的行。
Sample Input:
ID Address
1 PALLABI- F #1st Floor, SEC #10, Pallabi, MIRPUR
2 H#22(2nd floor),Extended Rupnagar Area, Pallabi Mirpur, Dhaka.
3 Uttar khan-F #3rd floor, Kuripara, Dhaka
4 F-1,H-43,Chalabon,D.khan, Uttarkhan
字首
ID 1有一个词PALLABI-我想删除该部分。 ID 3也是如此,应该删除Uttar khan- 。 删除Uttar khan或PALLABI部分后,仅当前缀不包含任何这些单词时,才应添加到字符串的后缀。
后缀
另一部分是删除字符串末尾的Dhaka 。
Output:
ID Address
1 F #1st Floor, SEC #10, Pallabi, MIRPUR
2 H#22(2nd floor),Extended Rupnagar Area, Pallabi Mirpur
3 F #3rd floor, Kuripara, Uttar khan
4 F-1,H-43,Chalabon,D.khan, Uttarkhan
提前致谢。
您可以将 function 应用于该列;
def my_function(string):
# if your word is the first in the string
if 'your_word' == string.split()[0]:
# replace it with your requirement (it can be nothing (''))
string = string.replace('what/you/need/to/replace', 'replacement')
return string
df['column'] = df['column'].apply(my_function)
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.